Senior Site Reliability Engineer: Platform as a Service for On-Premise Cloud (Hybrid)
JOB SCOPE
As a Senior Site Reliability Engineer in Platform as a Service for on-premise cloud, you will be responsible for ensuring the reliability, availability, and scalability of our PaaS infrastructure, IaaS platforms, cloud management platform, CI/CD pipeline, automation, and tooling. You will work closely with our development, operations, and security teams to design, implement, and maintain a highly available and secure PaaS platform and managed services systems. Your primary focus will be on building and maintaining the infrastructure necessary to support our PaaS offerings and managed services tools, ensuring high availability, reliability, and performance of our services. You will be responsible for ensuring that our PaaS platforms, cloud management platform, and other managed services meet the needs of our internal and external stakeholders.
DUTIES AND RESPONSIBILITIES
- Design, implement, and maintain a highly available and secure PaaS infrastructure
- Design, implement, and maintain a cloud management platform, CI/CD Pipeline, automation, and managed services tools
- Collaborate with development, operations, and security teams to design and implement highly available, reliable, and scalable systems and services related to PaaS and IaaS infrastructures
- Automate deployment, monitoring, and management of PaaS and IaaS services
- Ensure that our PaaS and IaaS platforms meet the needs of our customers, including internal and external stakeholders
- Develop and implement processes for incident management, problem management, and change management in alignment with Charter Incident and Change Management Policies
- Continuously monitor and analyze the performance of PaaS and IaaS services to identify and resolve issues proactively
- Develop and implement disaster recovery plans and procedures
- Participate in capacity planning and performance optimization efforts
- Mentor junior engineers and provide technical leadership to the team
- Participate in an on-call rotation to ensure 24x7 support of Cloud Services
- Perform other duties as requested.
QUALIFICATIONS
- Bachelor's degree in Computer Science, Engineering or related field, and/or equivalent work experience
- Minimum of six (6) years of experience in site reliability engineering, systems engineering, or software engineering
- Minimum of five (5) years of experience designing and operating PaaS infrastructure on on-premise cloud
- Minimum of five (5) years of experience with Kubernetes, Docker, Rancher, and related container technologies
- Minimum of five (5) years of experience with infrastructure as code tools such as Morpheus, Terraform, Ansible, or Chef
- Minimum of five (5) years of experience with IaaS platforms and related technology for virtualization, compute, storage, and network
- Minimum of five (5) years of experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, and Splunk
- Minimum of five (5) years of experience scripting or development with languages such as Python, Ruby, or Bash
- Knowledge of PaaS, IaaS, Kubernetes, Docker, Rancher, Git, Repository Management, Server Compute, SAN Storage, Virtualization, IP Networks, Data Center Operations, Linux and Windows Systems Administration
- Ability to handle multiple projects and tasks
- Ability to mentor junior engineers and lead technical teams and programs
- Strong decision making and problem-solving skills while working under pressure
- Strong communication and collaboration skills
- Ability to use personal computer and software applications
- Knowledge of all FCC compliance reports and other rules and regulations
- Knowledge of Cable Television or related technologies
- Experience working in a DevOps or Site Reliability Engineering role
- Experience with Kubernetes and Docker or other similar container technologies
- Experience with Infrastructure as Code, scripting and development
- Experience with virtualization platforms such as VMware, Nutanix or OpenStack
- Experience with Public Cloud providers such as AWS, Google, or Azure
- Experience with Unix/Linux or Windows systems administration
- Experience with Compute in a Cloud Environment using Rack Mount and Blade Servers
- Experience with Storage in a Cloud Environment using SAN, HCI, or Software Defined block, file, and object
- Certifications in Virtualization, Kubernetes, Docker, Containers, Compute, Storage, Networking, Public Cloud and Operating System technologies.
Here, employees don’t just have jobs, they build careers. That’s why we believe in offering a comprehensive pay and benefits package that rewards employees for their contributions to our success, supports all aspects of their well-being, and delivers real value at every stage of life.
The pay for this position has a salary range of $88,200.00 to $156,600.00. The actual salary offer will carefully consider a wide range of factors, including your skills, qualifications, experience and location. Also, certain positions are eligible for additional forms of compensation such as bonuses.
Get to Know Us
Charter Communications is known in the United States by our Spectrum brands, including: Spectrum Internet®, TV, Mobile and Voice, Spectrum Networks, Spectrum Enterprise and Spectrum Reach. When you join us, you’re joining a strong community of more than 93,000 individuals working together to serve more than 32 million customers in 41 states and keep them connected to what matters most.
Who You Are Matters Here
We’re committed to growing a workforce that reflects our communities, and providing equal opportunities for employment and advancement. EOE, including disability/vets.