Senior Cloud Site Reliability Engineer
A company is looking for a Sr. Cloud Site Reliability Engineer.
Key Responsibilities
Develop and refine monitoring and observability tools to validate system availability and performance
Collaborate with development teams to design solutions for higher availability in the cloud, and manage Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
Own the incident response process, proactively identify reliability risks, and conduct postmortems to drive improvements
Required Qualifications
5+ years of experience in Site Reliability Engineering, DevOps, or a similar role
Experience with major cloud providers (e.g., Google Cloud, AWS, Azure) and high-availability systems
Proficiency in Docker, Kubernetes, or similar containerization/orchestration platforms
Hands-on experience with observability tools such as Prometheus, Grafana, or Datadog
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)