Senior Cloud Site Reliability Engineer

A company is looking for a Sr. Cloud Site Reliability Engineer. Key Responsibilities Develop and refine monitoring and observability tools to validate system availability and performance Collaborate with development teams to design solutions for higher availability in the cloud and manage Service Level Indicators (SLIs) and Objectives (SLOs) Own the incident response process, proactively identify reliability risks, and conduct postmortems to drive improvements Required Qualifications 5+ years of experience in Site Reliability Engineering, DevOps, or a similar role Experience with major cloud providers (e.g., Google Cloud, AWS, Azure) and high-availability systems Proficiency in Docker, Kubernetes, or similar containerization/orchestration platforms Hands-on experience with observability tools such as Prometheus, Grafana, or Datadog Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)

Feb 9, 2025 - 22:10

A company is looking for a Sr. Cloud Site Reliability Engineer. Key Responsibilities Develop and refine monitoring and observability tools to validate system availability and performance Collaborate with development teams to design solutions for higher availability in the cloud and manage Service Level Indicators (SLIs) and Objectives (SLOs) Own the incident response process, proactively identify reliability risks, and conduct postmortems to drive improvements Required Qualifications 5+ years of experience in Site Reliability Engineering, DevOps, or a similar role Experience with major cloud providers (e.g., Google Cloud, AWS, Azure) and high-availability systems Proficiency in Docker, Kubernetes, or similar containerization/orchestration platforms Hands-on experience with observability tools such as Prometheus, Grafana, or Datadog Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)