Principal Engineer - Site Reliability

A company is looking for a Principal Engineer - Site Reliability. Key Responsibilities Ensure uptime for crucial services and systems through proactive monitoring and fault-tolerant design Design and architect complex, scalable, and reliable systems Develop and implement automation tools to streamline operations and improve efficiency Required Qualifications 8+ years of experience with highly available, fault-tolerant systems at scale Proficiency in Golang or Python Deep understanding of Terraform with real-world experience in infrastructure provisioning Experience with at least one cloud service provider (e.g., AWS, GCP, Azure) Solid experience with Kubernetes and automation tools

Feb 18, 2025 - 12:20
 0
Principal Engineer - Site Reliability
A company is looking for a Principal Engineer - Site Reliability. Key Responsibilities Ensure uptime for crucial services and systems through proactive monitoring and fault-tolerant design Design and architect complex, scalable, and reliable systems Develop and implement automation tools to streamline operations and improve efficiency Required Qualifications 8+ years of experience with highly available, fault-tolerant systems at scale Proficiency in Golang or Python Deep understanding of Terraform with real-world experience in infrastructure provisioning Experience with at least one cloud service provider (e.g., AWS, GCP, Azure) Solid experience with Kubernetes and automation tools