Platform Engineer for AI Systems

A company is looking for a Platform Engineer (DevOps for Applied AI Systems). Key Responsibilities Design and implement scalable AI model inference environments, optimizing GPU utilization and autoscaling strategies Develop and maintain infrastructure as code (IaC) using tools like Terraform, Pulumi, or CloudFormation Monitor system performance and implement failover mechanisms for model-serving reliability Required Qualifications 5-8+ years of experience in DevOps, Platform Engineering, or Cloud Infrastructure with a focus on AI/ML workloads 3+ years of experience with deploying and scaling AI models in production Expertise in containerization and orchestration tools (Docker, Kubernetes, Nomad) Strong knowledge of cloud platforms (AWS, GCP, Azure) and GPU-based model deployments Experience with CI/CD pipelines and infrastructure automation

Mar 19, 2025 - 15:59
 0
Platform Engineer for AI Systems
A company is looking for a Platform Engineer (DevOps for Applied AI Systems). Key Responsibilities Design and implement scalable AI model inference environments, optimizing GPU utilization and autoscaling strategies Develop and maintain infrastructure as code (IaC) using tools like Terraform, Pulumi, or CloudFormation Monitor system performance and implement failover mechanisms for model-serving reliability Required Qualifications 5-8+ years of experience in DevOps, Platform Engineering, or Cloud Infrastructure with a focus on AI/ML workloads 3+ years of experience with deploying and scaling AI models in production Expertise in containerization and orchestration tools (Docker, Kubernetes, Nomad) Strong knowledge of cloud platforms (AWS, GCP, Azure) and GPU-based model deployments Experience with CI/CD pipelines and infrastructure automation