Platform Engineer for AI Systems
A company is looking for a Platform Engineer (DevOps for Applied AI Systems).
Key Responsibilities
Design and implement scalable AI model inference environments, optimizing GPU utilization and autoscaling strategies
Develop and maintain infrastructure as code (IaC) using tools like Terraform, Pulumi, or CloudFormation
Monitor system performance and implement failover mechanisms for model-serving reliability
Required Qualifications
5-8+ years of experience in DevOps, Platform Engineering, or Cloud Infrastructure with a focus on AI/ML workloads
3+ years of experience deploying and scaling AI models in production
Expertise in containerization and orchestration tools (Docker, Kubernetes, Nomad)
Strong knowledge of cloud platforms (AWS, GCP, Azure) and GPU-based model deployments
Experience with CI/CD pipelines and infrastructure automation