ML Ops Engineer

A company is looking for an ML Ops Engineer to architect and implement efficient ML inference pipelines for large language models. Responsibilities: Design and implement high-performance inference pipelines Optimize model serving for throughput, latency, and cost across different workloads Collaborate with research and product teams to integrate inference into real-world applications Qualifications: Deep experience developing and tuning LLM inference frameworks (e.g. vLLM) Experience with cloud infrastructure (AWS, GCP, Azure) and Kubernetes Passion for AI and practical ML systems Experience building, deploying, and operating highly available, scalable, distributed cloud services

May 12, 2025 - 15:10

A company is looking for an ML Ops Engineer to architect and implement efficient ML inference pipelines for large language models. Responsibilities: Design and implement high-performance inference pipelines Optimize model serving for throughput, latency, and cost across different workloads Collaborate with research and product teams to integrate inference into real-world applications Qualifications: Deep experience developing and tuning LLM inference frameworks (e.g. vLLM) Experience with cloud infrastructure (AWS, GCP, Azure) and Kubernetes Passion for AI and practical ML systems Experience building, deploying, and operating highly available, scalable, distributed cloud services