Technical Staff for Model Serving
A company is looking for a Member of Technical Staff, Model Serving.
Key Responsibilities
Develop, deploy, and operate AI platforms delivering large language models through API endpoints
Serve optimized LLM models in production with low latency, high throughput, and high availability
Interface with customers to create customized deployments to meet specific needs
Required Qualifications
Experience with serving ML models in production
Experience designing, implementing, and maintaining a production service at scale
Strong understanding or working experience with distributed systems
Familiarity with cloud infrastructure (e.g., AWS, GCP)
Experience in Golang or other languages designed for high-performance scalable servers
A company is looking for a Member of Technical Staff, Model Serving.
Key Responsibilities
Develop, deploy, and operate AI platforms delivering large language models through API endpoints
Serve optimized LLM models in production with low latency, high throughput, and high availability
Interface with customers to create customized deployments to meet specific needs
Required Qualifications
Experience with serving ML models in production
Experience designing, implementing, and maintaining a production service at scale
Strong understanding or working experience with distributed systems
Familiarity with cloud infrastructure (e.g., AWS, GCP)
Experience in Golang or other languages designed for high-performance scalable servers