Principal Performance Engineer
A company is looking for a Principal Performance Engineer to lead the performance optimization of its Generative AI technology stack.
Key Responsibilities
Define and implement performance engineering strategies for AI services, LLMs, and RAG pipelines
Analyze and improve LLM inference performance, including latency and resource utilization
Collaborate with infrastructure teams to optimize hardware and software configurations for AI workloads
Required Qualifications
Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred)
10+ years of experience in performance engineering, with a focus on large-scale distributed systems
2+ years of experience with AI/ML technologies
Experience with cloud computing platforms and containerization technologies
Strong programming skills in Python and experience with performance analysis tools