Principal Performance Engineer

A company is looking for a Principal Performance Engineer to lead the performance optimization of its Generative AI technology stack.

Key Responsibilities
- Define and implement performance engineering strategies for AI services, LLMs, and RAG pipelines
- Analyze and improve LLM inference performance, including latency and resource utilization
- Collaborate with infrastructure teams to optimize hardware and software configurations for AI workloads

Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field (Master's preferred)
- 10+ years of experience in performance engineering, focusing on large-scale distributed systems
- 2+ years of experience with AI/ML technologies
- Experience with cloud computing platforms and containerization technologies
- Strong programming skills in Python and experience with performance analysis tools

Mar 28, 2025 - 01:02