Machine Learning Kernel Developer
A company is looking for a Machine Learning Kernel Development Engineer to design and optimize low-level machine learning kernels for GPUs. Key Responsibilities Design and implement optimized ML kernels for AMD GPUs using ROCm Profile, debug, and tune kernel performance for AI workloads Collaborate with ML researchers to integrate kernels into AI frameworks Required Qualifications 2+ years of experience in GPU kernel development for machine learning (ROCm or CUDA) Proficiency in C/C++ and Python for performance-critical programming Strong understanding of ML frameworks like PyTorch and TensorFlow Basic knowledge of modern AI technologies such as LLMs and inference optimization Familiarity with parallel computing and hardware architectures
