Debug DeepSpeed LLM Training Code

I'm training the DeepSpeed R1 671b LLM model (650 GB) and encountering errors during model training. The specific errors are Value Errors and quantization errors. I'm looking for someone who can assist with both debugging the current code and rewriting it as necessary... (Budget: ₹600 - ₹1500 INR, Jobs: GPGPU, Machine Learning (ML), Python, Software Architecture, Statistical Analysis)

May 12, 2025 - 07:36
 0
Debug DeepSpeed LLM Training Code
I'm training the DeepSpeed R1 671b LLM model (650 GB) and encountering errors during model training. The specific errors are Value Errors and quantization errors. I'm looking for someone who can assist with both debugging the current code and rewriting it as necessary... (Budget: ₹600 - ₹1500 INR, Jobs: GPGPU, Machine Learning (ML), Python, Software Architecture, Statistical Analysis)