Opportunity Description
Transform AI inference systems as a Senior Software Engineer at NVIDIA. Focus on optimizing GPU performance and scaling workloads across innovative environments.
In this senior-level role, you will optimize the vLLM inference framework and develop GPU kernels for multi-GPU and multi-cloud environments. Your contributions will set performance benchmarks, and you will collaborate with diverse teams on cutting-edge AI projects.
Key Responsibilities:
• Architect and implement efficient inference stacks for AI
• Optimize GPU kernels through fusion and autotuning
• Contribute to the MLPerf Inference benchmarking suite
• Partner on container orchestration for AI workloads
• Pursue research to enhance performance in ML systems
Requirements:
• Master’s degree in a relevant field with 5+ years of experience
• Proficient in Python and C/C++ programming languages
• Strong foundation in distributed systems and deep learning
• Experience with container technologies such as Docker
...
Interested in this opportunity? Apply now through Expertini.