Opportunity Description
Join TR as a Lead AI Support Engineer, focusing on inference optimization and deployment strategy in a hybrid work environment. You will collaborate with industry experts on cutting-edge AI initiatives.
This role requires at least three years of experience deploying AI workloads on cloud platforms. You will focus on optimizing large language models (LLMs), implementing routing strategies, and keeping critical pipelines healthy. You will work closely with various teams to improve AI performance and scalability.
Key Responsibilities:
• Optimize inference workloads through quantization and tuning
• Profile and enhance GPU/CPU performance
• Develop effective AI deployment strategies
• Collaborate with platform teams for scalability
• Create containerized solutions for AI services
Requirements:
• 3+ years of machine learning model deployment experience
...
Interested in this opportunity? Apply now through Expertini.