ML Compiler Engineer, AWS Neuron, Annapurna Labs

Amazon Web Services (AWS)

Toronto, ON, Canada Full-time May 30, 2026

Opportunity Description

About the Role

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The Neuron Compiler team builds a deep learning compiler stack that allows state‑of‑the‑art large language, vision, and multi‑modal models created in TensorFlow, PyTorch, and JAX to run efficiently on these accelerators.

Responsibilities

Our performance engineers collaborate across compiler, runtime, and framework teams to optimize machine learning workloads for our global customer base. They:
Analyze and optimize system‑level performance of machine learning models across the entire technology stack, from frameworks to runtime. 
Conduct detailed performance analysis and profiling of ML workloads, identifying and resolving bottlenecks in large‑scale ML systems. 
        

Location Toronto, ON

Country Canada

Type Full-time

Category IT & Technology

Posted May 30, 2026

Deadline July 09, 2026
