R

Machine Learning Engineer

Red Hat

toronto, on, Canada Full-time May 27, 2026

Opportunity Description

Job Summary

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open‑source LLMs and vLLM to every enterprise. The Red Hat AI Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers and maintainers of the vLLM project, and inventors of state‑of‑the‑art techniques for model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.

As a Machine Learning Engineer focused on model optimization algorithms, you will work closely with our product and research teams to develop SOTA deep learning software. You will collaborate with our technical and research teams to develop LLM training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you are someone who enjoys bridging research and production, optimizing large models, and contribu...

Full-time Engineering

Interested in this opportunity? Apply now through Expertini.

Apply for this Position