N

Senior Deep Learning Solution Architect

NVIDIA

Beijing, China, China Full-time June 10, 2026

Opportunity Description

NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC , VISUAL, GAMING. Our SA team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analysis the AI and HPC applications to deliver our value to customers, focusing on defining and solving computational challenges in LLM inference and training acceleration, as well as network communication and data transfer optimization.


What You'll Be Doing:
+ Contribute to the development of open-source inference frameworks such as SGLang and vLLM, including feature and operator development, performance optimization, and model support, in collaboration with the community.
+ Develop and optimize KV cache offloading frameworks for LLM workloads, supporting multi-level cache offloading and reuse across CPU, SSD, and remote storage to improve inference efficiency.
+ Drive R&D on compute performance in di...
Full-time other-general

Interested in this opportunity? Apply now through Expertini.

Apply for this Position