Senior System Software Engineer - Dynamo-Triton Inference Server

NVIDIA • Santa Clara, CA, United States • Posted May 28, 2026

Location Santa Clara, CA
Job Type Full-time
Category other-general
Posted May 28, 2026
We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server (https://developer.nvidia.com/dynamo-triton) . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building a highly-performant AI inference platform to make design and deployment of new AI models easier and accessible to all users.


What you'll be doing:
+ Develop world-class GPU-accelerated AI inference serving software.
+ Contribute to feature development and drive broad customer adoption.
+ Drive the convergence of the Triton Inference Server and NVIDIA Dynamo stacks to establish a unified, high-performance inference platform. This platform will ensure feature parity and effectively serve both Large La...

Interested in this role?

Click the button below to start your application.

Apply Now