Principal Software Engineer - Performance

Microsoft Corporation • Mountain View, CA, United States • Posted May 28, 2026

Location Mountain View, CA
Job Type Full-time
Category other-general
Posted May 28, 2026
**Overview**

The Artificial Intelligence Cloud Inference team at Microsoft develops AI software that enables running AI models everywhere, from world’s fastest AI supercomputers, to servers, desktops, mobile phones, IoT devices and internet browsers. We collaborate with our hardware teams and partners, both internal and external, and operate at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a team of highly capable and motivated people that pride themselves on a collaborative and inclusive culture.

We own inference performance of OpenAI and other state of the art LLM models and work directly with OpenAI on the models hosted on the Azure OpenAI service serving some of the largest workloads on the planet with trillions of inferences per day in major Microsoft products, including Office, Windows, Bing, SQL Server, and Dynamics.

As a Principal Software Engineer - Performance on the team, you will have the o...

Interested in this role?

Click the button below to start your application.

Apply Now