Principal Software Engineering Manager - Substrate Efficiency

Microsoft
Posted 3 days ago, valid for 7 days
Location

Redmond, WA 98073, US

Salary

$142,800 - $304,200 per year

Contract type

Full Time

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.


Sonic Summary

  • Microsoft is seeking a Principal Software Engineering Manager for its M365 Copilot inference team, which focuses on applied AI and large-scale machine learning.
  • The role requires a Bachelor's Degree in Computer Science or a related field, along with 6+ years of technical engineering experience in programming languages such as C, C++, C#, Java, JavaScript, or Python.
  • The position is based in Redmond, WA, and offers a salary range of USD $142,800 - $274,800 per year, with higher compensation available in certain locations like San Francisco and New York City.
  • Key responsibilities include leading a high-performing engineering team, optimizing GPU throughput, and collaborating with various teams to enhance performance and efficiency.
  • Candidates should have strong experience in improving system performance and resource utilization, with a preference for those who have 4+ years of people management experience.

Overview

M365 Copilot inference is a high-impact engineering team advancing applied AI and large-scale machine learning across Microsoft. We design and operate the platform powering Microsoft 365 Copilot experiences, delivering intelligent capabilities to millions of users.

Our team owns one of the world’s largest AI inference platforms, operating at massive GPU (Graphics Processing Unit) scale across global datacenters. We build the core LLM (large language model) API (Application Programming Interface) and routing services that enable low-latency, highly available AI experiences, and we continuously push the boundaries of performance, scalability, and efficiency.

As a Principal Software Engineering Manager, you will lead a strategic initiative focused on maximizing throughput per GPU across the Copilot inference stack. You will drive inference engine efficiency by optimizing model execution and runtime performance, improving throughput per GPU, reducing cost per query, and unlocking capacity without additional hardware investment.

This role is based out of Redmond, WA and employees are expected to work from a designated Microsoft office at least three days a week.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.



Responsibilities
  • Build and lead a high-performing engineering team focused on inference runtime efficiency and model execution performance.
  • Define and drive strategy to improve throughput per GPU through runtime optimizations.
  • Increase engineering agility, enabling faster experimentation, iteration, and rollout of performance improvements.
  • Partner across M365 Core, AI Core, Azure, and Microsoft Research to co-design and productionize advanced inference optimizations.
  • Establish metrics, telemetry, and experimentation frameworks to measure efficiency gains and guide investment decisions.
  • Own live-site performance, reliability, and operational excellence for inference engines at scale.
  • Drive alignment across partner teams on engine interfaces, performance goals, and optimization priorities.


Qualifications

Required Qualifications:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR equivalent experience.
 

Other Requirements:

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
    • OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python 
    • OR equivalent experience.
  • 4+ years people management experience.
  • Experience leading engineering teams building backend or distributed systems.
  • Hands-on experience improving system throughput, performance, and resource utilization across large-scale infrastructure.
  • Systems thinking, with the ability to identify and optimize bottlenecks across execution, scaling, and resource management.
  • Experience driving system-level improvements in areas such as workload execution, scheduling, batching, or infrastructure efficiency.
  • Experience with developing AI/ML inference systems or GPU-based workloads.
  • Familiarity with inference or training runtime optimization techniques.
  • Experience improving throughput per resource (e.g., cost per query) in large-scale systems.
  • Able to translate technical insights into clear engineering priorities and execution plans.
  • Comfortable collaborating across teams to align on goals and execution.


Software Engineering M5 - The typical base pay range for this role across the U.S. is USD $142,800 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay


This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.



Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.



