As a Senior Research Engineer at Microsoft, you will advance Microsoft’s mission to empower every person and every organization to achieve more. You will help build and integrate cutting-edge AI into Microsoft products and services within Experience + Devices (E+D) org, ensuring solutions are inclusive, ethical, and impactful. This role blends applied research, machine learning engineering, and product innovation. You will lead efforts to ship reliable, production-grade AI systems across the stack, from model development and fine-tuning to performance optimization and deployment.Â
We are in an era of unprecedented AI innovation. As Microsoft leads the way in foundation models, multimodal systems, and AI agents, our goal is to build an open architecture platform where users can interact with tailored AI agents that drive tangible, real-world outcomes. As a Senior Research Engineer, you will:Â
Bridge the gap between state-of-the-art research and customer-facing featuresÂ
Drive systems-level innovation across models, infrastructure, and deploymentÂ
Champion responsible AI by embedding fairness, safety, privacy, and performance from the ground upÂ
Responsibilities
Bringing State-of-the-Art Research to ProductsÂ
Design and implement AI systems using foundation models, prompt engineering, retrieval-augmented generation, multi-agent architectures, and classic MLÂ
Fine-tune large language models on domain-specific data and evaluate via offline and online methods such as A/B testing, telemetry, and shadow deploymentsÂ
Build and harden prototypes into production-ready services using robust software engineering and MLOps practicesÂ
Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilitiesÂ
Research Translation: Continuously review emerging work; identify high-potential methods and adapt them to Microsoft problem spacesÂ
End-to-End System DevelopmentÂ
ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loopsÂ
Identify and resolve model quality gaps, latency issues, and scale bottlenecks using PyTorch, or TensorFlowÂ
Operate CI/CD and MLOps workflows including model versioning, retraining, evaluation, and monitoringÂ
Integrate AI components into Microsoft products in close partnership with engineering and product teamsÂ
Data-Driven InnovationÂ
Evaluation & Instrumentation: Build robust offline/online evals, experimentation frameworks, and telemetry for model/system performance.Â
Learning Loop Creation: Operationalize continuous learning from user feedback and system signals; close the loop from experimentation to deployment.Â
Experimentation & E2E Validation: Design controlled experiments, analyze results, and drive product/model decisions with data.Â
Develop proofs of concept that validate ideas quickly at realistic scalesÂ
Curate high-signal datasets, including synthetic and red-team corpora, and establish labeling protocols and data quality checks tied to evaluation KPIsÂ
Cross-Functional CollaborationÂ
Partner with software engineers, scientists, designers, and product managers to deliver high-impact AI featuresÂ
Translate research breakthroughs into scalable applications aligned with product prioritiesÂ
Communicate findings and decisions through internal forums, demos, and documentationÂ
Responsible AI & EthicsÂ
Identify and mitigate risks related to fairness, privacy, safety, security, hallucination, and data leakageÂ
Uphold Microsoft’s Responsible AI principles throughout the lifecycleÂ
Contribute to internal policies, auditing practices, and tools for responsible AIÂ
Operating AltitudesÂ
Paper level (ideas and math): Read, critique, and adapt the latest research; identify gaps; design methods with clear trade-offs and guarantees; communicate complex ideas clearly.Â
Example: “This objective is brittle under our data regime. Here is a tighter analysis and a revised loss we can test this sprint.”Â
Code level (implementation): Turn ideas into robust, tested, maintainable modules; integrate with CI/CD; profile and optimize for latency and throughput.Â
Example: “Refactored the prototype into a reusable PyTorch component, added unit tests and benchmarks, and cut P95 inference latency by 30%.”Â
Specialty Technical AreasÂ
Large-scale training and fine-tuning of LLMs, vision-language, or multimodal modelsÂ
Multi-agent systems, dialogue agents, and copilotsÂ
Optimization of inference speed, accuracy, reliability, and cost in productionÂ
Retrieval systems and hybrid architectures using RAG and vector databasesÂ
ML for real-world data constraints such as missing data, noisy labels, and class imbalanceÂ
Qualifications
Required QualificationsÂ
Bachelor’s degree in Computer Science, Engineering, Mathematics, Statistics, Physics, or a related field and 4 or more years in applied ML or AI research and product engineering,
- OR Master’s degree and 3 or more years in applied ML or AI research and product engineering,
- OR PhD in a relevant field and 2 or more years with generative AI, LLMs, or related ML algorithmsÂ
Other Requirements Â
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check:Â This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.Â
Preferred QualificationsÂ
PhD in AI/ML or related field with top-venue publications and/or patentsÂ
Experience with Microsoft’s LLMOps stack: Azure AI Foundry, Azure Machine Learning, Semantic Kernel, Azure OpenAI Service, and Azure AI Search for vector/RAGÂ
Familiarity with responsible AI evaluation frameworks and bias mitigation methodsÂ
Experience across the product lifecycle from ideation to shipping
Proficiency in Python and at least one deep learning framework such as PyTorch, JAX, or TensorFlow
Experience deploying Fine Tuned LLMs or multimodal models in live production environments
Experience shipping and maintaining production AI systemsÂ
Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Learn more about this Employer on their Career Site
