What We DoĀ
At the SEI AI Division, we conduct research in applied artificial intelligence and the engineering challenges related to building, deploying, and sustaining AI-enabled systems for high-impact government missions.Ā
Ā
TheĀ Frontier LabĀ advances AI engineering and transitions frontier AI capabilities to government stakeholders through applied research, rapid prototyping, short-cycle TEVV, and technical advisory.Ā
Ā
Position SummaryĀ
As a Senior Machine Learning Research Scientist in the Frontier Lab, you will serve as a senior individual contributor and technical leader, shaping and executing applied research and prototype capability development for government andĀ DoWĀ missions.Ā This role spans the research-engineering spectrum: someĀ SRĀ MLRS hires may lean more research-heavy and others more engineering-heavy, but successful candidates collaborate effectively across both.Ā
You willĀ operateĀ with high autonomy, represent technical work with customers and stakeholders, and help guide Frontier Lab research directionāwhileĀ remainingĀ hands-on in development, evaluation, and delivery. Your work may span Frontier Lab focus areas such as:Ā
Agentic AI for mission workflows (e.g., planning, analysis, decision support) where autonomous and human-guided agents interact with tools, data systems, and operators.Ā
AI test, evaluation, verification, and validation (TEVV) to improve confidence in performance, robustness, uncertainty, and trustworthiness of ML-enabled systems.Ā
Mission-tailored language models, including techniques to improve accuracy and reliability, reduce hallucinations, and integrate structured knowledge for operational tasks.Ā
Mission modalities and multimodal learning, including sensor fusion and learning under noisy, sparse, or constrained data conditions (including synthetic data and weakly-/self-supervised approaches).Ā
AI at the tactical edge, enabling capability under constrained compute/connectivity through efficient inference, compression, rapid adaptation, and update/redeploy patterns.Ā
Ā
Key Responsibilities / DutiesĀ
Senior MLRS staff are expected toĀ operateĀ with a high degree of autonomy and technical ownership whileĀ remainingĀ hands-on in development, evaluation, and delivery.Ā
Mission-context execution: Execute work within the operational contextāunderstanding users, workflows, constraints, success criteria, and outcomesāso technical decisions are grounded in real mission needs.Ā
Technical leadership / Tech lead: Lead technical execution by defining technical tasking, sequencing work into realistic milestones,Ā maintainingĀ delivery quality, and delegating appropriately across the team.Ā
Applied research and prototyping: Design and run studies, build convincingĀ prototypesĀ and reference implementations, and produce evidence-backed insights that can be matured and transitioned into operational settings.Ā
Evaluation, assurance, and evidence:Ā EstablishĀ credible evaluation strategies and test pipelines that assess performance, robustness, reliability, and trustworthiness in mission-representative scenarios.Ā
Customer-facing technical ownership: Serve as the primary technical interface whenĀ appropriate; translate mission goals into measurable technical outcomes; communicate progress, decisions, and risks clearly to stakeholders.Ā
Mentorship and talent development: Proactively mentor junior staff and teammates, raising the bar for research rigor, engineering practice, and delivery habits across project teams.Ā
State-of-the-artĀ awareness and agenda shaping:Ā MaintainĀ strong awareness of frontier developments aligned to the Frontier Lab, share insights with the lab, and help shape research directions and future work selection.Ā
Self-direction and time management: Manage multiple priorities effectively, sustain steady execution cadence, and resolve blockers with minimal oversight.Ā
Community building (internal and external): Build a strong research culture through internal talks, reading groups, and workshops; and engage with external AI/ML communities (professional societies, consortiums, working groups, and conferences) to strengthen collaboration pathways and keep the lab connected to emerging practice.Ā
RequirementsĀ
Education / ExperienceĀ Ā
BSĀ in Computer Science, Electrical Engineering, Statistics, or related field withĀ 10 yearsĀ of relevant experience; OR MSĀ withĀ 8 yearsĀ of relevant experience;Ā OR PhDĀ withĀ 5 yearsĀ of relevant experience.Ā
DeepĀ expertiseĀ in one or more Frontier Lab-aligned areas (agentic systems, LLM reliability/evaluation, CV evaluation, robustness/assurance, TEVV pipelines, multimodal learning, edge ML).Ā
Strong engineering capabilityĀ ā canĀ build andĀ maintainĀ high-quality prototypes, evaluation infrastructure, and repeatable experimentation workflows.Ā
Strong written and verbal communication skills; able toĀ representĀ technical work credibly to senior stakeholders.Ā
Demonstrated ability to lead technical workstreams and coordinate multi-person execution.Ā
Ā
Knowledge, Skills, & Abilities (KSAs)Ā
Technical judgment:Ā Makes sound architectural and methodological decisions; balances ambition with mission constraints.Ā
Customer translation:Ā Converts mission needs into tractable technical plans, measurable success criteria, and credible evaluation evidence.Ā
Scientific leadership:Ā MaintainsĀ rigor;Ā identifiesĀ flawed assumptions; improves evaluation quality and research practices.Ā
Mentorship & influence:Ā Elevates team performance through hands-on guidance and strong technical standards.Ā
Initiative:Ā ProactivelyĀ identifiesĀ risks/opportunities, proposes new work, and creates alignment without directive management.Ā
Self-direction and time management: Plans work effectively under ambiguity,Ā maintainsĀ execution cadence, and escalates risks early.Ā
Ā
Desired ExperienceĀ
Leading applied research projects resulting inĀ effectiveĀ prototypes, mission-relevant evaluation outcomes, or transitioned methods.Ā
Publications at strong venues (e.g.,Ā NeurIPSĀ / ICLR / ICML, relevant workshops, MLCON), and/or demonstrable impact through applied research artifacts (benchmarks, evaluation suites, open-source, technical reports).Ā
Designing and operating TEVV efforts including evaluation pipelines, robustness analysis, calibration/uncertainty work, regression suites, and scenario-based evaluation protocols.Ā
Building agentic capabilities integrated with tools, data systems, and human workflows (decision support, planning, analytic contexts).Ā
Experience with secure or operational environments and delivery constraints typical of government settings.Ā
Experience shaping a technical roadmap or research portfolio aligned to sponsor priorities and lab strategy.Ā
Ā
Other RequirementsĀ
Flexible to travel to SEI offices inĀ Pittsburgh, PAĀ andĀ Washington, DC / Arlington, VA, sponsor sites, conferences, and offsite meetings (~10% travel).Ā
ĀYou must be able and willing to work onsite at an SEI office in Pittsburgh, PA or Arlington, VA 5 days per week.
You will be subject to a background investigation and must be eligible to obtain andĀ maintainĀ a Department of War)Ā security clearance.
Ā
Location
Arlington, VA, Pittsburgh, PAJob Function
Software/Applications Development/EngineeringPosition Type
Staff ā RegularFull time/Part time
Full timePay Basis
SalaryMore Information:Ā
Please visit āWhy Carnegie Mellonā to learn more about becoming part of an institution inspiring innovations that change the world.Ā
Click here to view a listing of employee benefits
Carnegie Mellon University is an Equal Opportunity Employer/Disability/Veteran.Ā
Statement of Assurance
Learn more about this Employer on their Career Site
