Description
In this role, you will play a critical role shaping the future of our LLM efforts, specifically in transforming our models into highly capable, intelligent assistants that power billions of Apple products. You will tackle core training challenges in instruction following, tool use, deep reasoning, and architectural adaption — designing models that deliver magical, deeply integrated, and privacy-forward experiences across the Apple ecosystem. You will work alongside a fast-growing team of world-class experts to explore novel training strategies, architectural adaptations, and advanced evaluation methodologies.
Minimum Qualifications
Demonstrated expertise in deep learning with a focus on LLMs, post-training, or reinforcement learning, backed by a strong record of academic or real-world accomplishments in these or closely related domains. Proficient programming skills in Python and a major deep learning framework such as JAX or PyTorch. Masters/PhD, or equivalent practical experience, in Computer Science, Machine Learning, or a related technical field.
Preferred Qualifications
Experience training state-of-the-art large models at scale, with familiarity in distributed training challenges and trade-offs. Experience improving model performance on complex reasoning tasks (math, coding, logic). Experience with various transformers architectures and its transformations. Strong communication skills and a passion for working cross-functionally across Research and Product teams.
Learn more about this Employer on their Career Site
