Responsibilities
- Develop the PT2 compiler (e.g., TorchDynamo, TorchInductor, PyTorch Distributed, PyTorch Core)
- Improve PyTorch performance via systematic solutions for the entire community
- Explore the intersection of the PyTorch compiler and PyTorch distributed
- Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference)
- Collaborate with users of PyTorch to enable new use cases of PT2 technologies both inside and outside Meta
Minimum Qualifications
- Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
- Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
- Research or industry experience in developing compilers, ML systems, ML accelerators, GPU performance, and similar
- Advanced in Python or C++ programming
Preferred Qualifications
- Experience in developing PyTorch/PT2, Triton, MLIR, JAX, XLA, TVM is a huge plus
- Knowledge in GPU architecture, ML accelerator performance, and developing high-performance kernels
- Experience in building OSS communities and extensive social media presence in the ML Sys domain
- Experience with training models, end-to-end model optimizations, or applying ML to systems
- Knowledge of communication collectives, PyTorch distributed, and parallelism
- Experience in developing inside other ML frameworks like Caffe2, TensorFlow, ONNX, TensorRT
- First-authored publications at peer-reviewed conferences (e.g. NeurIPS, MLSys, ASPLOS, PLDI, ICML, or similar)
$58.65/hour to $181,000/year + bonus + equity + benefits
Learn more about this Employer on their Career Site
