SonicJobs Logo
Left arrow iconBack to search

Research Scientist, Reinforcement Learning

Deeproute.ai
Posted a month ago, valid for a month
Location

Fremont, CA 94536, US

Salary

Competitive

Contract type

Full Time

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.

Sonic Summary

info
  • We are seeking a candidate to develop next-generation autonomous driving systems using reinforcement learning (RL).
  • The role requires a minimum of 3 years of experience and offers a competitive salary of $120,000 per year.
  • Key responsibilities include training RL policies in closed-loop environments and optimizing reward functions for complex driving behaviors.
  • Candidates should possess proficiency in modern RL algorithms, Python, and deep learning frameworks like PyTorch, along with experience in distributed training frameworks.
  • Preferred qualifications include publications in top-tier conferences and contributions to open-source RL libraries or autonomous driving projects.

We are building next-generation end-to-end autonomous driving systems powered by reinforcement learning.

You will work on applying RL in closed-loop, safety-critical environments, leveraging large-scale simulation and real-world driving data to improve safety, comfort, and robustness.

  • Train and deploy RL policies in closed-loop driving environments
  • Scale RL training using massively parallel simulation systems
  • Design and optimize reward functions for complex driving behaviors
  • Improve sim-to-real transfer for real-world robustness
  • Collaborate with cross-functional teams to integrate models into production systems

Core Technical Skills

  • Proficiency in modern RL algorithms: DQN, PPO, SAC, TD3, etc.
  • Proficiency in modern RLHF algorithms: PPO, DPO, GRPO, etc.
  • Hands-on experience training reward models and finetuning LLM/VLM/VLA
  • Knowledge of distributed RL training at scale
  • Proficiency with massively parallel simulation environments
  • Knowledge of sim-to-real transfer techniques and domain randomization
  • Proficiency in Python, comfortable with C++
  • Proficiency in deep learning frameworks such as PyTorch
  • Experience with distributed training frameworks (Ray, Horovod, etc.)
  • Knowledge of model optimization (quantization, pruning) and CUDA is a plus
  • Knowledge of traffic rules, driving behavior modeling

Preferred Qualifications

  • Publications in top-tier venues (ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV, ICRA, IROS, etc.)
  • Open-source contributions to RL libraries or autonomous driving projects
  • Previous experience with LLM fine-tuning using RLHF
  • Knowledge of safe RL, interpretable AI, or robustness techniques
  • Familiarity with autonomous vehicle regulations and safety standards



Learn more about this Employer on their Career Site

Apply now in a few quick clicks

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.