SonicJobs Logo
Left arrow iconBack to search

Robotics Data Pipeline Intern

Persona AI Inc
Posted a month ago, valid for 8 days
Location

Houston, TX 77203, US

Salary

Competitive

Contract type

Full Time

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.

Sonic Summary

info
  • Persona AI is seeking a Robotics Data Pipeline Intern to assist in processing multimodal data for humanoid robots.
  • Candidates should be pursuing a B.S., M.S., or Ph.D. in fields like Computer Science or Robotics, with solid Python skills and exposure to PyTorch.
  • The role involves hands-on work with sensor stream processing, video pipeline optimization, and kinematic retargeting, among other tasks.
  • The internship offers competitive compensation and excellent benefits, with a focus on real work and code contribution rather than shadowing.
  • While specific experience is a plus, genuine curiosity and a willingness to learn are highly valued in this position.

Robotics Data Pipeline Intern – Multimodal Data

About Us At Persona, we're building the next generation of humanoid robots, and that requires an unprecedented volume of high-quality, multimodal data. We're moving beyond basic teleoperation to leverage massive datasets of in-the-wild egocentric video combined with dense sensor streams (IMU, haptics, kinematics, and high-fidelity force profiles). We're looking for a curious, technically sharp intern to roll up their sleeves and help us turn raw, unstructured multimodal data into high-fidelity training assets for our robots.

The Role As a Data Pipeline Intern, you'll work directly alongside our data and robotics engineering teams to support the infrastructure that feeds our foundation models. You'll get hands-on experience with real multimodal data challenges, from sensor stream processing and video pipeline optimization to force analysis and kinematic retargeting. This is not a "fetch coffee and shadow engineers" internship. You'll own real work and ship real code.

What You'll Work On

  • Rebuilding and extending pipelines that ingest and synchronously process egocentric video alongside rich sensor streams (IMU, force-torque, tactile, proprioception)

  • Owning post-processing algorithms for force analysis and hidden state inference, including contact force estimation, occlusion handling, and inverse kinematics gap-filling

  • Bridging kinematic retargeting work that translates human hand tracking into humanoid end-effector coordinates

  • Optimizing and testing data augmentation strategies (spatial, temporal, synthetic viewpoints, sensor noise injection)

  • Tying together work across our Hardware Teleoperation Team to help align human-robot play-data across modalities

What We're Looking For

  • Currently pursuing a B.S., M.S., or Ph.D. in Computer Science, Data Engineering, Machine Learning, Robotics, or a related field

  • Solid Python skills and exposure to PyTorch, particularly around data loading or multimodal datasets

  • Coursework or project experience with computer vision, time-series data, or sensor processing

  • Familiarity with video processing tools (OpenCV, FFmpeg) or pose estimation frameworks (MediaPipe) is a plus

  • Awareness of imitation learning, VLA architectures, or human-to-robot transfer concepts is a plus, but genuine curiosity counts for a lot here

Bonus Points

  • Experience with NVIDIA's robotics stack (Isaac, Cosmos, GR00T)

  • Exposure to distributed computing (Ray, Spark) or simulation environments (Omniverse, MuJoCo)

  • Any project work involving synthetic data generation or tactile/spatial data representations




Learn more about this Employer on their Career Site

Apply now in a few quick clicks

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.