Skip to main content

Figure

Helix AI Engineer, Video Pretraining

San Jose, CAmidAdded 2 days ago

About this role

Figure AI is seeking a Helix AI Engineer to develop large-scale video foundation models that learn from raw video data to enable autonomous humanoid robots. The role involves designing pretraining strategies, building efficient data pipelines, and collaborating with cross-functional teams to integrate models into the robotics autonomy stack. This position requires 5 days/week in-office presence in San Jose, CA.

What you'll do

  • Design and train large-scale video foundation models on diverse internet and robot-collected datasets
  • Develop pretraining strategies that capture temporal dynamics and motion interactions from video sequences
  • Build models with transferable representations for perception, tracking, prediction, and control tasks
  • Implement efficient data pipelines and distributed training systems for high-throughput video processing
  • Optimize model performance across compute, memory, and training efficiency constraints
  • Design evaluation frameworks and benchmarks for measuring temporal understanding and generalization

What they're looking for

  • Large-scale video model training and sequential data handling
  • Deep learning architectures for video and vision systems
  • PyTorch and distributed training frameworks
  • Large-scale model pretraining and dataset curation
  • GPU cluster and distributed systems experience
  • Software engineering and scalable systems design
  • Experimental rigor and rapid iteration
  • Video diffusion, autoregressive modeling, or world models (bonus)
Apply on the employer's site

Opens the official application on the employer’s site. No login required.