openai
Machine Learning Engineer, Distributed Data Systems - Robotics
San Francisco (Remote)fulltimemid
About this role
OpenAI seeks a Machine Learning Engineer to design and scale distributed data infrastructure for large-scale multimodal training and evaluation in robotics. You'll build robust pipelines, collaborate with researchers, and maintain critical systems supporting OpenAI's rapid iteration cycles.
What you'll do
- Design and build distributed compute, data orchestration, and storage systems
- Scale data platforms reliably while maintaining efficiency and security
- Partner with researchers to translate requirements into production systems
- Harden, optimize, and maintain critical multimodal training infrastructure
- Ensure data pipelines meet scalability and reliability requirements
- Support evaluation systems for robotics and AI research
What they're looking for
- Distributed systems architecture
- Large-scale infrastructure design
- Data pipeline orchestration
- Software engineering fundamentals
- System reliability and hardening
- Distributed storage technologies
- Problem-solving under ambiguity
- Cross-functional collaboration
Benefits
- Hybrid work model (3 days in office per week)
- Based in San Francisco, CA
- Relocation assistance provided
- Work on cutting-edge robotics and AGI research
- Collaborative environment with leading researchers
Opens the official application on the employer’s site. No login required.