OpenAI

Inference Engineer, Robotics

San Francisco (Remote), full-time, mid-level

About this role

OpenAI seeks a GPU Inference Engineer to optimize model serving and inference performance for its robotics research platform. You'll collaborate with researchers to build efficient serving infrastructure and design inference-friendly models while working with cutting-edge hardware and software systems.

What you'll do

Optimize model serving, inference performance, and system efficiency for robotics applications
Drive kernel-level and data movement optimizations to improve throughput and reliability
Partner with research and product teams to ensure models scale effectively
Design and build critical serving infrastructure for robotics growth
Assist researchers in developing inference-optimized model architectures
Navigate ambiguity and drive complex technical initiatives to completion

What they're looking for

Model performance optimization, especially at the inference layer
Kernel-level systems and low-level performance tuning
GPU optimization and data movement
Model serving infrastructure design
Multimodal AI workload scaling
Systems engineering and debugging
Technical leadership and initiative ownership
Python or C++ (implied)

Benefits

Hybrid work model (3 days per week in office)
San Francisco location with relocation assistance
Work on AGI-level robotics research
Collaborate with world-class AI researchers
Equal opportunity employer with comprehensive non-discrimination policy
