openai

Software Engineer, Data Infrastructure

San Franciscofulltimemid

About this role

Build and operate large-scale data infrastructure systems powering OpenAI's products and research. Own the full lifecycle of distributed compute, storage, streaming, and orchestration platforms designed to handle exabyte-scale workloads with reliability and security.

What you'll do

Design, build, and maintain distributed compute, storage, and streaming infrastructure systems
Scale data platforms to handle orders of magnitude growth while maintaining reliability and efficiency
Collaborate with product, research, and analytics teams to enable new capabilities and features
Participate in on-call rotation and ensure system reliability in production
Empower engineers with robust data tooling and platforms
Debug and optimize large-scale distributed systems for performance

What they're looking for

Distributed systems and infrastructure engineering (4+ years)
Data infrastructure platforms (Spark, Kafka, Flink, Airflow, Trino, or Iceberg)
Infrastructure tooling (Terraform)
Debugging complex distributed systems
Scalable storage and compute architecture
ML infrastructure and feature engineering
Streaming systems design
Data governance and security

Benefits

Hybrid work model (3 days onsite per week)
San Francisco-based role with relocation assistance
Work on foundational systems powering AI research and products
Collaborate with world-class research and engineering teams
Opportunity to shape next-generation data infrastructure

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.