openai
Software Engineer, Data Infrastructure
San Franciscofulltimemid
About this role
Build and operate large-scale data infrastructure systems powering OpenAI's products and research. Own the full lifecycle of distributed compute, storage, streaming, and orchestration platforms designed to handle exabyte-scale workloads with reliability and security.
What you'll do
- Design, build, and maintain distributed compute, storage, and streaming infrastructure systems
- Scale data platforms to handle orders of magnitude growth while maintaining reliability and efficiency
- Collaborate with product, research, and analytics teams to enable new capabilities and features
- Participate in on-call rotation and ensure system reliability in production
- Empower engineers with robust data tooling and platforms
- Debug and optimize large-scale distributed systems for performance
What they're looking for
- Distributed systems and infrastructure engineering (4+ years)
- Data infrastructure platforms (Spark, Kafka, Flink, Airflow, Trino, or Iceberg)
- Infrastructure tooling (Terraform)
- Debugging complex distributed systems
- Scalable storage and compute architecture
- ML infrastructure and feature engineering
- Streaming systems design
- Data governance and security
Benefits
- Hybrid work model (3 days onsite per week)
- San Francisco-based role with relocation assistance
- Work on foundational systems powering AI research and products
- Collaborate with world-class research and engineering teams
- Opportunity to shape next-generation data infrastructure
Opens the official application on the employer’s site. No login required.