Skip to main content

openai

Software Engineer, Data Infrastructure

San Franciscofulltimemid

About this role

Build and operate large-scale data infrastructure systems powering OpenAI's products and research. Own the full lifecycle of distributed compute, storage, streaming, and orchestration platforms designed to handle exabyte-scale workloads with reliability and security.

What you'll do

  • Design, build, and maintain distributed compute, storage, and streaming infrastructure systems
  • Scale data platforms to handle orders of magnitude growth while maintaining reliability and efficiency
  • Collaborate with product, research, and analytics teams to enable new capabilities and features
  • Participate in on-call rotation and ensure system reliability in production
  • Empower engineers with robust data tooling and platforms
  • Debug and optimize large-scale distributed systems for performance

What they're looking for

  • Distributed systems and infrastructure engineering (4+ years)
  • Data infrastructure platforms (Spark, Kafka, Flink, Airflow, Trino, or Iceberg)
  • Infrastructure tooling (Terraform)
  • Debugging complex distributed systems
  • Scalable storage and compute architecture
  • ML infrastructure and feature engineering
  • Streaming systems design
  • Data governance and security

Benefits

  • Hybrid work model (3 days onsite per week)
  • San Francisco-based role with relocation assistance
  • Work on foundational systems powering AI research and products
  • Collaborate with world-class research and engineering teams
  • Opportunity to shape next-generation data infrastructure
Apply on the employer's site

Opens the official application on the employer’s site. No login required.