Skip to main content

openai

Software Engineer, Compute Infrastructure

San Francisco (Remote)fulltimemid

About this role

OpenAI is seeking software engineers to build the foundational compute platform powering frontier AI research and products. You'll work across the full infrastructure stack—from hardware and networking to scheduling and developer tools—optimizing systems that handle massive-scale AI workloads.

What you'll do

  • Design and optimize system software for large-scale compute clusters running demanding AI workloads
  • Build automation for hardware provisioning, firmware upgrades, and incident response across data centers
  • Profile and optimize training workloads using benchmarks and performance traces across compute, memory, storage, and networking
  • Develop high-performance networking fabrics, protocols, and observability for distributed training and serving
  • Create reliable scheduling, orchestration, and fleet health systems across heterogeneous hardware and providers
  • Build infrastructure abstractions and developer tools that simplify using enormous compute systems

What they're looking for

  • Systems programming and performance optimization
  • Distributed systems and cluster orchestration
  • High-performance computing (HPC) concepts
  • Networking protocols and RDMA
  • GPU hardware and accelerator experience
  • Kubernetes or container orchestration
  • Benchmarking and profiling tools
  • Hardware automation and provisioning
Apply on the employer's site

Opens the official application on the employer’s site. No login required.