Skip to main content

openai

Software Engineer, Infrastructure - Analytics Platform

San Francisco (Remote)fulltimemid

About this role

OpenAI seeks a systems-focused software engineer to design and operate production-critical infrastructure supporting AI research. You'll own backend systems end-to-end in Rust or C++, focusing on performance, scalability, and distributed systems while partnering with researchers.

What you'll do

  • Own critical infrastructure across design, implementation, deployment, and iteration
  • Build performant backend systems in Rust or C++ for research workflows
  • Design and optimize distributed data and serving systems with focus on consistency, replication, and failure isolation
  • Debug production bottlenecks in latency, throughput, and overload behavior
  • Operate business-critical services through on-call, incidents, and safe rollouts
  • Improve reliability of Kubernetes-hosted services and coordinate with researchers on system delivery

What they're looking for

  • Rust or C++ systems programming
  • Distributed systems design and operation
  • Performance optimization and profiling
  • Kubernetes and container orchestration
  • Infrastructure-as-code tools (Terraform or equivalent)
  • Observability and monitoring (Prometheus or similar)
  • Low-level system knowledge (concurrency, async, memory, networking, I/O)
  • Analytics infrastructure experience (ClickHouse-like systems preferred)

Benefits

  • Hybrid work model (3 days per week in San Francisco office)
  • Relocation assistance provided
Apply on the employer's site

Opens the official application on the employer’s site. No login required.