openai
Software Engineer, Infrastructure - Analytics Platform
San Francisco (Remote)fulltimemid
About this role
OpenAI seeks a systems-focused software engineer to design and operate production-critical infrastructure supporting AI research. You'll own backend systems end-to-end in Rust or C++, focusing on performance, scalability, and distributed systems while partnering with researchers.
What you'll do
- Own critical infrastructure across design, implementation, deployment, and iteration
- Build performant backend systems in Rust or C++ for research workflows
- Design and optimize distributed data and serving systems with focus on consistency, replication, and failure isolation
- Debug production bottlenecks in latency, throughput, and overload behavior
- Operate business-critical services through on-call, incidents, and safe rollouts
- Improve reliability of Kubernetes-hosted services and coordinate with researchers on system delivery
What they're looking for
- Rust or C++ systems programming
- Distributed systems design and operation
- Performance optimization and profiling
- Kubernetes and container orchestration
- Infrastructure-as-code tools (Terraform or equivalent)
- Observability and monitoring (Prometheus or similar)
- Low-level system knowledge (concurrency, async, memory, networking, I/O)
- Analytics infrastructure experience (ClickHouse-like systems preferred)
Benefits
- Hybrid work model (3 days per week in San Francisco office)
- Relocation assistance provided
Opens the official application on the employer’s site. No login required.