openai
Full-Stack Software Engineer, Compute Foundations
San Franciscofulltimemid
About this role
Join OpenAI's Frontier Clusters team to build full-stack web tools that help researchers and operators manage the world's largest supercomputers. You'll create dashboards and systems that provide real-time visibility into cluster health, job failures, and resource optimization for frontier AI model training.
What you'll do
- Build full-stack web applications for monitoring cluster health, job failures, and capacity in real time
- Design data models, APIs, and visualizations for large-scale scheduling and resource allocation
- Develop scalable backend services processing high-volume cluster data with low latency
- Create intuitive frontend workflows connecting users to scheduling and compute systems
- Collaborate with researchers and infrastructure teams to identify and solve operational bottlenecks
- Improve reliability, performance, and security across cluster operations platforms
What they're looking for
- Full-stack web development (React, Vue, or Angular)
- Backend development (Python, Go, or Node.js)
- Distributed systems and APIs
- Cloud infrastructure knowledge
- High-performance web application design
- Kubernetes and Docker (bonus)
- Real-time data processing and visualization
- AI/ML workload scheduling (bonus)
Benefits
- Compensation: $230K–$347K USD
- Work on cutting-edge AI infrastructure at massive scale
- Collaborate with frontier AI researchers
- Fast-paced, highly collaborative environment
- Equal opportunity employer with comprehensive non-discrimination policies
Opens the official application on the employer’s site. No login required.