Skip to main content

FluidStack

Backend Engineer

San Francisco, CA$175k–$300kfulltimemidAdded today

About this role

Fluidstack seeks a Backend Engineer to build core infrastructure for managing tens of thousands of GPUs across hyperscale AI data centers. You'll own observability platforms, control plane systems, and fleet state management that enable the company to operate civilization-scale compute infrastructure reliably and at speed.

What you'll do

  • Design and operate observability platform for real-time fleet telemetry, from site-level health to individual GPU metrics
  • Build stable API surface and control plane for unified machine management across the entire company
  • Develop data pipelines and healthcheck frameworks that serve as single source of truth for fleet state
  • Own infrastructure-as-code for hardware onboarding (ZTP, DHCP, DNS) and new XPU generation integration
  • Design contracts between production systems and internal/customer-facing tools that depend on them
  • Ensure fleet state remains consistent across provisioning, operations, and customer platforms

What they're looking for

  • Backend API and control plane design
  • Large-scale observability and telemetry systems
  • Kubernetes and container orchestration
  • Infrastructure automation and IaC tooling
  • Distributed systems and state management
  • Database design for high-volume metrics
  • Cloud infrastructure and hardware provisioning
  • Systems thinking and first-principles problem solving

Benefits

  • Work on civilization-scale AI compute infrastructure
  • Full ownership of critical systems end-to-end
  • Exposure to hardware, software, and ops across the full stack
  • High velocity environment with minimal bureaucracy
  • Opportunity to shape company-wide infrastructure APIs and standards
  • Based in San Francisco with mission-driven team

Likely interview questions

  • Describe a time you designed an API that had to serve multiple teams with different needs—how did you balance flexibility and stability?
  • Tell us about your experience building observability systems at scale. What metrics mattered most and why?
Apply on the employer's site

Opens the official application on the employer’s site. No login required.