Skip to main content

Hadrian

Site Reliability Engineer, Robotics

Los Angeles, CA$164k–$270kfulltimemidAdded 2 days ago

About this role

Hadrian seeks a Site Reliability Engineer to ensure the stability and performance of robotics systems powering autonomous manufacturing facilities. You'll design observability infrastructure, build reliability tools, and partner across teams to embed production-grade practices into advanced manufacturing systems.

What you'll do

  • Ensure reliability of robotics systems across PLCs, ROS2 middleware, and Kubernetes infrastructure
  • Build observability interfaces using Prometheus, Telegraf, OpenTelemetry, and Datadog to ingest system telemetry
  • Develop frameworks, diagnostic tools, and shared libraries for controls and robotics systems
  • Define SLOs/SLIs and establish reliability gates with controls, robotics, and platform teams
  • Create automated remediation and self-healing systems to minimize manual intervention
  • Lead incident response and post-mortem analysis for production manufacturing systems

What they're looking for

  • Kubernetes and container orchestration
  • Infrastructure as Code and GitOps workflows
  • Programming in Python, Go, TypeScript, or C++
  • Systems observability and monitoring tools
  • Linux fundamentals and edge infrastructure management
  • ROS/ROS2 or robotics control systems experience
  • Networking and on-premises deployment knowledge
  • Incident management and reliability engineering

Benefits

  • Medical, dental, vision, and life insurance
  • 401(k) retirement plan
  • Equity stake in the company
  • Flexible vacation policy
  • Relocation support in certain situations
Apply on the employer's site

Opens the official application on the employer’s site. No login required.