Hadrian
Site Reliability Engineer, Robotics
Los Angeles, CA$164k–$270kfulltimemidAdded 2 days ago
About this role
Hadrian seeks a Site Reliability Engineer to ensure the stability and performance of robotics systems powering autonomous manufacturing facilities. You'll design observability infrastructure, build reliability tools, and partner across teams to embed production-grade practices into advanced manufacturing systems.
What you'll do
- Ensure reliability of robotics systems across PLCs, ROS2 middleware, and Kubernetes infrastructure
- Build observability interfaces using Prometheus, Telegraf, OpenTelemetry, and Datadog to ingest system telemetry
- Develop frameworks, diagnostic tools, and shared libraries for controls and robotics systems
- Define SLOs/SLIs and establish reliability gates with controls, robotics, and platform teams
- Create automated remediation and self-healing systems to minimize manual intervention
- Lead incident response and post-mortem analysis for production manufacturing systems
What they're looking for
- Kubernetes and container orchestration
- Infrastructure as Code and GitOps workflows
- Programming in Python, Go, TypeScript, or C++
- Systems observability and monitoring tools
- Linux fundamentals and edge infrastructure management
- ROS/ROS2 or robotics control systems experience
- Networking and on-premises deployment knowledge
- Incident management and reliability engineering
Benefits
- Medical, dental, vision, and life insurance
- 401(k) retirement plan
- Equity stake in the company
- Flexible vacation policy
- Relocation support in certain situations
Opens the official application on the employer’s site. No login required.