Basata Inc
Site Reliability Engineer
Tempe, ArizonafulltimemidAdded 2 days ago
About this role
Basata seeks its first dedicated Site Reliability Engineer to build and own a reliability practice for an AI-powered healthcare automation platform. You'll define SLOs, establish incident response processes, design scalable infrastructure, and set operational standards as the company grows from serving current clinics to serving many more.
What you'll do
- Define SLOs, build observability systems, and drive initiatives to meet reliability targets
- Establish incident response practices including triage, mitigation, resolution, and blameless postmortems
- Design and evolve infrastructure-as-code, deployment pipelines, and operational tooling
- Reduce operational toil through automation across the platform
- Collaborate with engineers to improve service operability and failure resilience
- Set operational culture and reliability engineering standards for the team
What they're looking for
- Software engineering fundamentals with code proficiency in Java, Python, or TypeScript
- Production systems experience with containerized services and cloud infrastructure
- Observability and incident response expertise
- Infrastructure-as-code and deployment systems knowledge
- Architecture-level reliability design (capacity planning, scaling, failure isolation)
- Calmness and structured judgment under pressure
- Ability to learn unfamiliar codebases and identify production failure modes
Benefits
- Drive real impact in healthcare by improving clinic operations
- High ownership and autonomy in shaping a greenfield reliability function
- Work with a small, fast-moving team on meaningful problems
- Opportunity to influence product decisions and user experience
Opens the official application on the employer’s site. No login required.