openevidence
Software Engineer, Site Reliability
MiamifulltimemidAdded today
About this role
Join OpenEvidence's infrastructure team to build and optimize mission-critical systems supporting a widely-used medical AI platform. You'll apply SRE principles to enhance system reliability, observability, and incident response while working across diverse projects with significant real-world healthcare impact.
What you'll do
- Design and harden infrastructure for a medical AI platform used by healthcare providers globally
- Improve system health, performance, and efficiency while reducing operational toil
- Build and strengthen observability practices and define service-level objectives
- Establish and maintain effective on-call processes and incident response procedures
- Own cross-functional projects where you can create maximum impact
- Collaborate with a small engineering team on mission-critical systems
What they're looking for
- Site reliability engineering (SRE) practices
- Infrastructure design and hardening
- System observability and monitoring
- Incident response and on-call management
- Service-level objective (SLO) definition
- Full-stack ownership and autonomous problem-solving
- Performance optimization
- Cloud infrastructure or data platform experience
Benefits
- Work on world-changing medical AI technology impacting hundreds of millions of lives
- Join a high-growth $12B company with exceptional team from top universities
- Autonomous ownership with end-to-end responsibility for critical systems
- Opportunity to work on bleeding-edge infrastructure at scale
- Collaborate with a small, focused 30-person engineering team
Opens the official application on the employer’s site. No login required.