SingleStore
Site Reliability Engineer
Seattle | mid | Added 2 days ago
About this role
SingleStore is looking for a Site Reliability Engineer to optimize and scale its cloud services across major providers. This role involves automation, monitoring, and collaboration to enhance service performance while addressing operational challenges.
What you'll do
- Develop automation for infrastructure management
- Optimize telemetry for customer event identification
- Collaborate with engineering to enhance cloud service performance
- Debug live site incidents and perform postmortem analysis
- Participate in a rotating on-call schedule
What they're looking for
- 0-2 years of SRE experience, or a recent graduate
- Infrastructure automation expertise
- Scripting skills (Python, Bash)
- Familiarity with Prometheus and Grafana
- Knowledge of Kubernetes
- Strong communication and collaboration skills
- Experience with cloud platforms (AWS, Azure, Google Cloud)
- Troubleshooting skills for production software