Skip to main content

SingleStore

Site Reliability Engineer

SeattlemidAdded 2 days ago

About this role

SingleStore is looking for a Site Reliability Engineer to optimize and scale its cloud services across major providers. This role involves automation, monitoring, and collaboration to enhance service performance while addressing operational challenges.

What you'll do

  • Develop automation for infrastructure management
  • Optimize telemetry for customer event identification
  • Collaborate with engineering to enhance cloud service performance
  • Debug live site incidents and perform postmortem analysis
  • Participate in a rotating on-call schedule

What they're looking for

  • 0-2 years SRE experience or recent graduate
  • Infrastructure automation expertise
  • Scripting skills (Python, Bash)
  • Familiarity with Prometheus and Grafana
  • Knowledge of Kubernetes
  • Strong communication and collaboration skills
  • Experience with cloud platforms (AWS, Azure, Google Cloud)
  • Troubleshooting skills for production software
Apply on the employer's site

Opens the official application on the employer’s site. No login required.