Moniepoint
Site Reliability Engineer
Remote, mid-level
About this role
Moniepoint Inc. is looking for a Site Reliability Engineer to improve system performance and reliability across its platform. The role involves handling on-call duties, developing automation, and collaborating with development teams to improve operational efficiency.
What you'll do
- Participate in on-call rotations and act as Incident Commander during major incidents.
- Create and maintain dashboards and alerts for system visibility.
- Develop automation to reduce manual operational tasks.
- Implement and track SLIs and SLOs as defined by engineering leadership.
- Investigate and resolve escalated customer complaints regarding system performance.
What they're looking for
- 3+ years in an SRE or similar role
- Proficiency in Java, Go, or Python
- Understanding of distributed systems and microservices
- Experience with Kubernetes and cloud services (GCP, AWS, Azure)
- Ability to set up dashboards using Grafana
- Knowledge of APM tools like Datadog or New Relic
- Proficiency in SQL, including complex queries
- Familiarity with metrics, logs, and traces
Benefits
- Employee-first culture emphasizing well-being
- Learning and development opportunities
- Attractive salary and compensation package
- Health insurance and annual bonuses