Skip to main content

Moniepoint

Site Reliability Engineer

Remote (Remote)midAdded today

About this role

Moniepoint Inc. is looking for a Site Reliability Engineer to enhance system performance and reliability across its platform. The role involves managing on-call tasks, developing automation, and collaborating with development teams to ensure and improve operational efficiency.

What you'll do

  • Participate in on-call rotations and act as Incident Commander during major incidents.
  • Create and maintain dashboards and alerts for system visibility.
  • Develop automation to reduce manual operational tasks.
  • Implement and track SLIs and SLOs as defined by engineering leadership.
  • Investigate and resolve escalated customer complaints regarding system performance.

What they're looking for

  • 3+ years in an SRE or similar role
  • Proficiency in Java, Go, or Python
  • Understanding of distributed systems and microservices
  • Experience with Kubernetes and cloud services (GCP, AWS, Azure)
  • Ability to set up dashboards using Grafana
  • Knowledge of APM tools like Datadog or New Relic
  • Proficient in SQL for complex queries
  • Familiarity with metrics, logs, and traces

Benefits

  • Employee-first culture emphasizing well-being
  • Learning and development opportunities
  • Attractive salary and compensation package
  • Health insurance and annual bonuses
  • [unknown]
Apply on the employer's site

Opens the official application on the employer’s site. No login required.