Adyen
Platfrom Monitoring & Incident Engineer
Chicago; San Francisco$100k–$154.5kmidAdded 2 days ago
About this role
Adyen is seeking a Monitoring Engineer/Incident Manager to oversee platform performance, manage incidents, and drive reliability improvements for a fintech payments platform. You'll work on-call in a follow-the-sun model, coordinate incident response, communicate with merchants, and lead initiatives to strengthen monitoring and observability across the platform.
What you'll do
- Monitor platform and merchant performance in EMEA shift (9 AM–6 PM PDT / 11 AM–8 PM CDT) and detect issues proactively
- Coordinate mitigation, recovery, and resolution of high-impact incidents across teams
- Communicate real-time updates to merchants during incidents and escalate critical issues to leadership
- Analyze incident trends to identify root causes and systemic weaknesses; partner with engineering on preventative fixes
- Investigate alerts and provide feedback to improve logging and alerting across platform architecture
- Lead initiatives to develop automation, improve monitoring tools, and scale detection capabilities
What they're looking for
- Incident management and problem management (5+ years)
- Platform monitoring and observability tools (Prometheus, Grafana, ELK Stack, Datadog, Dynatrace, Splunk)
- Root cause analysis and incident trend analysis
- Technical communication and stakeholder management
- Analytical and problem-solving abilities
- Process definition and standardization
- Cross-team collaboration and project management
- Ability to handle complex situations and multiple responsibilities
Opens the official application on the employer’s site. No login required.