Skip to main content

Bjak

Site Reliability Engineer - Insurance Platform (Remote, China)

China (Remote)fulltimemidAdded 2 days ago

About this role

BJAK seeks a Site Reliability Engineer based in China to ensure stability, scalability and resilience of their insurance automation platform. You'll bridge software engineering and infrastructure operations, collaborating remotely with Malaysia-based teams to keep business-critical systems running reliably at scale.

What you'll do

  • Own reliability and operational stability of production insurance systems
  • Design and improve monitoring, alerting, logging and observability across services
  • Lead incident response, troubleshooting and root cause analysis
  • Improve system resilience through redundancy, failover and recovery strategies
  • Enhance deployment safety via CI/CD pipelines, release strategies and automation
  • Manage and optimize cloud infrastructure supporting critical workflows

What they're looking for

  • Site Reliability Engineering or DevOps experience
  • Distributed systems and cloud infrastructure knowledge
  • Monitoring and observability tools expertise
  • Production incident troubleshooting and debugging
  • CI/CD pipelines and deployment automation
  • Kubernetes, Docker or container orchestration
  • Infrastructure-as-code tools (Terraform, Ansible)
  • AWS, GCP or Azure cloud platforms
Apply on the employer's site

Opens the official application on the employer’s site. No login required.