Bjak
Site Reliability Engineer - Insurance Platform (Remote, China)
China (Remote)fulltimemidAdded 2 days ago
About this role
BJAK seeks a Site Reliability Engineer based in China to ensure stability, scalability and resilience of their insurance automation platform. You'll bridge software engineering and infrastructure operations, collaborating remotely with Malaysia-based teams to keep business-critical systems running reliably at scale.
What you'll do
- Own reliability and operational stability of production insurance systems
- Design and improve monitoring, alerting, logging and observability across services
- Lead incident response, troubleshooting and root cause analysis
- Improve system resilience through redundancy, failover and recovery strategies
- Enhance deployment safety via CI/CD pipelines, release strategies and automation
- Manage and optimize cloud infrastructure supporting critical workflows
What they're looking for
- Site Reliability Engineering or DevOps experience
- Distributed systems and cloud infrastructure knowledge
- Monitoring and observability tools expertise
- Production incident troubleshooting and debugging
- CI/CD pipelines and deployment automation
- Kubernetes, Docker or container orchestration
- Infrastructure-as-code tools (Terraform, Ansible)
- AWS, GCP or Azure cloud platforms
Opens the official application on the employer’s site. No login required.