BJAK
DevOps Engineer - Platform Reliability (Remote, China)
China (Remote) · Full-time · Mid-level · Added 2 days ago
About this role
BJAK seeks a DevOps Engineer in China to strengthen platform reliability for business-critical AI automation systems supporting insurance operations. You'll own infrastructure stability, CI/CD pipelines, monitoring and incident response while collaborating remotely with Malaysia-based teams to ensure safe, consistent deployments at scale.
What you'll do
- Manage cloud infrastructure, deployment pipelines and runtime environments for production systems
- Design and improve CI/CD workflows to enable safe and repeatable releases
- Build and enhance monitoring, alerting, logging and system observability
- Lead incident response efforts and perform root cause analysis
- Improve system resilience through redundancy, failover and recovery mechanisms
- Strengthen infrastructure security, access control and secrets management
What they're looking for
- DevOps, SRE or platform engineering experience
- Cloud infrastructure (AWS, GCP, Azure)
- CI/CD pipeline and deployment systems
- Production monitoring, alerting and incident management
- Kubernetes, Docker or container orchestration
- Infrastructure-as-code tools (Terraform, Ansible, Pulumi)
- Observability stacks (Prometheus, Grafana, ELK, Datadog)
- Structured troubleshooting and reliability engineering principles