Close
Site Reliability Engineer (USA Only - 100% Remote)
USA - Remote (Remote)$140k–$210kfulltimemidAdded 2 days ago
About this role
Close seeks a Site Reliability Engineer to manage mission-critical infrastructure supporting a CRM platform serving 11,000+ customers. You'll automate database lifecycle management, eliminate static credentials, minimize downtime, and own the telemetry and CI/CD systems that power the entire company.
What you'll do
- Automate full lifecycle management of multi-terabyte MongoDB, PostgreSQL, and Elasticsearch databases
- Migrate from static credentials to short-lived, identity-based authentication across services
- Maintain near-zero scheduled downtime through resilient architecture and disaster recovery
- Enhance multi-region failover capabilities and automation
- Manage telemetry infrastructure processing 130TB monthly and CI/CD pipelines deploying production changes in 10 minutes
- Serve as escalation point and expert for mission-critical production systems
What they're looking for
- Kubernetes and AWS (EKS, RDS, ElastiCache, MSK)
- Database administration (MongoDB, PostgreSQL, Elasticsearch, ClickHouse)
- Infrastructure-as-code (Terraform, Ansible)
- CI/CD platforms (GitHub Actions, ArgoCD)
- Observability tools (Grafana, Loki, Tempo, Prometheus/Mimir)
- Networking fundamentals (DNS, HTTP, TCP)
- AI tools for development workflows
- Docker and container orchestration
Benefits
- 100% remote position, USA-based
- Bootstrapped, profitable company with customer-focused culture
- Funding for AI development tools (Claude, Codex, etc.)
- Access to systems with 4+ years of zero scheduled downtime
- Work on open-source infrastructure projects
Opens the official application on the employer’s site. No login required.