Close

Site Reliability Engineer (USA Only - 100% Remote)

USA - Remote (Remote) | $140k–$210k | Full-time | Mid-level | Added 2 days ago

About this role

Close seeks a Site Reliability Engineer to manage mission-critical infrastructure supporting a CRM platform serving 11,000+ customers. You'll automate database lifecycle management, eliminate static credentials, minimize downtime, and own the telemetry and CI/CD systems that power the entire company.

What you'll do

  • Automate full lifecycle management of multi-terabyte MongoDB, PostgreSQL, and Elasticsearch databases
  • Migrate from static credentials to short-lived, identity-based authentication across services
  • Maintain near-zero scheduled downtime through resilient architecture and disaster recovery
  • Enhance multi-region failover capabilities and automation
  • Manage telemetry infrastructure that processes 130 TB per month and CI/CD pipelines that deploy production changes in 10 minutes
  • Serve as an escalation point and expert for mission-critical production systems

What they're looking for

  • Kubernetes and AWS (EKS, RDS, ElastiCache, MSK)
  • Database administration (MongoDB, PostgreSQL, Elasticsearch, ClickHouse)
  • Infrastructure-as-code (Terraform, Ansible)
  • CI/CD platforms (GitHub Actions, ArgoCD)
  • Observability tools (Grafana, Loki, Tempo, Prometheus/Mimir)
  • Networking fundamentals (DNS, HTTP, TCP)
  • AI tools for development workflows
  • Docker and container orchestration

Benefits

  • 100% remote position, USA-based
  • Bootstrapped, profitable company with customer-focused culture
  • Funding for AI development tools (Claude, Codex, etc.)
  • Access to systems with 4+ years of zero scheduled downtime
  • Work on open-source infrastructure projects