Mistral AI
Site Reliability Engineer - NYC
New YorkfulltimemidAdded 2 days ago
About this role
Mistral is looking for experienced Site Reliability Engineers to enhance the performance and availability of their AI platform and applications. The role involves collaborating with software engineers and AI researchers to improve reliability and scalability while managing production operations.
What you'll do
- Design and maintain scalable, highly available infrastructures
- Ensure system availability and facilitate seamless environment replication
- Troubleshoot issues in production systems and respond to incidents
- Implement monitoring and incident response systems for optimal performance
- Drive infrastructure automation and orchestration improvements
- Document processes for knowledge sharing within the team
What they're looking for
- Master's degree in Computer Science or related field
- 7+ years in DevOps/SRE roles
- Experience with cloud computing and distributed systems
- CI/CD and container orchestration expertise
- Familiarity with infrastructure-as-code tools
- Proficiency in scripting languages like Python or Bash
- Strong analytical and problem-solving abilities
- Excellent communication skills
Benefits
- Dynamic, collaborative work environment
- Opportunity for innovation and creativity
- Work with a diverse, distributed team
- Involvement in open-source projects
- Exposure to high-stakes industries
Opens the official application on the employer’s site. No login required.