Mistral AI
Research Engineer, Data Infrastructure
Palo AltofulltimemidAdded today
About this role
Mistral AI seeks a Research Engineer to design and operate next-generation data infrastructure supporting massive compute and storage systems. You'll architect multi-cluster orchestration, modernize storage formats, and build platforms enabling secure, scalable model training and fine-tuning operations.
What you'll do
- Design and scale distributed compute and storage systems for high-performance training workloads
- Architect multi-cluster orchestration layers across diverse hardware and regions
- Transition storage infrastructure to modern formats supporting exabyte-scale datasets
- Develop internal training platform capabilities across Kubernetes and SLURM environments
- Implement data lineage and metadata management systems for complex pipelines
- Manage cloud-native deployments with modern workflows and on-call support for critical jobs
What they're looking for
- Data infrastructure and MLOps engineering
- Python programming
- Kubernetes and container orchestration
- Distributed systems debugging
- Cloud-native deployment workflows
- Columnar storage formats and data lake optimization
- Multi-cluster infrastructure management
- Large-scale system design and operations
Benefits
- Competitive salary and equity
- Medical, dental, and vision coverage for employee and family
- 401K pension plan
Opens the official application on the employer’s site. No login required.