Mistral AI
Research Engineer, Machine Learning
Palo Altofull-timemidAdded today
About this role
Mistral AI seeks a Research Engineer to build and optimize large-scale machine learning systems powering our open-weight models. You'll collaborate with Research Scientists either on shared infrastructure and tooling or embedded within research squads, bridging cutting-edge research with production-grade systems.
What you'll do
- Design and optimize large-scale ML training pipelines and distributed systems
- Build robust tools and frameworks that accelerate research scientists' work
- Integrate research checkpoints, streamline evaluation processes, and expose APIs
- Conduct experiments with advanced deep-learning techniques on large GPU clusters
- Develop and benchmark ML algorithms with production-quality code
- Deliver prototypes that transition into production components for products and APIs
What they're looking for
- Large-scale ML system development (4+ years)
- PyTorch, JAX, or TensorFlow
- Distributed training frameworks (DeepSpeed, FSDP, SLURM, Kubernetes)
- Deep learning, NLP, or LLM experience
- Python programming
- Software design and code quality practices
- CUDA (bonus)
- Data pipeline engineering (bonus)
Opens the official application on the employer’s site. No login required.