Mistral AI
Research Engineer, Machine Learning
Palo Alto | Full-time | Mid-level | Added today
About this role
Mistral AI seeks a Research Engineer to build and optimize the large-scale machine learning systems that power its open-weight models. You'll work at the intersection of cutting-edge research and production, either enhancing shared training infrastructure or embedding within research squads to translate novel ideas into scalable code.
What you'll do
- Develop and optimize large-scale ML training pipelines and distributed learning systems
- Build robust tools and infrastructure to accelerate researcher productivity
- Integrate research checkpoints into production, streamline evaluation, and expose APIs
- Design and benchmark deep learning algorithms in large-scale experiments (70B+ models, thousands of GPUs)
- Convert research prototypes into production-grade components for consumer and enterprise products
- Write efficient, well-tested Python code with strong software engineering practices
What they're looking for
- Python programming
- PyTorch, JAX, or TensorFlow
- Distributed training frameworks and cluster orchestration (DeepSpeed, FSDP, SLURM, Kubernetes)
- Deep learning and NLP/LLM experience
- Software design and code quality (testing, CI/CD)
- CUDA or data pipeline optimization (bonus)
- Large-scale ML system architecture
- Collaborative problem-solving
Benefits
- Competitive salary and equity
- Medical, dental, and vision coverage for employee and family
- 401(k) with 6% matching
- 18 days paid time off
- Office parking reimbursement