Skip to main content

Mistral AI

Research Engineer, Machine Learning

Palo AltofulltimemidAdded today

About this role

Mistral AI seeks a Research Engineer to build and optimize large-scale machine learning systems powering their open-weight models. You'll work at the intersection of cutting-edge research and production, either enhancing shared training infrastructure or embedding within research squads to translate novel ideas into scalable code.

What you'll do

  • Develop and optimize large-scale ML training pipelines and distributed learning systems
  • Build robust tools and infrastructure to accelerate researcher productivity
  • Integrate research checkpoints into production, streamline evaluation, and expose APIs
  • Design and benchmark deep learning algorithms on high-scale experiments (70B+ models, thousands of GPUs)
  • Convert research prototypes into production-grade components for consumer and enterprise products
  • Write efficient, well-tested Python code with strong software engineering practices

What they're looking for

  • Python programming
  • PyTorch, JAX, or TensorFlow
  • Distributed training frameworks (DeepSpeed, FSDP, SLURM, Kubernetes)
  • Deep learning and NLP/LLM experience
  • Software design and code quality (testing, CI/CD)
  • CUDA or data pipeline optimization (bonus)
  • Large-scale ML system architecture
  • Collaborative problem-solving

Benefits

  • Competitive salary and equity
  • Medical, dental, and vision coverage for employee and family
  • 401(k) with 6% matching
  • 18 days paid time off
  • Office parking reimbursement
Apply on the employer's site

Opens the official application on the employer’s site. No login required.