Mistral AI

Research Engineer, Machine Learning

Palo Alto · full-time · mid · Added today

About this role

Mistral AI seeks a Research Engineer to build and optimize the large-scale machine learning systems that power our open-weight models. You'll work with Research Scientists, either on shared infrastructure and tooling or embedded within a research squad, bridging cutting-edge research and production-grade systems.

What you'll do

  • Design and optimize large-scale ML training pipelines and distributed systems
  • Build robust tools and frameworks that accelerate research scientists' work
  • Integrate research checkpoints, streamline evaluation processes, and expose APIs
  • Conduct experiments with advanced deep-learning techniques on large GPU clusters
  • Develop and benchmark ML algorithms with production-quality code
  • Deliver prototypes that transition into production components for products and APIs

What they're looking for

  • Large-scale ML system development (4+ years)
  • PyTorch, JAX, or TensorFlow
  • Distributed training frameworks and cluster tooling (DeepSpeed, FSDP, SLURM, Kubernetes)
  • Deep learning, NLP, or LLM experience
  • Python programming
  • Software design and code quality practices
  • CUDA (bonus)
  • Data pipeline engineering (bonus)