Skip to main content

Together AI

Research Engineer, Frontier Speculative Decoding

San Francisco, New York CitymidAdded 2 days ago

About this role

Together AI is seeking a Research Engineer to connect advanced research in generative AI with practical applications. The role focuses on customizing models based on specific customer needs and entails rigorous data processing and performance evaluations to ensure optimal efficiency.

What you'll do

  • Design and enhance speculator algorithms for accuracy and efficiency.
  • Bridge the gap between raw data and production-ready models.
  • Engage directly with customers to assess and meet their needs.
  • Collaborate with teams to integrate work into the production platform.
  • Maintain technical ownership and tackle challenging problems.
  • Work in a dynamic and impactful generative AI environment.

What they're looking for

  • Data curation and processing
  • Hyperparameter tuning
  • Experience with training codebases
  • Model checkpoint evaluation
  • Proficiency in Python and PyTorch
  • Knowledge of SLURM/Kubernetes
  • Familiarity with modern LLMs
  • Basic understanding of distributed training frameworks

Benefits

  • Competitive compensation
  • Startup equity
  • Health insurance
  • Flexible work environment
  • Equity in salary based on experience
  • Opportunities for professional growth
Apply on the employer's site

Opens the official application on the employer’s site. No login required.