Together AI
Research Engineer, Frontier Speculative Decoding
San Francisco, New York CitymidAdded 2 days ago
About this role
Together AI is seeking a Research Engineer to connect advanced research in generative AI with practical applications. The role focuses on customizing models based on specific customer needs and entails rigorous data processing and performance evaluations to ensure optimal efficiency.
What you'll do
- Design and enhance speculator algorithms for accuracy and efficiency.
- Bridge the gap between raw data and production-ready models.
- Engage directly with customers to assess and meet their needs.
- Collaborate with teams to integrate work into the production platform.
- Maintain technical ownership and tackle challenging problems.
- Work in a dynamic and impactful generative AI environment.
What they're looking for
- Data curation and processing
- Hyperparameter tuning
- Experience with training codebases
- Model checkpoint evaluation
- Proficiency in Python and PyTorch
- Knowledge of SLURM/Kubernetes
- Familiarity with modern LLMs
- Basic understanding of distributed training frameworks
Benefits
- Competitive compensation
- Startup equity
- Health insurance
- Flexible work environment
- Equity in salary based on experience
- Opportunities for professional growth
Opens the official application on the employer’s site. No login required.