Together AI

Research Engineer, Core ML

San FranciscomidAdded 2 days ago

About this role

The Research Engineer role in Core ML at Together AI focuses on transforming advanced reinforcement learning techniques and inference optimizations into practical systems that enhance API performance. The position requires involvement in all aspects of system development, aiming for measurable gains in efficiency and model quality.

What you'll do

Design algorithms for low-latency, high-throughput inference.
Optimize high-performance inference engines and systems.
Integrate RL and post-training pipelines for cost-effective inference.
Profile and debug production systems for stable improvements.
Lead technical direction in inference and RL projects.
Mentor engineers on full-stack ML systems.

What they're looking for

Expertise in ML inference systems.
Understanding of RL algorithms and techniques.
Experience with GPU optimization.
Ability to debug production workloads.
Proficient in performance engineering.
Familiarity with scheduling and batching strategies.
Technical leadership capabilities.
End-to-end system ownership.

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.