Anthropic

Research Engineer, Performance RL (Reinforcement Learning)

San Francisco, CAFrom $850kmidAdded 2 days ago

About this role

Join Anthropic's Code RL team to advance AI models' ability to write correct, performant code for accelerators using reinforcement learning. You'll design RL environments, conduct experiments, and deliver research into production training runs while collaborating across teams.

What you'll do

Design and implement RL environments and evaluation systems for accelerator code generation
Conduct experiments to advance code generation capabilities and shape research roadmap
Integrate research innovations into model training pipelines
Collaborate with researchers, engineers, and performance specialists across Anthropic
Translate accelerator performance knowledge into learnable tasks and reward signals

What they're looking for

Accelerator programming (CUDA, ROCm, Triton, Pallas)
ML framework expertise (JAX or PyTorch)
Full-stack development (kernels, model code, distributed systems)
Reinforcement learning
LLM training methodologies
ML workload optimization and porting
Research and engineering implementation balance

Benefits

Annual salary: $350,000–$850,000 USD
Hybrid work policy (minimum 25% in-office)
Visa sponsorship available
Work on cutting-edge AI safety and capability research

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.