Skip to main content

Anthropic

Research Engineer, Performance RL (Reinforcement Learning)

San Francisco, CAFrom $850kmidAdded 2 days ago

About this role

Join Anthropic's Code RL team to advance AI models' ability to write correct, performant code for accelerators using reinforcement learning. You'll design RL environments, conduct experiments, and deliver research into production training runs while collaborating across teams.

What you'll do

  • Design and implement RL environments and evaluation systems for accelerator code generation
  • Conduct experiments to advance code generation capabilities and shape research roadmap
  • Integrate research innovations into model training pipelines
  • Collaborate with researchers, engineers, and performance specialists across Anthropic
  • Translate accelerator performance knowledge into learnable tasks and reward signals

What they're looking for

  • Accelerator programming (CUDA, ROCm, Triton, Pallas)
  • ML framework expertise (JAX or PyTorch)
  • Full-stack development (kernels, model code, distributed systems)
  • Reinforcement learning
  • LLM training methodologies
  • ML workload optimization and porting
  • Research and engineering implementation balance

Benefits

  • Annual salary: $350,000–$850,000 USD
  • Hybrid work policy (minimum 25% in-office)
  • Visa sponsorship available
  • Work on cutting-edge AI safety and capability research
Apply on the employer's site

Opens the official application on the employer’s site. No login required.