Skip to main content

Anthropic

Full-Stack Software Engineer, Reinforcement Learning

San Francisco, CA | New York City, NYFrom $405kmidAdded 2 days ago

About this role

Join Anthropic's Reinforcement Learning team as a Full-Stack Software Engineer building platforms that power environment creation, data collection, and training for Claude. You'll own products end-to-end—from backend APIs to web interfaces used by researchers and thousands of data labelers—in a fast-moving environment where iteration happens in hours, not months.

What you'll do

  • Build web platforms for RL environment creation, management, versioning, and quality review
  • Develop vendor-facing interfaces for external partners to create and iterate on training environments
  • Design human data collection platforms at scale with labeling workflows and quality assurance systems
  • Create evaluation dashboards and observability UIs for training run health and environment quality
  • Implement backend services and APIs connecting environment tools, data collection, and RL infrastructure
  • Build scalable code data generation pipelines producing diverse programming tasks with reward signals

What they're looking for

  • Full-stack software engineering (database to frontend)
  • Python
  • React, TypeScript, or modern web frameworks
  • System design for scalable platforms
  • UX design and user interface development
  • Cross-functional communication with researchers and operations teams
  • High agency and problem-solving in ambiguous situations
  • Data collection and labeling platform experience (preferred)
Apply on the employer's site

Opens the official application on the employer’s site. No login required.