Skip to main content

Anthropic

Research Engineer, Domain Scaling

San Francisco, CA | New York City, NY | Seattle, WAFrom $850kmidAdded 2 days ago

About this role

Join Anthropic's Domain Scaling team to develop specialized AI capabilities for real-world industries like finance, healthcare, and legal. You'll lead the end-to-end process of creating reinforcement learning environments, managing data sourcing, and measuring model performance improvements across knowledge work domains.

What you'll do

  • Own data strategy for knowledge work verticals from task sourcing through RL training
  • Manage technical relationships with external data vendors and evaluate data quality
  • Collaborate with domain experts to design data pipelines and evaluation frameworks
  • Develop novel approaches for creating RL environments for high-value tasks
  • Build QA frameworks to prevent reward hacking and ensure environment quality
  • Partner with RL and product teams to translate capability goals into training environments

What they're looking for

  • Large language model fine-tuning and domain adaptation
  • Reinforcement learning and reward design
  • Training data curation and LLM dataset management
  • Vendor management and technical relationship building
  • ML evaluation and benchmark design
  • Cross-functional collaboration
  • Production ML systems experience
  • Domain expertise in finance, healthcare, or legal

Benefits

  • Annual salary range: $350,000–$850,000 USD
  • Hybrid work policy requiring 25% office time minimum
  • Visa sponsorship available
  • Work on cutting-edge AI safety and capability research
  • Multiple office locations: San Francisco, New York City, or Seattle
Apply on the employer's site

Opens the official application on the employer’s site. No login required.