Anthropic

Research Engineer, Domain Scaling

San Francisco, CA | New York City, NY | Seattle, WAFrom $850kmidAdded 2 days ago

About this role

Join Anthropic's Domain Scaling team to develop specialized AI capabilities for real-world industries like finance, healthcare, and legal. You'll lead the end-to-end process of creating reinforcement learning environments, managing data sourcing, and measuring model performance improvements across knowledge work domains.

What you'll do

Own data strategy for knowledge work verticals from task sourcing through RL training
Manage technical relationships with external data vendors and evaluate data quality
Collaborate with domain experts to design data pipelines and evaluation frameworks
Develop novel approaches for creating RL environments for high-value tasks
Build QA frameworks to prevent reward hacking and ensure environment quality
Partner with RL and product teams to translate capability goals into training environments

What they're looking for

Large language model fine-tuning and domain adaptation
Reinforcement learning and reward design
Training data curation and LLM dataset management
Vendor management and technical relationship building
ML evaluation and benchmark design
Cross-functional collaboration
Production ML systems experience
Domain expertise in finance, healthcare, or legal

Benefits

Annual salary range: $350,000–$850,000 USD
Hybrid work policy requiring 25% office time minimum
Visa sponsorship available
Work on cutting-edge AI safety and capability research
Multiple office locations: San Francisco, New York City, or Seattle

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.