Anthropic
Research Engineer, Domain Scaling
San Francisco, CA | New York City, NY | Seattle, WA
From $850k | mid | Added 2 days ago
About this role
Join Anthropic's Domain Scaling team to develop specialized AI capabilities for real-world industries like finance, healthcare, and legal. You'll lead the end-to-end process of creating reinforcement learning environments, managing data sourcing, and measuring model performance improvements across knowledge work domains.
What you'll do
- Own data strategy for knowledge work verticals from task sourcing through RL training
- Manage technical relationships with external data vendors and evaluate data quality
- Collaborate with domain experts to design data pipelines and evaluation frameworks
- Develop novel approaches for creating RL environments for high-value tasks
- Build QA frameworks to prevent reward hacking and ensure environment quality
- Partner with RL and product teams to translate capability goals into training environments
What they're looking for
- Large language model fine-tuning and domain adaptation
- Reinforcement learning and reward design
- Training data curation and LLM dataset management
- Vendor management and technical relationship building
- ML evaluation and benchmark design
- Cross-functional collaboration
- Production ML systems experience
- Domain expertise in finance, healthcare, or legal
Benefits
- Annual salary range: $350,000–$850,000 USD
- Hybrid work policy requiring at least 25% of time in the office
- Visa sponsorship available
- Work on cutting-edge AI safety and capability research
- Multiple office locations: San Francisco, New York City, or Seattle