Anthropic

Research Engineer, Knowledge Foundations

San Francisco, CA · From $850k · mid · Added 2 days ago

About this role

Anthropic is seeking a Research Engineer to design and optimize training environments and evaluations for Claude's knowledge work capabilities. You'll conduct end-to-end experiments spanning data pipelines, model training, and evaluation to improve how Claude searches, retrieves, and reasons over information at scale.

What you'll do

  • Design and iterate on training environments and data pipelines for knowledge-intensive reasoning tasks
  • Run full-cycle experiments: form hypotheses, build infrastructure, train models, and analyze results
  • Develop evaluations that measure progress on search, retrieval, and reasoning quality
  • Identify model failure modes and translate them into training signals
  • Collaborate with RL, post-training, and product teams to align priorities and ship improvements
  • Build observability, dashboards, and operational tooling for training environments, with metrics that have a high signal-to-noise ratio

What they're looking for

  • Python engineering with production-grade, reliable code
  • ML experiment design and analysis
  • Full-stack ML work (data pipelines, training, evaluation)
  • Large language model training, fine-tuning, or RL experience
  • LLM evaluation design for open-ended domains
  • Distributed systems operation at scale
  • Clear communication across teams and time zones
  • Comfort with ambiguity and prioritization