Skip to main content

Harvey

Research Engineer, Post-Training

San Francisco (Remote)$231k–$340kfulltimemidAdded 2 days ago

About this role

Harvey seeks a Research Engineer to lead post-training efforts that improve AI agent performance on legal work. You'll design training experiments, build evaluation systems, and collaborate with researchers to optimize models through feedback loops and domain-specific optimizations.

What you'll do

  • Drive post-training experiments balancing performance, cost, latency, and security trade-offs
  • Optimize agent systems including skills, tools, retrieval strategies, and validation loops for legal tasks
  • Design reliable grading and reward systems for evaluation and training iteration
  • Analyze agent behavior patterns and convert insights into training data or harness improvements
  • Collaborate with Harvey researchers and external partners on experiment design and model improvements

What they're looking for

  • Post-training expertise (SFT, preference optimization, RLHF, reward modeling, distillation)
  • Strong Python and research-engineering ability
  • Model behavior analysis and failure mode identification
  • Self-management of ambiguous applied research projects
  • Data and evaluation infrastructure building
  • Distributed training and GPU workload experience
  • Clear communication across research, engineering, and product teams

Benefits

  • Competitive salary: $231,000 - $340,000
  • Work on frontier AI transforming legal and professional services
  • Opportunity for significant personal and professional growth
  • Collaborate with world-class researchers and engineers
  • Scale impact with 1500+ customers in 60+ countries
Apply on the employer's site

Opens the official application on the employer’s site. No login required.