Skip to main content

Firecrawl

Research Engineer — Reinforcement Learning

San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10) (Remote)$180k–$290kfulltimemidAdded 2 days ago

About this role

Firecrawl seeks a Research Engineer to build reinforcement learning systems that improve their web data extraction product. You'll design training infrastructure, reward pipelines, and fine-tuning systems while bridging classical RL with modern LLM agents, shipping models to production at a fast-moving startup.

What you'll do

  • Design and operate training infrastructure, data pipelines, and reward modeling systems from scratch
  • Fine-tune foundation models to achieve state-of-the-art performance on web data extraction tasks
  • Apply reinforcement learning techniques to improve multi-step LLM agent workflows
  • Run rapid experiments to validate hypotheses and iterate on model improvements
  • Communicate RL research findings clearly to non-technical stakeholders and product teams
  • Collaborate with IR-focused researchers and engineers to integrate RL improvements into the product

What they're looking for

  • Reinforcement learning (PPO, RLHF, reward modeling, policy optimization)
  • Model fine-tuning and training infrastructure
  • GPU cluster management and training operations
  • LLM agents and prompt engineering
  • Data pipeline and evaluation framework design
  • Production ML systems and deployment
  • Fast experimentation and iteration
  • Clear technical communication

Benefits

  • Competitive salary: $180,000–$290,000/year plus up to 0.15% equity
  • Flexible work: San Francisco hybrid or remote (Americas, UTC-3 to UTC-10)
  • Small, technical, fast-moving team
  • Opportunity to ship research directly to production
  • Work on essential infrastructure for LLM applications
Apply on the employer's site

Opens the official application on the employer’s site. No login required.