Firecrawl

Research Engineer — Reinforcement Learning

San Francisco, CA (Hybrid) OR Remote (Americas, UTC-3 to UTC-10) (Remote)$180k–$290kfulltimemidAdded 2 days ago

About this role

Firecrawl seeks a Research Engineer to build reinforcement learning systems that improve their web data extraction product. You'll design training infrastructure, reward pipelines, and fine-tuning systems while bridging classical RL with modern LLM agents, shipping models to production at a fast-moving startup.

What you'll do

Design and operate training infrastructure, data pipelines, and reward modeling systems from scratch
Fine-tune foundation models to achieve state-of-the-art performance on web data extraction tasks
Apply reinforcement learning techniques to improve multi-step LLM agent workflows
Run rapid experiments to validate hypotheses and iterate on model improvements
Communicate RL research findings clearly to non-technical stakeholders and product teams
Collaborate with IR-focused researchers and engineers to integrate RL improvements into the product

What they're looking for

Reinforcement learning (PPO, RLHF, reward modeling, policy optimization)
Model fine-tuning and training infrastructure
GPU cluster management and training operations
LLM agents and prompt engineering
Data pipeline and evaluation framework design
Production ML systems and deployment
Fast experimentation and iteration
Clear technical communication

Benefits

Competitive salary: $180,000–$290,000/year plus up to 0.15% equity
Flexible work: San Francisco hybrid or remote (Americas, UTC-3 to UTC-10)
Small, technical, fast-moving team
Opportunity to ship research directly to production
Work on essential infrastructure for LLM applications

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.