Epoch AI
Software Engineer, Benchmarking
Remote (Remote)full timemidAdded 2 days ago
About this role
Epoch AI seeks a Software Engineer to develop and maintain benchmarking infrastructure for evaluating frontier AI models. You'll work with a team to run existing benchmarks, integrate with AI providers, create new benchmarks, and support internal research experiments.
What you'll do
- Run and maintain AI benchmarking infrastructure
- Integrate with AI model providers and services
- Set up and deploy existing benchmarks on internal systems
- Design and develop new benchmarks for AI evaluation
- Support and facilitate internal experiments
- Collaborate with the benchmarking team on infrastructure improvements
What they're looking for
- Software engineering and systems design
- Infrastructure development and maintenance
- API integration
- Python or similar programming languages
- Cloud platforms and deployment tools
- Database management
- Testing and debugging
- Version control systems
Benefits
- Fully remote position
- International hiring across many countries
- Rolling applications (flexible timeline)
- Collaborative research environment
- Work on frontier AI evaluation challenges
Opens the official application on the employer’s site. No login required.