Epoch AI

Software Engineer, Benchmarking

Remote (Remote)full timemidAdded 2 days ago

About this role

Epoch AI seeks a Software Engineer to develop and maintain benchmarking infrastructure for evaluating frontier AI models. You'll work with a team to run existing benchmarks, integrate with AI providers, create new benchmarks, and support internal research experiments.

What you'll do

Run and maintain AI benchmarking infrastructure
Integrate with AI model providers and services
Set up and deploy existing benchmarks on internal systems
Design and develop new benchmarks for AI evaluation
Support and facilitate internal experiments
Collaborate with the benchmarking team on infrastructure improvements

What they're looking for

Software engineering and systems design
Infrastructure development and maintenance
API integration
Python or similar programming languages
Cloud platforms and deployment tools
Database management
Testing and debugging
Version control systems

Benefits

Fully remote position
International hiring across many countries
Rolling applications (flexible timeline)
Collaborative research environment
Work on frontier AI evaluation challenges

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.