Skip to main content

Epoch AI

Software Engineer, Benchmarking

Remote (Remote)full timemidAdded 2 days ago

About this role

Epoch AI seeks a Software Engineer to develop and maintain benchmarking infrastructure for evaluating frontier AI models. You'll work with a team to run existing benchmarks, integrate with AI providers, create new benchmarks, and support internal research experiments.

What you'll do

  • Run and maintain AI benchmarking infrastructure
  • Integrate with AI model providers and services
  • Set up and deploy existing benchmarks on internal systems
  • Design and develop new benchmarks for AI evaluation
  • Support and facilitate internal experiments
  • Collaborate with the benchmarking team on infrastructure improvements

What they're looking for

  • Software engineering and systems design
  • Infrastructure development and maintenance
  • API integration
  • Python or similar programming languages
  • Cloud platforms and deployment tools
  • Database management
  • Testing and debugging
  • Version control systems

Benefits

  • Fully remote position
  • International hiring across many countries
  • Rolling applications (flexible timeline)
  • Collaborative research environment
  • Work on frontier AI evaluation challenges
Apply on the employer's site

Opens the official application on the employer’s site. No login required.