Skip to main content

Cohere

Software Engineer, Data Infrastructure

New York (Remote)fulltimemidAdded 2 days ago

About this role

Cohere seeks a Software Engineer to design and optimize petabyte-scale data infrastructure supporting world-class AI model training. You'll work on high-performance storage systems, networking challenges, and data pipelines alongside leading researchers and engineers.

What you'll do

  • Build and maintain petabyte-scale storage infrastructure for demanding AI training workloads
  • Optimize networking and performance across distributed data systems
  • Collaborate with modeling teams on data storage and retrieval for training and evaluation
  • Transform unstructured data into performant datasets across multiple storage backends
  • Design and implement data pipelines using distributed processing frameworks
  • Support researchers and engineers with infrastructure for cutting-edge AI development

What they're looking for

  • Data storage infrastructure (4+ years)
  • Python
  • Kubernetes and storage management (Persistent Volumes, CSI drivers)
  • Multi-cloud storage (S3, GCS, POSIX)
  • Distributed data processing (Spark, Beam, Flink)
  • Analytics tooling (BigQuery, Airflow, dbt preferred)
  • AI research knowledge and passion
  • Ability to work on novel, unproven systems

Benefits

  • Weekly lunch stipend ($75/£75 equivalent)
  • Full health, dental, and mental health benefits
  • Retirement matching (RRSP, 401K, Pension)
  • 100% parental leave top-up for 6 months
  • 6 weeks paid vacation (30 working days)
  • Learning stipend, enrichment benefits, and $500 home office credit
Apply on the employer's site

Opens the official application on the employer’s site. No login required.