OpenAI

Data Engineer, People Innovation Labs

San Francisco (Remote), full-time, mid-level

About this role

OpenAI's People Innovation Labs seeks a Data Engineer to design and maintain data pipelines supporting internal HR products and people analytics. You'll build scalable data systems that integrate employee information into the Databricks warehouse and enable data-driven insights across recruiting, culture, and organizational initiatives.

What you'll do

  • Design and manage people data pipelines that integrate with the Databricks warehouse
  • Develop canonical datasets tracking people metrics and product performance
  • Collaborate with Data Platform, Analytics, and People teams to understand data requirements
  • Build robust, fault-tolerant systems for data ingestion and processing
  • Lead data architecture decisions as the primary engineering expert on the team
  • Ensure data security, integrity, and compliance with industry standards

What they're looking for

  • Data engineering (3+ years experience)
  • Python, Scala, or Java programming
  • Databricks and/or Snowflake
  • ETL schedulers (Airflow, Fivetran, Dagster, Prefect, or similar)
  • Distributed processing (Spark, Hadoop, Flink)
  • Distributed storage systems (HDFS, S3)
  • Data warehouse design
  • Software engineering fundamentals (8+ years)