openai
Data Engineer, People Innovation Labs
San Francisco (Remote)fulltimemid
About this role
OpenAI's People Innovation Labs seeks a Data Engineer to design and maintain data pipelines supporting internal HR products and people analytics. You'll build scalable data systems that integrate employee information into the Databricks warehouse and enable data-driven insights across recruiting, culture, and organizational initiatives.
What you'll do
- Design and manage people data pipelines integrating with Databricks warehouse
- Develop canonical datasets tracking people metrics and product performance
- Collaborate with Data Platform, Analytics, and People teams to understand data requirements
- Build robust, fault-tolerant systems for data ingestion and processing
- Lead data architecture decisions as the primary engineering expert on the team
- Ensure data security, integrity, and compliance with industry standards
What they're looking for
- Data engineering (3+ years experience)
- Python, Scala, or Java programming
- Databricks and/or Snowflake
- ETL schedulers (Airflow, Fivetran, Dagster, Prefect, or similar)
- Distributed processing (Spark, Hadoop, Flink)
- Distributed storage systems (HDFS, S3)
- Data warehouse design
- Software engineering fundamentals (8+ years)
Opens the official application on the employer’s site. No login required.