Cylake
Data Pipeline Engineer
Sunnyvale$150k–$250kfulltimemidAdded today
About this role
Design and build scalable data lakehouse architectures supporting petabyte-scale analytics for a early-stage cybersecurity company. You'll own end-to-end data pipelines from ingestion through transformation, working with industry veterans to deliver high-impact products.
What you'll do
- Architect and maintain scalable, open-source data lakehouse infrastructure handling petabyte-scale workloads
- Design end-to-end data pipelines covering ingestion, transformation, and consumption phases
- Support both batch and real-time processing architectures
- Ensure high performance, reliability, and data quality across systems
- Build and run large-scale distributed data systems
- Document solutions and communicate technical decisions effectively
What they're looking for
- Large-scale data system architecture (petabyte scale)
- Apache Iceberg, Parquet, Kafka, Spark, and Flink
- Python programming
- Data transformation and pipeline design
- Batch and real-time processing
- PostgreSQL and Neo4j
- Data governance and lineage tools (preferred)
- Cloud platform data services (preferred)
Benefits
- Competitive compensation ($150,000–$250,000 annually)
- Comprehensive health and well-being benefits package
- Opportunity to grow with early-stage company
- Work with industry veterans and experienced team
- Equal opportunity employer with diversity commitment
- Reasonable accommodations supported throughout hiring process
Opens the official application on the employer’s site. No login required.