Alljoined
Data Infrastructure Engineer
San Francisco$140k–$180kfulltimemidAdded 2 days ago
About this role
Alljoined seeks a Data Infrastructure Engineer to build the backend systems enabling brain-computer interface research. You'll design and manage the entire data pipeline—from processing massive multimodal datasets through cloud and bare-metal clusters—to support foundational model training at scale.
What you'll do
- Build and maintain high-performance ETL pipelines processing terabytes of daily multimodal data (video, audio, text, time-series)
- Architect, provision, and manage bare-metal compute clusters, storage servers, and networking infrastructure for ML workloads
- Design storage topologies and manage databases (TimescaleDB, ClickHouse) across hybrid on-premise and cloud environments
- Bridge neuro hardware systems with central data repositories to ensure low-latency, high-throughput delivery to GPUs
- Optimize data ingestion from heterogeneous hardware peripherals while maintaining data integrity and concurrent stream handling
- Own the complete infrastructure lifecycle from I/O optimization to production deployment and monitoring
What they're looking for
- Systems-level architecture and production software engineering (3+ years)
- Python, Rust, C++, or Go
- High-performance ETL pipeline design for large-scale unstructured data
- Bare-metal cluster provisioning and management
- Database management (TimescaleDB, ClickHouse) and distributed storage
- Cloud platforms (AWS/GCP/Azure) and hybrid infrastructure
- ML frameworks (PyTorch/TensorFlow) and GPU utilization optimization
- Networking for distributed systems (InfiniBand, RoCE, zero-copy networking)
Benefits
- Competitive equity compensation at seed stage
- Housing support options
- Visa sponsorship
- 3% 401k matching
- Health insurance
- $140,000 - $180,000 annual compensation
Opens the official application on the employer’s site. No login required.