Flashpoint
Data Engineer I
Remote in the United States (Remote)$107.5k–$150kfulltimeentryAdded 2 days ago
About this role
Flashpoint seeks an experienced Data Engineer I to operate and optimize large-scale real-time data pipelines processing petabytes of threat intelligence data. You'll own production streaming infrastructure using GCP Pub/Sub and Dataflow, integrate AI enrichment via Vertex AI/Gemini, and serve as the operational backbone ensuring 24/7 system reliability for enterprise and government customers.
What you'll do
- Operate end-to-end real-time pipelines ingesting, enriching, and routing data through AI models for threat assessment
- Own Pub/Sub infrastructure and production incident response for message delivery and consumer systems
- Scale, monitor, and optimize Dataflow jobs handling petabyte-scale datasets; diagnose and fix failures
- Build and maintain monitoring, alerting, and incident-response tooling for zero-downtime systems
- Ensure data integrity and accuracy from ingestion through customer delivery
- Partner with Product, Data team, and Analysts to translate requirements into operational systems
What they're looking for
- Production streaming pipeline operation (Pub/Sub, Kafka, or equivalent)
- GCP Dataflow and Apache Beam optimization and debugging
- Vertex AI and Gemini integration with data pipelines
- Monitoring and alerting tools (Prometheus, Grafana, Stackdriver)
- Incident response and on-call operations
- Large-scale data handling (terabyte to petabyte range)
- Prompt engineering for data enrichment
- SLA management and observability
Opens the official application on the employer’s site. No login required.