Allen Control Systems

Platform Engineer, Data

Austin, TX · Mid · Added 2 days ago

About this role

Allen Control Systems seeks a Data Platform Engineer to build scalable infrastructure for large-scale image and video pipelines while optimizing datasets for AI model training. You'll combine data engineering expertise with ML knowledge to design intelligent data curation strategies that maximize model performance for autonomous defense systems.

What you'll do

  • Design and develop scalable data infrastructure to handle growing volumes of images and videos
  • Implement dataset optimization techniques such as coreset selection, embedding-based filtering, and complexity scoring
  • Build and maintain data ingestion pipelines with synthetic data generation and versioning
  • Partner with ML engineers to ensure data integrity and keep datasets aligned with model performance
  • Apply hard example mining, class balancing, and deduplication strategies
  • Orchestrate data workflows for autonomous systems

What they're looking for

  • Data infrastructure and pipeline design
  • Machine learning principles and model training dynamics
  • Large-scale image and video processing
  • Python or similar programming languages
  • Data curation and dataset optimization
  • Cloud platforms and distributed systems
  • Data versioning and management
  • SQL and data querying