allencontrolsystems
Platform Engineer, Data
Austin, TXmidAdded 2 days ago
About this role
Allen Control Systems seeks a Data Platform Engineer to build scalable infrastructure for large-scale image and video pipelines while optimizing datasets for AI model training. You'll combine data engineering expertise with ML knowledge to design intelligent data curation strategies that maximize model performance for autonomous defense systems.
What you'll do
- Design and develop scalable data infrastructure to handle growing volumes of images and videos
- Implement dataset optimization techniques such as coreset selection, embedding-based filtering, and complexity scoring
- Build and maintain data ingestion pipelines with synthetic data generation and versioning
- Partner with ML engineers to ensure data integrity and model performance alignment
- Apply hard example mining, class balancing, and deduplication strategies
- Orchestrate data orchestration workflows for autonomous systems
What they're looking for
- Data infrastructure and pipeline design
- Machine learning principles and model training dynamics
- Large-scale image and video processing
- Python or similar programming languages
- Data curation and dataset optimization
- Cloud platforms and distributed systems
- Data versioning and management
- SQL and data querying
Opens the official application on the employer’s site. No login required.