Allen Control Systems
CV/ML Platform Engineer
Austin, TXfulltimemidAdded 2 days ago
About this role
Allen Control Systems seeks a CV/ML Platform Engineer to design and operate infrastructure supporting computer vision and machine learning development for autonomous defense systems. You'll manage a 130+ GPU Kubernetes cluster, build ML CI/CD pipelines, and enable high-velocity model training and optimization for edge deployment.
What you'll do
- Deploy and operate bare-metal Kubernetes clusters with 130+ NVIDIA GPUs and AWS burst capability
- Own CV/ML CI/CD pipelines and automate model training, testing, and validation workflows
- Maintain ML infrastructure including model versioning, experiment tracking, and data provenance systems
- Implement and optimize model deployment toolchains (TensorRT, quantization) for NVIDIA Jetson edge devices
- Manage high-performance storage solutions handling terabytes of video and image data
- Monitor GPU health, training throughput, and inference metrics using observability tools
What they're looking for
- Platform Engineering or MLOps (2+ years)
- Kubernetes and bare-metal GPU cluster management
- NVIDIA CUDA, GPU Operator, and GPU scheduling (MIG, Volcano)
- MLOps platforms (Kubeflow, MLflow, Weights & Biases, DVC)
- CI/CD pipeline development with model validation and artifact management
- Model optimization (TensorRT, ONNX, quantization) for ARM/Jetson
- High-performance storage (MinIO, WEKA, Ceph) and data orchestration
- Linux systems administration, networking, and observability (Prometheus/Grafana, ELK)
Benefits
- Competitive salary
- Health, Dental, Vision Insurance
- Paid Time Off
- Engineering-first culture focused on technical excellence
- Work on cutting-edge defense technology with real-world impact
- Founded by proven entrepreneurs with $180M in successful exits
Opens the official application on the employer’s site. No login required.