Skip to main content

Allen Control Systems

CV/ML Platform Engineer

Austin, TXfulltimemidAdded 2 days ago

About this role

Allen Control Systems seeks a CV/ML Platform Engineer to design and operate infrastructure supporting computer vision and machine learning development for autonomous defense systems. You'll manage a 130+ GPU Kubernetes cluster, build ML CI/CD pipelines, and enable high-velocity model training and optimization for edge deployment.

What you'll do

  • Deploy and operate bare-metal Kubernetes clusters with 130+ NVIDIA GPUs and AWS burst capability
  • Own CV/ML CI/CD pipelines and automate model training, testing, and validation workflows
  • Maintain ML infrastructure including model versioning, experiment tracking, and data provenance systems
  • Implement and optimize model deployment toolchains (TensorRT, quantization) for NVIDIA Jetson edge devices
  • Manage high-performance storage solutions handling terabytes of video and image data
  • Monitor GPU health, training throughput, and inference metrics using observability tools

What they're looking for

  • Platform Engineering or MLOps (2+ years)
  • Kubernetes and bare-metal GPU cluster management
  • NVIDIA CUDA, GPU Operator, and GPU scheduling (MIG, Volcano)
  • MLOps platforms (Kubeflow, MLflow, Weights & Biases, DVC)
  • CI/CD pipeline development with model validation and artifact management
  • Model optimization (TensorRT, ONNX, quantization) for ARM/Jetson
  • High-performance storage (MinIO, WEKA, Ceph) and data orchestration
  • Linux systems administration, networking, and observability (Prometheus/Grafana, ELK)

Benefits

  • Competitive salary
  • Health, Dental, Vision Insurance
  • Paid Time Off
  • Engineering-first culture focused on technical excellence
  • Work on cutting-edge defense technology with real-world impact
  • Founded by proven entrepreneurs with $180M in successful exits
Apply on the employer's site

Opens the official application on the employer’s site. No login required.