palantir
Software Engineer - Hosted Model Infrastructure
Washington, D.C.full-timemid
About this role
Palantir seeks a Software Engineer to build and maintain infrastructure for deploying ML models across diverse, resource-constrained environments including air-gapped networks and edge devices. You'll own end-to-end services spanning inference engines, GPU scheduling, deployment pipelines, and platform integration to enable frontier AI capabilities for critical customers.
What you'll do
- Design and maintain ML model deployment infrastructure for air-gapped and edge environments
- Manage GPU scheduling, resource allocation, and inference engine optimization
- Build and support continuous deployment pipelines for models and capabilities
- Develop observability and monitoring solutions for production ML systems
- Integrate hosted model infrastructure with Palantir's broader platform
- Collaborate with customers to solve deployment challenges in constrained environments
What they're looking for
- Software engineering and full-stack development
- ML/AI infrastructure and deployment
- GPU scheduling and resource optimization
- Inference engine design or experience
- Continuous integration/continuous deployment (CI/CD)
- Observability and monitoring systems
- Cloud or on-premise infrastructure management
- Systems programming and optimization
Opens the official application on the employer’s site. No login required.