Skip to main content

palantir

Software Engineer - Hosted Model Infrastructure

New York, NYfull-timemid

About this role

Palantir seeks a Software Engineer to build and maintain infrastructure for deploying AI models across diverse environments, from air-gapped networks to edge devices. You'll own end-to-end services spanning inference engines, GPU scheduling, deployment pipelines, and observability, ensuring models run reliably on customer-controlled hardware with limited resources.

What you'll do

  • Design and maintain ML model deployment infrastructure for restricted environments
  • Develop and optimize inference engines and GPU resource scheduling systems
  • Build deployment pipelines and observability solutions for production models
  • Integrate hosted model services with Palantir's broader platform
  • Ensure continuous testing and delivery of model updates
  • Support customers running AI on constrained hardware in air-gapped networks

What they're looking for

  • Software engineering and full-stack development
  • Machine learning infrastructure and model deployment
  • GPU scheduling and optimization
  • Cloud/edge computing and distributed systems
  • DevOps, CI/CD pipelines, and observability tools
  • Data systems and platform integration
  • Problem-solving in resource-constrained environments
  • Python or systems programming languages
Apply on the employer's site

Opens the official application on the employer’s site. No login required.