Skip to main content

Baseten

Applied AI Inference Engineer

San Francisco (Remote)$165k–$330kfulltimemidAdded 2 days ago

About this role

Baseten seeks an Applied AI Inference Engineer to work directly with customers deploying production AI applications on their platform. You'll architect and optimize high-scale AI solutions end-to-end, balancing hands-on engineering with customer collaboration, product strategy, and technical leadership.

What you'll do

  • Design, build, and deploy AI inference solutions from problem definition through production monitoring
  • Partner with customers across sales, implementation, and expansion phases to translate business goals into reliable services
  • Develop production-level software primarily in Python, optimizing for performance, latency, and cost
  • Rapidly prototype and ship well-tested services while managing ambiguity and technical tradeoffs
  • Own products and customer projects end-to-end, functioning as engineer, project manager, and product strategist
  • Contribute to platform improvements through feature development and collaboration with engineering/product teams

What they're looking for

  • Python (or other general-purpose programming languages)
  • AI/ML pipeline development and model deployment
  • Production software engineering
  • System design and optimization
  • Technical communication
  • Project management
  • Problem-solving in ambiguous contexts
  • Customer collaboration and empathy

Benefits

  • Competitive salary with meaningful equity
  • 100% medical, dental, and vision insurance coverage for employee and dependents
  • Flexible work arrangements
Apply on the employer's site

Opens the official application on the employer’s site. No login required.