Baseten
Applied AI Inference Engineer
San Francisco (Remote)$165k–$330kfulltimemidAdded 2 days ago
About this role
Baseten seeks an Applied AI Inference Engineer to work directly with customers deploying production AI applications on their platform. You'll architect and optimize high-scale AI solutions end-to-end, balancing hands-on engineering with customer collaboration, product strategy, and technical leadership.
What you'll do
- Design, build, and deploy AI inference solutions from problem definition through production monitoring
- Partner with customers across sales, implementation, and expansion phases to translate business goals into reliable services
- Develop production-level software primarily in Python, optimizing for performance, latency, and cost
- Rapidly prototype and ship well-tested services while managing ambiguity and technical tradeoffs
- Own products and customer projects end-to-end, functioning as engineer, project manager, and product strategist
- Contribute to platform improvements through feature development and collaboration with engineering/product teams
What they're looking for
- Python (or other general-purpose programming languages)
- AI/ML pipeline development and model deployment
- Production software engineering
- System design and optimization
- Technical communication
- Project management
- Problem-solving in ambiguous contexts
- Customer collaboration and empathy
Benefits
- Competitive salary with meaningful equity
- 100% medical, dental, and vision insurance coverage for employee and dependents
- Flexible work arrangements
Opens the official application on the employer’s site. No login required.