Skip to main content

Cursor

Software Engineer, Model Routing & Inference

New YorkfulltimemidAdded 2 days ago

About this role

Build the inference platform powering Cursor's AI interactions, focusing on making model serving faster, more reliable, and cost-effective at scale. You'll own the complete inference path from routing to failover, handling millions of requests daily across multiple AI providers.

What you'll do

  • Design and maintain the inference gateway abstracting multiple provider APIs
  • Implement cross-provider failover mechanisms to prevent outages
  • Build routing, backpressure, and admission control systems for traffic spikes
  • Optimize for cost, latency, and reliability tradeoffs in production
  • Handle the inference path for agent sessions, completions, and chat messages
  • Scale the system to support millions of AI requests

What they're looking for

  • Distributed systems design and implementation
  • High-throughput, low-latency system architecture
  • Inference serving or traffic routing experience
  • Real-time data pipeline design
  • Cost and capacity planning optimization
  • Production systems at scale
  • API abstraction and integration
  • Software engineering fundamentals
Apply on the employer's site

Opens the official application on the employer’s site. No login required.