Skip to main content

Cantina

Media Software Engineer, Speech (All Levels)

San Francisco (Remote)$120k–$180kfulltimemidAdded 2 days ago

About this role

Cantina Labs seeks a Software Engineer to optimize real-time speech and audio systems powering live AI conversations. You'll reduce latency, build voice/video capabilities, and enhance WebRTC infrastructure across multiple platforms while working on cutting-edge conversational AI technology.

What you'll do

  • Optimize real-time speech and media systems for low-latency AI conversations
  • Reduce latency in audio streaming and speech processing pipelines
  • Develop and expand voice and video features for user-AI interactions
  • Improve custom WebRTC infrastructure across iOS, Android, and web platforms
  • Collaborate with product and platform teams on conversational AI experiences
  • Debug and solve complex, subtle engineering problems in real-time systems

What they're looking for

  • C or C++
  • Multithreaded and concurrent programming
  • System programming and network protocols
  • Data structures and algorithms
  • Object-oriented design
  • WebRTC, streaming protocols, or media technologies (preferred)
  • Audio/video processing (preferred)
  • iOS/Android development (preferred)

Benefits

  • Competitive salary ($120,000-$180,000) plus company equity
  • Medical, dental, and vision insurance with 99.99% premiums covered
  • 42 days paid time off (15 PTO, 10 sick, 15 holidays, 2 floating)
  • Remote and hybrid work options available
  • Bay Area location preferred with relocation support
Apply on the employer's site

Opens the official application on the employer’s site. No login required.