Cantina

Media Software Engineer, Speech (All Levels)

San Francisco (Remote)$120k–$180kfulltimemidAdded 2 days ago

About this role

Cantina Labs seeks a Software Engineer to optimize real-time speech and audio systems powering live AI conversations. You'll reduce latency, build voice/video capabilities, and enhance WebRTC infrastructure across multiple platforms while working on cutting-edge conversational AI technology.

What you'll do

Optimize real-time speech and media systems for low-latency AI conversations
Reduce latency in audio streaming and speech processing pipelines
Develop and expand voice and video features for user-AI interactions
Improve custom WebRTC infrastructure across iOS, Android, and web platforms
Collaborate with product and platform teams on conversational AI experiences
Debug and solve complex, subtle engineering problems in real-time systems

What they're looking for

C or C++
Multithreaded and concurrent programming
System programming and network protocols
Data structures and algorithms
Object-oriented design
WebRTC, streaming protocols, or media technologies (preferred)
Audio/video processing (preferred)
iOS/Android development (preferred)

Benefits

Competitive salary ($120,000-$180,000) plus company equity
Medical, dental, and vision insurance with 99.99% premiums covered
42 days paid time off (15 PTO, 10 sick, 15 holidays, 2 floating)
Remote and hybrid work options available
Bay Area location preferred with relocation support

Apply on the employer's site →

Opens the official application on the employer’s site. No login required.