DeepMind
Research Engineer, Human Understanding
Los Angeles, California, US; Mountain View, California, USmidAdded 2 days ago
About this role
Google DeepMind is hiring a Research Engineer (L5) to develop multimodal AI models for human understanding, focusing on speech, audio, and visual data. You'll design foundational capabilities to understand and generate human representations while building defenses against deepfakes and impersonation within the Frontier AI unit.
What you'll do
- Research and implement novel multimodal models for holistic human understanding across visual, audio, and textual data
- Conduct experimental research cycles from hypothesis through deployment
- Lead substantial technical projects from ideation to evaluation with cross-functional collaboration
- Develop scalable research infrastructure for multimodal models and datasets
- Design and execute strategies for tuning vision language models and foundation models for specific tasks
- Drive technical direction for core components addressing complex, ambiguous problems
What they're looking for
- Machine learning model development (audio, speech, visual models)
- Vision language models and foundation model tuning
- Python programming and deep learning frameworks (e.g., JAX)
- Independent research design, implementation, and analysis
- Multimodal learning integrating vision, audio, and text
- Generative AI techniques and architectures
- Reinforcement learning or alignment methods
- Privacy-preserving machine learning and responsible AI
Benefits
- Competitive salary range: $174,000 - $252,000 USD
- Bonus and equity compensation
- Health and wellness benefits package
- Opportunity to contribute to AGI research at Google DeepMind
- Work on impactful, responsible AI deployment across Google products
- Collaboration with leading AI researchers and scientists
Opens the official application on the employer’s site. No login required.