DeepMind

Research Engineer, Human Understanding

Los Angeles, California, US; Mountain View, California, US
Mid-level

About this role

Google DeepMind is hiring a Research Engineer (L5) to develop multimodal AI models for human understanding, focusing on speech, audio, and visual data. You'll design foundational capabilities to understand and generate human representations while building defenses against deepfakes and impersonation within the Frontier AI unit.

What you'll do

Research and implement novel multimodal models for holistic human understanding across visual, audio, and textual data
Conduct experimental research cycles from hypothesis through deployment
Lead substantial technical projects from ideation to evaluation with cross-functional collaboration
Develop scalable research infrastructure for multimodal models and datasets
Design and execute strategies for tuning vision language models and foundation models for specific tasks
Drive technical direction for core components addressing complex, ambiguous problems

What they're looking for

Machine learning model development (audio, speech, visual models)
Vision language models and foundation model tuning
Python programming and deep learning frameworks (e.g., JAX)
Independent research design, implementation, and analysis
Multimodal learning integrating vision, audio, and text
Generative AI techniques and architectures
Reinforcement learning or alignment methods
Privacy-preserving machine learning and responsible AI

Benefits

Competitive salary range: $174,000 - $252,000 USD
Bonus and equity compensation
Health and wellness benefits package
Opportunity to contribute to AGI research at Google DeepMind
Work on impactful, responsible AI deployment across Google products
Collaboration with leading AI researchers and scientists
