Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Research Engineer, Media Understanding image - Rise Careers
Job details

Research Engineer, Media Understanding

Research Engineer, Media Understanding- Multimodal Representation Models

Mountain View, CA

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

The Role

As part of the multimodal features team in Media Understanding at Google DeepMind, you will have the opportunity to advance the state-of-the-art research in Embedding/representation models in context of large language models. You'll be at the forefront of developing models that power Google products used by billions of people worldwide. Your work will directly impact how these products understand and interact with diverse media, including text, images, audio, and video.  This is a unique opportunity to shape the future of multimodal AI and its applications in a dynamic and impactful environment.

We are a team of research/software engineers, research scientists, and machine learning experts, working together to enable superhuman understanding of the visual world. We are aiming at training the most powerful omnimodal embedding model which can be used for retrieval and other agentic use cases in Google products. 

You'll be developing the next SOTA models for multimodal understanding. Your work will include researching new modeling techniques, implementing research ideas, running experiments to evaluate improvements, and identifying new opportunities.

Key Responsibilities

As a member of the media understanding team, you will be responsible for conducting core and applied research in computer vision and language understanding to support a multitude of Google products and use cases. Your job responsibilities will include:

  • Conducting core research in the areas of computer vision, language understanding, multimodal models, large scale AI models and other key computer vision tasks.
  • Training and evaluating AI models for a variety of product use cases. 
  • Researching, Implementing, and adapting state of the art deep learning approaches for Google’s use cases
  • Collaborating closely with other GDM and partner teams to make progress towards building the most advanced embedding models.

About You

We are an applied research team that takes on challenging real-world problems and thrives on finding solutions in the presence of ambiguity. In order to set you up for success as a Research Engineer/Scientist at Google DeepMind, we look for the following skills and experience:

  • Ph.D. in Computer Science or related quantitative field, or B.S./M.S. in Computer Science or related quantitative field with 5+ years of relevant experience.
  • Innovate and assess new machine learning models and techniques for pilot projects, quickly demonstrating viability and potential impact.  Transform successful prototypes into scalable solutions for wider integration within Google's products.
  • Conduct research to identify and address impactful problems inspired by current and future real-world needs. Investigate and develop novel solutions by studying related work, conducting experiments, and constructing prototypes and demonstrations.
  • Collaborate with product teams to drive the implementation of research insights, fostering innovation and the development of new products.

In addition, the following would be an advantage: 

  • Strong research experience and publication record in top tier conferences.
  • Experience with core software engineering and applied implementations of AI 
  • A good team player who has demonstrated that they can work across teams given that image-text involves collaborating with both research and product teams.
  • Hands-on experience with Google-scale infrastructure would be a plus, e.g. large scale data mining from various Google data stores. Automation pipeline. Client deployment across PAs.

The US base salary range for this full-time position is between $215,000 - $250,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application deadline: Friday, February 28th 2025

Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy.

DeepMind Glassdoor Company Review
5.0 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
DeepMind DE&I Review
5.0 Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of DeepMind
DeepMind CEO photo
Demis Hassabis
Approve of CEO

Average salary estimate

$232500 / YEARLY (est.)
min
max
$215000K
$250000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Research Engineer, Media Understanding, DeepMind

Are you ready to dive into the fascinating world of AI at Google DeepMind? We’re on the lookout for a talented Research Engineer in Media Understanding to join our vibrant team in beautiful Mountain View, California. In this role, you’ll be at the cutting edge of developing state-of-the-art embedding and representation models that enhance how Google’s products engage with various media forms like text, audio, images, and video. Your work will not only involve advanced research but will also make a direct impact on millions of users globally. As a researcher, you'll collaborate closely with a team of engineers and scientists, tackling real-world challenges in the realms of computer vision and language understanding. You’ll be experimenting with innovative deep learning techniques and translating your findings into scalable solutions that are ready to be integrated into widely-used Google products. Your previous experience, especially if you hold a Ph.D. in a quantitative field such as Computer Science, will come in handy as you push the boundaries of multimodal AI. We believe in teamwork, creativity, and making a difference, so if you have a passion for solving complex problems and a background working with large-scale AI systems, this could be an incredible opportunity for you to shine and grow. Your journey with us entails not only research and experimentation but also the chance to really shape the future of AI technology.

Frequently Asked Questions (FAQs) for Research Engineer, Media Understanding Role at DeepMind
What are the primary responsibilities of a Research Engineer in Media Understanding at Google DeepMind?

As a Research Engineer in Media Understanding at Google DeepMind, your primary responsibilities include conducting core research in computer vision and language understanding, training AI models for various product use cases, and collaborating with diverse teams to develop advanced embedding models. You'll be actively engaged in researching and implementing state-of-the-art deep learning approaches tailored for Google's innovative products.

Join Rise to see the full answer
What qualifications are needed for the Research Engineer, Media Understanding position at Google DeepMind?

To qualify for the Research Engineer, Media Understanding role at Google DeepMind, candidates should have a Ph.D. in Computer Science or a related field, or a B.S./M.S. with at least 5 years of relevant experience. A strong research background, familiarity with machine learning models, and collaboration skills across research and product teams are also essential to excel in this position.

Join Rise to see the full answer
How does a Research Engineer at Google DeepMind contribute to multimodal AI?

In the role of a Research Engineer at Google DeepMind, your contributions to multimodal AI entail developing cutting-edge models that optimize understanding across different media formats. You will leverage your research to innovate and solve real-world challenges, shaping how Google products interact and understand diverse data, ultimately impacting a global user base.

Join Rise to see the full answer
What is the work environment like for Research Engineers at Google DeepMind?

The work environment at Google DeepMind is dynamic and inclusive, where collaboration is highly valued. As a Research Engineer, you will work on exciting projects alongside talented engineers, research scientists, and machine learning experts. The atmosphere encourages creativity, problem-solving, and continuous innovation in tackling complex research challenges.

Join Rise to see the full answer
What opportunities for growth and development exist for a Research Engineer, Media Understanding at Google DeepMind?

As a Research Engineer, Media Understanding at Google DeepMind, you'll have numerous opportunities for professional growth. You'll engage with pioneer research in AI, collaborate with top-tier teams, and have access to resources and training that help advance your skills. Regular exposure to groundbreaking projects enables you to stay at the forefront of technology.

Join Rise to see the full answer
Common Interview Questions for Research Engineer, Media Understanding
Can you explain your experience with multimodal modeling?

When discussing your experience with multimodal modeling, highlight specific projects where you've successfully integrated different data types such as text, audio, and images. Discuss any techniques you used, the challenges faced, and how you overcame them to create a cohesive understanding of multimedia content.

Join Rise to see the full answer
What frameworks or libraries do you prefer for deep learning research?

In your response, mention popular deep learning frameworks such as TensorFlow or PyTorch. Explain why you choose these frameworks, focusing on aspects like community support, ease of integration, or specific features that have helped you in developing models for research or real-world applications.

Join Rise to see the full answer
Describe a research project that had a significant impact on your work.

Discuss a research project where your findings led to tangible outcomes. Emphasize the objectives, methodologies, results and how you implemented those results into a product or service, demonstrating your ability to translate research into practical solutions.

Join Rise to see the full answer
How do you approach understanding and solving complex problems?

During the interview, describe your problem-solving strategy. Highlight your process of breaking down complex challenges, conducting thorough research, testing hypotheses, and collaborating with teammates to brainstorm potential solutions. Show your analytical skills and ability to think critically.

Join Rise to see the full answer
Can you provide an example of a successful collaboration with product teams?

Share a specific example where you partnered with product teams to drive research insights into product development. Discuss the interaction dynamics, the contribution you made, and the outcome. This illustrates your ability to bridge the gap between research and practical application.

Join Rise to see the full answer
What state-of-the-art techniques have you implemented in your research?

Be prepared to discuss the latest techniques you've explored, whether it’s transformer models, reinforcement learning, or novel embedding strategies. Highlight how you implemented these techniques and their impact on your projects, reflecting your up-to-date knowledge in the field.

Join Rise to see the full answer
How do you evaluate the performance of your models?

When responding, explain the evaluation metrics you utilize, such as accuracy, precision, and recall. Describe your approach to validating model performance and how you iterate based on the results to refine your models continuously.

Join Rise to see the full answer
What do you find most challenging about research in AI?

Reflect on the aspects of AI research that present challenges, such as the unpredictability of experiment outcomes or the need for constant adaptation to new technologies. Discuss how you maintain motivation and adapt strategies to overcome these hurdles.

Join Rise to see the full answer
What role does publication play in your research work?

Explain your belief in the importance of publishing research findings. Discuss how sharing your work with the academic community not only contributes to your professional growth but also helps foster collaboration and exchange of ideas that advance the field as a whole.

Join Rise to see the full answer
How do you stay current with advancements in AI research?

Discuss your methods for keeping up-to-date with AI advancements, such as attending conferences, reading journals, or participating in relevant online communities. Highlight your proactive approach to learning and how it informs your work as a Research Engineer.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
DeepMind Hybrid Mountain View, California, US
Posted 6 days ago
Zai Lab (US) LLC Hybrid 601 Gateway Blvd, South San Francisco, CA 94080, USA
Posted 4 days ago
Photo of the Rise User
DoorDash USA Remote New York, NY; San Francisco, CA; Mountain View, CA; Seattle, WA
Posted 6 days ago
iGenius Remote No location specified
Posted 3 days ago
Photo of the Rise User
Posted 4 days ago
Photo of the Rise User
Posted 9 days ago
Photo of the Rise User
Posted 3 days ago

We're committed to solving intelligence, to advance science and humanity.

83 jobs
MATCH
VIEW MATCH
FUNDING
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
SALARY RANGE
$215,000/yr - $250,000/yr
EMPLOYMENT TYPE
Full-time, on-site
DATE POSTED
January 28, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!