Research Scientist/Research Engineer, Multimodal Agents
Research Scientist/Research Engineer, Multimodal Agents

Research Scientist/Research Engineer, Multimodal Agents

Full-Time 100000 - 140000 ÂŁ / year (est.) No home office possible
Go Premium
T

At a Glance

  • Tasks: Develop innovative solutions to enhance multimodal AI capabilities in image and video.
  • Company: Join Google DeepMind, a leader in advancing artificial intelligence for public benefit.
  • Benefits: Competitive salary, bonuses, equity, and comprehensive benefits package.
  • Why this job: Be at the forefront of AI research and make a real-world impact.
  • Qualifications: PhD in machine learning or computer vision, with strong Python and ML skills.
  • Other info: Collaborative environment with opportunities for career growth and contributions to the scientific community.

The predicted salary is between 100000 - 140000 ÂŁ per year.

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Our team is part of Google DeepMind (GDM) in the Frontier‑AI unit. We specialize in multimodal foundational models, with a focus on image and video domains. We are looking for a research scientist to develop agentic solutions to improve the capabilities of multimodal models in GDM. Candidates must have strong machine learning skills, including experience in LLMs and computer vision. We also require competency in software engineering, which is required to implement robust solutions at the scale that we operate. Our team values both internal and external impact and there should be opportunities for both in this role.

As a Research Scientist specializing in Multimodal Agents, you will be at the forefront of developing innovative agentic solutions to enhance the capabilities of Google DeepMind’s foundational models, particularly within the image and video domains. This is an exciting opportunity to contribute directly to advancing the state of the art in artificial intelligence, working with cutting‑edge technologies and a team of world‑class experts. You will be instrumental in designing, implementing, and deploying robust machine learning solutions at scale, with a clear path to both internal and external impact through product integration and publications. This role offers a unique chance to shape the future of AI agents by pushing the boundaries of multimodal understanding and interaction.

Key responsibilities:

  • Design and implement novel agentic solutions to enhance the capabilities of multimodal foundational models, specifically in image and video domains.
  • Conduct cutting‑edge research in machine learning, with a focus on large language models (LLMs) and computer vision, to drive advancements in multimodal understanding and interaction.
  • Develop and deploy robust, scalable machine learning systems and prototypes that integrate effectively with Google DeepMind’s existing infrastructure.
  • Collaborate with cross‑functional teams of scientists and engineers to translate research insights into impactful product features and publications.
  • Analyze and evaluate the performance of agentic models, iterating on designs and approaches to continuously improve their effectiveness and efficiency.
  • Stay abreast of the latest research and developments in AI, particularly in multimodal learning and agent systems, and contribute to the scientific community through publications and presentations.

About You:

In order to set you up for success as a Research Scientist/Research Engineer at Google DeepMind, we look for the following skills and experience:

  • PhD in machine learning, computer vision or related field
  • 3+ publications in top ML or vision conferences/journals
  • Python experience
  • JAX/pytorch experience

In addition, the following would be an advantage:

  • Distributed data pipeline experience (e.g., beam)
  • C++ experience
  • Experience developing LLM‑based agents.

The US base salary range for this full‑time position is between $141,000 - $244,000 + bonus + equity + benefits. Your recruiter can provide more about the specific salary range for your targeted location during the hiring process.

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Research Scientist/Research Engineer, Multimodal Agents employer: The Rundown AI, Inc.

At Google DeepMind, we pride ourselves on being an exceptional employer, offering a collaborative and innovative work culture that empowers our team of scientists and engineers to push the boundaries of artificial intelligence. Located in Ohio, we provide competitive salaries, equity options, and comprehensive benefits, alongside ample opportunities for professional growth and impactful contributions to both internal projects and the wider scientific community. Join us to be part of a mission-driven team dedicated to advancing AI for public benefit while ensuring safety and ethics remain at the forefront of our work.
T

Contact Detail:

The Rundown AI, Inc. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Research Scientist/Research Engineer, Multimodal Agents

✨Tip Number 1

Network like a pro! Reach out to people in the industry, especially those at Google DeepMind. A friendly chat can open doors and give you insights that might just set you apart from the competition.

✨Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repository showcasing your projects related to machine learning and computer vision. This is your chance to demonstrate your expertise and creativity in action.

✨Tip Number 3

Practice makes perfect! Get ready for technical interviews by solving problems on platforms like LeetCode or HackerRank. Brush up on your Python and JAX/PyTorch skills to ensure you're ready to impress.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in being part of the Google DeepMind team. Don’t miss out!

We think you need these skills to ace Research Scientist/Research Engineer, Multimodal Agents

Machine Learning
Large Language Models (LLMs)
Computer Vision
Software Engineering
Python
JAX
PyTorch
Distributed Data Pipeline
C++
Research Publication
Scalable Machine Learning Systems
Multimodal Understanding
Collaboration
Performance Analysis
Innovation

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to highlight your experience in machine learning and computer vision. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects and publications!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about AI and how your background makes you a perfect fit for our team. Let us know what excites you about working on multimodal agents.

Showcase Your Research: If you have publications, make sure to mention them! We love seeing your contributions to the scientific community. Highlight any work related to LLMs or computer vision, as this will catch our eye.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re serious about joining our amazing team at Google DeepMind.

How to prepare for a job interview at The Rundown AI, Inc.

✨Know Your Stuff

Make sure you brush up on your machine learning and computer vision knowledge. Be ready to discuss your previous research, especially any publications you've contributed to. This is your chance to showcase your expertise in LLMs and multimodal models!

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific challenges you've faced in your past projects and how you tackled them. Google DeepMind values innovative solutions, so think about how you can demonstrate your ability to design and implement robust systems.

✨Collaborate Like a Pro

Since the role involves working with cross-functional teams, be ready to talk about your experience collaborating with others. Highlight any successful projects where teamwork led to impactful results, as this will show you're a great fit for their collaborative culture.

✨Stay Current with AI Trends

Familiarise yourself with the latest advancements in AI, particularly in multimodal learning and agent systems. Being able to discuss recent research or breakthroughs will not only impress your interviewers but also show your passion for the field.

Research Scientist/Research Engineer, Multimodal Agents
The Rundown AI, Inc.
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

T
  • Research Scientist/Research Engineer, Multimodal Agents

    Full-Time
    100000 - 140000 ÂŁ / year (est.)
  • T

    The Rundown AI, Inc.

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>