At a Glance
- Tasks: Develop innovative solutions to enhance multimodal AI capabilities in image and video.
- Company: Join Google DeepMind, a leader in advancing artificial intelligence for public benefit.
- Benefits: Competitive salary, bonuses, equity, and comprehensive benefits package.
- Why this job: Be at the forefront of AI research and make a real-world impact.
- Qualifications: PhD in machine learning or computer vision, with strong publication record.
- Other info: Collaborative environment with opportunities for career growth and scientific contribution.
The predicted salary is between £100,000 and £140,000 per year.
Artificial Intelligence could be one of humanity's most useful inventions. At Google DeepMind, we're a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
Our team is part of Google DeepMind (GDM) in the Frontier-AI unit. We specialize in multimodal foundational models, with a focus on image and video domains. We are looking for a research scientist to develop agentic solutions to improve the capabilities of multimodal models in GDM. Candidates must have strong machine learning skills, including experience in LLMs and computer vision. We also require competency in software engineering, which is needed to implement robust solutions at the scale at which we operate.
The Role: As a Research Scientist specializing in Multimodal Agents, you will be at the forefront of developing innovative agentic solutions to enhance the capabilities of Google DeepMind's foundational models, particularly within the image and video domains. This is an exciting opportunity to contribute directly to advancing the state of the art in artificial intelligence, working with cutting-edge technologies and a team of world-class experts. You will be instrumental in designing, implementing, and deploying robust machine learning solutions at scale, with a clear path to both internal and external impact through product integration and publications. This role offers a unique chance to shape the future of AI agents by pushing the boundaries of multimodal understanding and interaction.
Key responsibilities:
- Design and implement novel agentic solutions to enhance the capabilities of multimodal foundational models, specifically in image and video domains.
- Conduct cutting-edge research in machine learning, with a focus on large language models (LLMs) and computer vision, to drive advancements in multimodal understanding and interaction.
- Develop and deploy robust, scalable machine learning systems and prototypes that integrate effectively with Google DeepMind's existing infrastructure.
- Collaborate with cross-functional teams of scientists and engineers to translate research insights into impactful product features and publications.
- Analyze and evaluate the performance of agentic models, iterating on designs and approaches to continuously improve their effectiveness and efficiency.
- Stay abreast of the latest research and developments in AI, particularly in multimodal learning and agent systems, and contribute to the scientific community through publications and presentations.
About You:
In order to set you up for success as a Research Scientist/Research Engineer at Google DeepMind, we look for the following skills and experience:
- PhD in machine learning, computer vision or related field
- 3+ publications in top ML or vision conferences/journals
- Python experience
- JAX/PyTorch experience
In addition, the following would be an advantage:
- Distributed data pipeline experience (e.g., Apache Beam)
- C++ experience
- Experience developing LLM-based agents
The US base salary range for this full-time position is between $141,000 and $244,000 + bonus + equity + benefits. Your recruiter can provide more about the specific salary range for your targeted location during the hiring process.
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
Research Scientist/Research Engineer, Multimodal Agents in London employer: The Rundown AI, Inc.
Contact Detail:
The Rundown AI, Inc. Recruiting Team
StudySmarter Expert Advice
We think this is how you could land Research Scientist/Research Engineer, Multimodal Agents in London
Tip Number 1
Network like a pro! Reach out to people in the industry, attend conferences, and join online forums. The more connections you make, the better your chances of landing that dream job.
Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to multimodal models or AI. This lets you demonstrate your expertise beyond just a CV.
Tip Number 3
Prepare for interviews by practising common questions and discussing your past research. Be ready to explain complex concepts in simple terms, as communication is key in this field.
Tip Number 4
Don't forget to apply through our website! It's the best way to ensure your application gets noticed. Plus, it shows you're genuinely interested in being part of the Google DeepMind team.
We think you need these skills to ace Research Scientist/Research Engineer, Multimodal Agents in London
Some tips for your application
Tailor Your CV: Make sure your CV is tailored to highlight your experience in machine learning and computer vision. We want to see how your skills align with the role, so don't be shy about showcasing relevant projects or publications!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about multimodal agents and how your background makes you a perfect fit for our team. We love seeing enthusiasm and a clear understanding of our mission.
Showcase Your Research Impact: When listing your publications, focus on those that demonstrate your contributions to the field. We're interested in how your work has made an impact, so include any metrics or recognitions if possible!
Apply Through Our Website: Don't forget to apply through our website! It's the best way for us to receive your application and ensures you're considered for the role. Plus, it shows you're serious about joining our team at Google DeepMind!
How to prepare for a job interview at The Rundown AI, Inc.
Know Your Stuff
Make sure you brush up on your machine learning and computer vision knowledge. Be ready to discuss your experience with large language models and any relevant projects you've worked on. This is your chance to showcase your expertise, so don't hold back!
Showcase Your Research
Prepare to talk about your publications and how they relate to the role. Highlight any innovative solutions you've developed in the past, especially those that align with multimodal models. This will demonstrate your ability to contribute to cutting-edge research.
Collaborate Like a Pro
Since collaboration is key in this role, think of examples where you've successfully worked with cross-functional teams. Be ready to discuss how you translated research insights into practical applications, as this will show your ability to make an impact.
Stay Current
Keep yourself updated on the latest trends and advancements in AI, particularly in multimodal learning. Mention any recent papers or breakthroughs that excite you during the interview. This shows your passion for the field and commitment to continuous learning.