Senior Research Engineer - Multimodal & Video Foundation Model
Senior Research Engineer - Multimodal & Video Foundation Model

Senior Research Engineer - Multimodal & Video Foundation Model

Full-Time 36000 - 60000 £ / year (est.) No home office possible
T

At a Glance

  • Tasks: Drive innovation in AI model architecture and develop cutting-edge multimodal systems.
  • Company: Join a leading tech company at the forefront of AI research.
  • Benefits: Full-time role with competitive salary and opportunities for professional growth.
  • Why this job: Be part of groundbreaking research that shapes the future of AI technology.
  • Qualifications: Bachelor's degree in a technical field and expertise in Python & PyTorch required.
  • Other info: Collaborative environment with potential for impactful contributions to real-world applications.

The predicted salary is between 36000 - 60000 £ per year.

Overview

Senior Research Engineer – Multimodal & Video Foundation Model

As a member of the AI model team, you will drive innovation in architecture development for cutting-edge models of various scales, including small, large, and multi-modal systems. Your work will enhance intelligence, improve efficiency, and introduce new capabilities to advance the field.

Responsibilities

  • Pioneer multimodal and video-centric research that moves fast and breaks ground, contributing directly to usable prototypes and scalable systems.
  • Design and implement novel AI architectures for multimodal language models, integrating text, visual, and audio modalities.
  • Engineer scalable training and inference pipelines optimized for large-scale multimodal datasets and distributed GPU systems across thousands of GPUs.
  • Optimize systems and algorithms for efficient data processing, model execution, and pipeline throughput.
  • Build modular tools for preprocessing, analyzing, and managing multimodal data assets (e.g., images, video, text).
  • Collaborate cross-functionally with research and engineering teams to translate cutting-edge model innovations into production-grade solutions.
  • Prototype generative AI applications showcasing new capabilities of multimodal foundation models in real-world products.
  • Develop benchmarking tools to rigorously evaluate model performance across diverse multimodal tasks.

Qualifications

  • Bachelor’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience
  • Expertise in Python & PyTorch, including practical experience working with the full development pipeline from data processing & data loading to training, inference, and optimization.
  • Experience working with large-scale text data, or (bonus) interleaved data spanning audio, video, image, and/or text.
  • Direct hands-on experience in developing or benchmarking at least one of the following topics: LLMs, Vision Language Models, Audio Language Models, generative video models

Nice to have skills

  • PhD in Computer Vision, Machine Learning, NLP, Computer Science, Applied Statistics, or a closely related field
  • Demonstrated expertise in computer vision, video generation foundation model and/or multimodal research.
  • First-author publications at leading AI conferences such as CVPR, ICCV, ECCV, ICML, ICLR, NeurIPS etc.

Important information for candidates

  • Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles: Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page: https://tether.recruitee.com/
  • Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.
  • Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms.
  • Double-check email addresses. All communication from us will come from emails ending in @tether.to or @tether.io
  • We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately.

Job details

  • Seniority level: Not Applicable
  • Employment type: Full-time
  • Job function: Information Technology
  • Industries: Technology, Information and Internet

#J-18808-Ljbffr

Senior Research Engineer - Multimodal & Video Foundation Model employer: Tether.io

At Tether, we pride ourselves on being an exceptional employer, fostering a culture of innovation and collaboration in the heart of the tech industry. Our commitment to employee growth is evident through continuous learning opportunities and the chance to work on groundbreaking projects that shape the future of AI. With a focus on cutting-edge research and a supportive environment, we empower our team members to push boundaries and achieve their full potential.
T

Contact Detail:

Tether.io Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Research Engineer - Multimodal & Video Foundation Model

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to multimodal and video models. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common interview questions and be ready to discuss your past projects in detail. Confidence is key!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re serious about joining our team at StudySmarter.

We think you need these skills to ace Senior Research Engineer - Multimodal & Video Foundation Model

AI Architecture Development
Multimodal Research
Python
PyTorch
Data Processing
Training and Inference Optimization
Large-Scale Dataset Management
Generative AI Applications
Computer Vision
Machine Learning
NLP
Benchmarking Tools Development
Collaboration Skills
Prototyping

Some tips for your application 🫡

Show Off Your Skills: Make sure to highlight your expertise in Python and PyTorch. We want to see how you've tackled the full development pipeline, so share specific examples of your work with data processing, training, and optimisation.

Tailor Your Application: Don’t just send a generic CV! Tailor your application to reflect the responsibilities and qualifications listed in the job description. We love seeing how your experience aligns with our needs, especially in multimodal and video-centric research.

Be Clear and Concise: Keep your application clear and to the point. We appreciate well-structured documents that make it easy for us to see your qualifications and experiences without wading through unnecessary fluff.

Apply Through Our Website: Remember to apply through our official careers page! This ensures your application reaches us directly and helps you avoid any recruitment scams. We’re excited to see what you bring to the table!

How to prepare for a job interview at Tether.io

✨Know Your Models

Make sure you’re well-versed in the latest advancements in multimodal and video-centric models. Brush up on your knowledge of LLMs, Vision Language Models, and generative video models. Being able to discuss these topics confidently will show that you're not just familiar with the field but are genuinely passionate about it.

✨Showcase Your Projects

Prepare to discuss specific projects where you've implemented AI architectures or worked with large-scale datasets. Bring examples of your work, especially if they involve Python and PyTorch. This will help demonstrate your hands-on experience and problem-solving skills in real-world scenarios.

✨Collaborate and Communicate

Since this role involves cross-functional collaboration, be ready to talk about your experiences working with different teams. Highlight how you’ve effectively communicated complex ideas to non-technical stakeholders. This will illustrate your ability to bridge gaps between research and engineering.

✨Ask Insightful Questions

Prepare thoughtful questions about the company’s current projects and future directions in AI. This shows your interest in their work and helps you gauge if the company aligns with your career goals. Plus, it gives you a chance to engage in a meaningful conversation during the interview.

Senior Research Engineer - Multimodal & Video Foundation Model
Tether.io

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>