AI Research Engineer (Pre-training - LLM & Multi-Modal) in London

London Full-Time 70000 - 90000 £ / year (est.) No working from home possible

Apply Now

At a Glance

Tasks: Drive innovation in AI model architecture and enhance multi-modal systems.
Company: Leading AI research team focused on groundbreaking advancements.
Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
Other info: Collaborative environment with access to thousands of NVIDIA GPUs for impactful research.
Why this job: Join a cutting-edge team and push the boundaries of AI technology.
Qualifications: Degree in Computer Science; PhD preferred with experience in LLM and Multi-Modal pre-training.

The predicted salary is between 70000 - 90000 £ per year.

As a member of the AI model team, you will drive innovation in architecture development for cutting‑edge models of various scales, including small, large, and multi‑modal systems. Your work will enhance intelligence, improve efficiency, and introduce new capabilities to advance the field. You will have a deep expertise in Large Language Model (LLM) and Multi‑Modal architectures, a strong grasp of pre‑training optimization, and a hands‑on, research‑driven approach. Your mission is to explore and implement novel techniques and algorithms that lead to groundbreaking advancements: multi‑modal data curation and alignment, strengthening baselines, and identifying and resolving existing pre‑training bottlenecks to push the limits of cross‑modal AI performance.

Responsibilities

Large‑Scale Pre‑Training: Conduct foundational pre‑training for LLMs and Multi‑Modal models (integrating text, vision, audio, or other modalities) on large, distributed servers equipped with multi‑nodes and thousands of NVIDIA GPUs.
Architecture & Alignment Innovation: Design, prototype, and scale innovative architectures, tokenizers, and cross‑modal alignment layers to enhance model intelligence and multi‑modal understanding.
Data Strategy: Source, filter, and curate massive‑scale textual and multi‑modal datasets, establishing robust data pipelines for efficient pre‑training.
Experimental Research: Independently and collaboratively execute experiments, analyze results, and refine training methodologies for optimal performance and token efficiency.
Optimization & Debugging: Investigate, debug, and eliminate bottlenecks in model efficiency, computational performance, and multi‑modal alignment stability during long training runs.
System Scalability: Contribute to the advancement of distributed training systems to ensure seamless scalability and hardware efficiency on target platforms.

Qualifications

A degree in Computer Science or related field. Ideally a PhD in NLP, Machine Learning, or a related field, with a solid track record in AI R&D and publications in A* conferences.
Hands‑on experience contributing to large‑scale LLM or Multi‑Modal pre‑training runs on large, distributed servers equipped with thousands of NVIDIA GPUs, ensuring scalability and impactful advancements in model performance.
Familiarity and practical experience with large‑scale, distributed training frameworks, libraries, and tools.
Deep knowledge of state‑of‑the‑art transformer and non‑transformer modifications aimed at enhancing intelligence, efficiency, and scalability.
Strong expertise in PyTorch and Hugging Face libraries with practical experience in model development, continual pre‑training, and deployment.

AI Research Engineer (Pre-training - LLM & Multi-Modal) in London employer: Tether

As an AI Research Engineer at our innovative company, you will be part of a dynamic team dedicated to pushing the boundaries of AI technology in a collaborative and forward-thinking environment. We offer competitive benefits, a strong focus on employee development, and opportunities for meaningful contributions to groundbreaking projects in a location that fosters creativity and technological advancement. Join us to be at the forefront of AI research while enjoying a supportive work culture that values your expertise and growth.

Contact Details:

Tether Recruitment Team

View Tether profile

We think you need these skills to ace AI Research Engineer (Pre-training - LLM & Multi-Modal) in London

Large Language Model (LLM) expertise

Multi-Modal architecture knowledge

Pre-training optimization

Data curation and alignment

Experimental research execution

Model debugging and optimization

Distributed training frameworks

NVIDIA GPU utilisation

PyTorch proficiency

Hugging Face library experience

Token efficiency refinement

Scalability in distributed systems

Architecture design and prototyping

Data pipeline establishment

AI Research Engineer (Pre-training - LLM & Multi-Modal) in London

Tether

Location: London

Apply Now

AI Research Engineer (Pre-training - LLM & Multi-Modal) in London

At a Glance

AI Research Engineer (Pre-training - LLM & Multi-Modal) in London employer: Tether

We think you need these skills to ace AI Research Engineer (Pre-training - LLM & Multi-Modal) in London

Company

Product

Help