Multimodal AI Intern

Multimodal AI Intern

Full-Time No home office possible
M

Job Description

JOB AD:

Multimodal AI Intern (Audio-Visual AI) – PhD Level

  • Staines-upon-ThamesJoin a leading AI research team driving innovation in next-generation mobile technologies.
  • We're looking for a PhD-level intern or recent graduate to work on cutting-edge audio-visual AI solutions that will shape the future of smart, on-device intelligence.
  • You'll collaborate with world-class researchers and engineers, helping turn novel machine learning concepts into production-ready software for intelligent mobile platforms.
  • What You'll Do:Develop and prototype innovative solutions in multimodal on-device AI (audio + video).
  • Research and implement methods such as contrastive learning, model compression, or multimodal LLMs.
  • Tackle real-world challenges with efficient, scalable code using PyTorch or TensorFlow.Work within a high-impact team and contribute to research publications and internal reports.

What We're Looking For:

  • PhD student or recent graduate in ML/AI, Computer Science, Engineering, or a related field.First-author publications in top AI/ML venues (CVPR, NeurIPS, ICML, ICLR, etc.).
  • Strong skills in Python and/or C/C++, and hands-on experience with modern ML frameworks.Familiarity with Git and sound software engineering practices.
  • Excellent communication and problem-solving abilities.

Bonus Points For:

  • Experience in emotion recognition, foundational face models, or deception detection.
  • Knowledge of multi-task learning, embedded AI, or distributed ML systems.
  • Contributions to open-source ML libraries.Expertise in AI pipeline optimization and profiling.
M

Contact Detail:

Microtech Global Ltd Recruiting Team

M
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>