Internship - Perception and Spatial AI
Internship - Perception and Spatial AI

Internship - Perception and Spatial AI

Internship 20000 - 30000 £ / year (est.) No home office possible
T

At a Glance

  • Tasks: Work on real-world robotic systems, focusing on perception and navigation.
  • Company: Join Humanoid, a pioneering tech company redefining robotics.
  • Benefits: Competitive pay, great food, and hands-on experience with experts.
  • Other info: Exciting 12-week summer internship in London with excellent growth opportunities.
  • Why this job: Dive into cutting-edge AI and robotics, making a tangible impact.
  • Qualifications: Studying Computer Science, Machine Learning, or Robotics; strong ML and vision skills.

The predicted salary is between 20000 - 30000 £ per year.

Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially-scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we’re growing the team to take it even further.

Our Mission

We’re building software systems that enable robots to operate effectively in the real world expanding human capability and redefining how work gets done.

The Opportunity

We’re looking for interns who are curious, proactive, and excited to work on real-world robotic systems. This is an open-ended internship, you won’t be confined to a single component, but will work across perception, navigation, and multimodal systems, collaborating closely with the team to find where you can have the most impact. You may work anywhere along the stack, from camera systems (timestamping, synchronization, validation), through perception and scene understanding, to navigation and integration with locomotion. The scope is intentionally broad. We’re looking for people who are excited to dive into unfamiliar areas and learn quickly.

This is a full-time internship (5 days per week) over the summer (mid June - mid September), based in our London Paddington office, where you’ll contribute to real systems from early on with guidance and support from experienced researchers and engineers.

Duration: 12 weeks | Start date: June | Compensation: Competitive pay + we'll keep you fed (seriously, the food is good)

What you might work on

  • Develop perception systems for robot navigation and interaction in real-world environments
  • Work on focused problems within Vision-Language(-Action) or multimodal models (components, datasets, evaluation)
  • Run and analyse experiments using existing pipelines
  • Improve data quality through curation and labeling
  • Explore scene understanding, 3D perception, or navigation methods and apply them to real systems
  • Prototype ideas and iterate quickly with guidance
  • Collaborate on integrating models into robotic platforms

What we’re looking for

  • Pursuing a degree in Computer Science, Machine Learning, Robotics, or a related field.
  • Strong foundations in machine learning and/or computer vision.
  • Hands‑on experience with PyTorch and training ML models.
  • Experience running experiments and interpreting results.
  • Interest in multimodal models, 3D vision, spatial reasoning, navigation or embodied AI.
  • Ability to take ownership and iterate with guidance.
  • Strong problem‑solving skills and attention to detail.
  • Fast learner, comfortable in a research‑driven, fast‑moving environment.

How to apply

Complete the challenge below and submit your solution as a public GitHub repository — include a README with instructions to run your system, example outputs, and a short note on your design choices. You will be able to include your GitHub repository URL when you fill out the application form, alongside your name and CV. We’re not looking for standard solutions, we're looking for how you think. The strongest submissions are creative, original, and push beyond the obvious.

Intern Challenge: From Video to 3D Reconstruction

Build a system that takes a short video (e.g. captured on a phone), of a small indoor area such as a small room, and reconstructs a 3D scene. The core goal is geometric reconstruction from video. Semantic understanding is welcome, but optional.

At a minimum, your system should:

  • Generate a 3D representation of the scene from video input
  • Produce a reconstruction that is geometrically coherent and consistent

Optional extensions:

  • Assign semantic labels in 3D (e.g. tables, chairs)
  • Ensure any semantic predictions are aligned with the underlying geometry

There are no constraints on real‑time performance, We’re intentionally leaving the approach open, use any tools, models, frameworks, or agentic workflows you find effective.

What to submit

  • A working codebase
  • Clear instructions on how to run your system
  • Example input(s) and output(s)
  • (Optional) A short note explaining your design choices and tradeoffs

What we care about

  • Simplicity and usability of your solution
  • Creativity in approach
  • Quality of 3D scene reconstruction
  • Clear, compelling presentation of results
  • Coherence between geometry and semantics

Make something you’re proud of.

Internship - Perception and Spatial AI employer: Thehumanoid

At Humanoid, we foster a dynamic and innovative work culture where interns are empowered to explore diverse areas of perception and spatial AI within our cutting-edge robotics projects. Located in the vibrant London Paddington area, our internship offers competitive pay, excellent food, and invaluable mentorship from experienced professionals, ensuring that you not only contribute to real-world systems but also grow your skills in a supportive environment. Join us to be part of a mission that amplifies human potential through technology, while enjoying a collaborative atmosphere that encourages creativity and learning.
T

Contact Detail:

Thehumanoid Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Internship - Perception and Spatial AI

✨Tip Number 1

Get your hands dirty with the challenge! Dive into that internship task and show us how you think creatively. The more original your approach, the better your chances of standing out.

✨Tip Number 2

Network like a pro! Connect with current interns or employees at Humanoid on LinkedIn. Ask them about their experiences and any tips they might have. It’s all about making those connections!

✨Tip Number 3

Practice makes perfect! Brush up on your skills in machine learning and computer vision. Use platforms like StudySmarter to revise key concepts and get comfortable with PyTorch before the interview.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive and take the initiative.

We think you need these skills to ace Internship - Perception and Spatial AI

Machine Learning
Computer Vision
PyTorch
Experimentation and Results Interpretation
Multimodal Models
3D Vision
Spatial Reasoning
Navigation
Robotics
Problem-Solving Skills
Attention to Detail
Fast Learning
Collaboration
Prototyping

Some tips for your application 🫡

Show Your Curiosity: When you're writing your application, let your curiosity shine through! We want to see how excited you are about diving into the world of robotics and AI. Share any relevant projects or experiences that highlight your eagerness to learn and explore new areas.

Be Creative in Your Challenge: For the intern challenge, don’t just stick to the basics. We’re looking for creative solutions that push boundaries. Think outside the box and showcase your unique approach to the problem. Remember, it’s not just about the end result but also how you got there!

Keep It Clear and Concise: Make sure your README is easy to follow. We appreciate clarity, so include straightforward instructions on how to run your system and what outputs to expect. A well-organised submission can really make you stand out from the crowd!

Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. Plus, we love seeing everything come together in one place!

How to prepare for a job interview at Thehumanoid

✨Know Your Stuff

Make sure you brush up on your knowledge of machine learning and computer vision. Familiarise yourself with the latest trends in spatial AI and multimodal models. Being able to discuss these topics confidently will show that you're genuinely interested in the field.

✨Show Off Your Projects

Bring examples of your previous work, especially any projects related to 3D reconstruction or robotics. If you've worked with PyTorch or have run experiments, be ready to discuss your approach and the results. This will demonstrate your hands-on experience and problem-solving skills.

✨Ask Smart Questions

Prepare thoughtful questions about the internship and the team’s projects. Inquire about the challenges they face in perception systems or how they integrate models into robotic platforms. This shows your curiosity and eagerness to learn, which is exactly what they're looking for.

✨Be Yourself

Humanoid values creativity and originality, so don’t be afraid to let your personality shine through. Share your passion for robotics and how you envision contributing to their mission. Authenticity can set you apart from other candidates.

Internship - Perception and Spatial AI
Thehumanoid

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>