At a Glance
- Tasks: Work on real-world robotic systems, focusing on perception and navigation.
- Company: Join Humanoid, a pioneering tech company redefining robotics.
- Benefits: Competitive pay, great food, and hands-on experience with experts.
- Other info: Exciting 12-week summer internship in London with excellent growth opportunities.
- Why this job: Dive into cutting-edge AI and robotics, making a tangible impact.
- Qualifications: Studying Computer Science, Machine Learning, or Robotics; strong ML and vision skills.
The predicted salary is between 20000 - 30000 £ per year.
Here at Humanoid, we believe in a future where robots amplify human potential. That’s why we’ve set out on a mission to build the world’s most capable, commercially-scalable, and safe humanoid robots. We’re bringing that mission to life with HMND‑01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we’re growing the team to take it even further.
Our Mission
We’re building software systems that enable robots to operate effectively in the real world expanding human capability and redefining how work gets done.
The Opportunity
We’re looking for interns who are curious, proactive, and excited to work on real-world robotic systems. This is an open-ended internship, you won’t be confined to a single component, but will work across perception, navigation, and multimodal systems, collaborating closely with the team to find where you can have the most impact. You may work anywhere along the stack, from camera systems (timestamping, synchronization, validation), through perception and scene understanding, to navigation and integration with locomotion. The scope is intentionally broad. We’re looking for people who are excited to dive into unfamiliar areas and learn quickly.
This is a full-time internship (5 days per week) over the summer (mid June - mid September), based in our London Paddington office, where you’ll contribute to real systems from early on with guidance and support from experienced researchers and engineers.
Duration: 12 weeks | Start date: June | Compensation: Competitive pay + we'll keep you fed (seriously, the food is good)
What you might work on
- Develop perception systems for robot navigation and interaction in real-world environments
- Work on focused problems within Vision-Language(-Action) or multimodal models (components, datasets, evaluation)
- Run and analyse experiments using existing pipelines
- Improve data quality through curation and labeling
- Explore scene understanding, 3D perception, or navigation methods and apply them to real systems
- Prototype ideas and iterate quickly with guidance
- Collaborate on integrating models into robotic platforms
What we’re looking for
- Pursuing a degree in Computer Science, Machine Learning, Robotics, or a related field.
- Strong foundations in machine learning and/or computer vision.
- Hands‑on experience with PyTorch and training ML models.
- Experience running experiments and interpreting results.
- Interest in multimodal models, 3D vision, spatial reasoning, navigation or embodied AI.
- Ability to take ownership and iterate with guidance.
- Strong problem‑solving skills and attention to detail.
- Fast learner, comfortable in a research‑driven, fast‑moving environment.
How to apply
Complete the challenge below and submit your solution as a public GitHub repository — include a README with instructions to run your system, example outputs, and a short note on your design choices. You will be able to include your GitHub repository URL when you fill out the application form, alongside your name and CV. We’re not looking for standard solutions, we're looking for how you think. The strongest submissions are creative, original, and push beyond the obvious.
Intern Challenge: From Video to 3D Reconstruction
Build a system that takes a short video (e.g. captured on a phone), of a small indoor area such as a small room, and reconstructs a 3D scene. The core goal is geometric reconstruction from video. Semantic understanding is welcome, but optional.
At a minimum, your system should:
- Generate a 3D representation of the scene from video input
- Produce a reconstruction that is geometrically coherent and consistent
Optional extensions:
- Assign semantic labels in 3D (e.g. tables, chairs)
- Ensure any semantic predictions are aligned with the underlying geometry
There are no constraints on real‑time performance, We’re intentionally leaving the approach open, use any tools, models, frameworks, or agentic workflows you find effective.
What to submit
- A working codebase
- Clear instructions on how to run your system
- Example input(s) and output(s)
- (Optional) A short note explaining your design choices and tradeoffs
What we care about
- Simplicity and usability of your solution
- Creativity in approach
- Quality of 3D scene reconstruction
- Clear, compelling presentation of results
- Coherence between geometry and semantics
Make something you’re proud of.
Internship - Perception and Spatial AI employer: Humanoid
Contact Detail:
Humanoid Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Internship - Perception and Spatial AI
✨Tip Number 1
Get your hands dirty with the tech! Dive into projects that showcase your skills in machine learning and computer vision. Build something cool, like a 3D reconstruction system, and make sure to share it on GitHub. This shows us your creativity and problem-solving skills!
✨Tip Number 2
Networking is key! Connect with professionals in the robotics field through LinkedIn or local meetups. Don’t be shy—reach out to people at Humanoid and ask about their work. A friendly chat can sometimes lead to unexpected opportunities!
✨Tip Number 3
Prepare for interviews by brushing up on your knowledge of perception systems and navigation methods. We love candidates who can discuss their thought process and approach to problem-solving. Practice explaining your projects clearly and confidently!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Make sure to include your GitHub link and highlight any unique aspects of your projects. We’re excited to see what you’ve been working on!
We think you need these skills to ace Internship - Perception and Spatial AI
Some tips for your application 🫡
Show Your Curiosity: When you're writing your application, let your curiosity shine through! Talk about what excites you about perception and spatial AI, and how you want to dive into the unknown. We love seeing candidates who are eager to learn and explore new areas.
Be Creative in Your Challenge: For the intern challenge, don’t just stick to the basics. We’re looking for creativity and originality in your approach. Think outside the box and show us how you can push beyond the obvious with your 3D reconstruction system. Make it something you’re proud of!
Clear Instructions Matter: Make sure your GitHub repository is easy to navigate. Include clear instructions on how to run your system and provide example inputs and outputs. We want to see your work in action without any hassle, so clarity is key!
Tailor Your CV: Don’t forget to tailor your CV to highlight relevant experience in machine learning, computer vision, or robotics. We want to see how your skills align with our mission. And remember, apply through our website to ensure your application gets to us directly!
How to prepare for a job interview at Humanoid
✨Know Your Stuff
Make sure you brush up on your knowledge of machine learning and computer vision. Familiarise yourself with the latest trends in perception and spatial AI, especially as they relate to robotics. Being able to discuss these topics confidently will show that you're genuinely interested in the field.
✨Show Off Your Projects
Bring examples of your work, especially any projects related to 3D reconstruction or multimodal models. If you've worked on something similar to the internship challenge, be ready to explain your thought process and the challenges you faced. This will demonstrate your hands-on experience and problem-solving skills.
✨Ask Smart Questions
Prepare thoughtful questions about the team’s current projects or the technologies they use. This not only shows your enthusiasm but also helps you gauge if the company is the right fit for you. Questions about their approach to integrating models into robotic platforms can spark engaging discussions.
✨Be Ready to Learn
Since this internship is all about diving into unfamiliar areas, express your eagerness to learn and adapt. Share examples of how you've quickly picked up new skills or tackled challenging problems in the past. This will highlight your proactive attitude and readiness to contribute to the team.