At a Glance
- Tasks: Join our AI Inference team to optimise cutting-edge models and improve performance.
- Company: Exciting AI startup with rapid growth and innovative projects.
- Benefits: Gain valuable experience, potential full-time offer, and work in a hybrid London office.
- Why this job: Make an impact in AI while developing your skills in a dynamic environment.
- Qualifications: Master's or PhD in Computer Science, strong programming skills, and experience with ML frameworks.
- Other info: Limited spots available; outstanding performers may receive full-time offers.
The predicted salary is between £500 and £1,500 per month.
Perplexity is excited to announce the Internship Program for exceptional Master's or PhD students studying Computer Science or Engineering in the UK, enrolled in the academic year. This is an intensive program in which you will work directly with our AI Inference team. This program offers a unique opportunity to gain valuable experience in a rapidly growing AI startup. Outstanding performers might be offered a full-time position at the end of the program.
Our AI Inference team is responsible for running the models behind the Perplexity products. The team maintains the inference engine and the deployments behind models ranging from single-node embeddings to distributed sparse Mixture-of-Experts models, and it operates large GPU clusters. With a keen focus on latency and throughput, the Inference team is responsible for the entire serving stack, from GPU kernels to networking and monitoring infrastructure.
Responsibilities
- Work with the inference team to improve serving latency and throughput
- Bring up support for new models and state-of-the-art inference optimizations or quantization schemes
- Optimize inference across the entire stack, from GPU kernels to serving endpoints
Qualifications
- Strong engineering track record with proven knowledge of fundamentals and programming languages (multi-threaded programming, networking, compilation, systems programming, etc.)
- Pursuing a Master's or PhD in Computer Science with a focus on performance-related subjects (HPC, Compilers, Distributed Systems)
- Experience with ML frameworks (Torch, JAX)
- Experience with GPU programming (CUDA, Triton)
- Experience with High-Performance Computing (OpenMPI)
Schedule
- Internship program: 13 weeks, full-time or part-time, in person at the London office (hybrid schedule: 3 days in the office, 2 days WFH)
Interview Process
- Fill out the application on the Perplexity website.
- If selected, you will be invited to People Ops and technical interviews.
- Offer. We're impressed! We'd love to welcome you to our Internship program.
- Start. We have a desk waiting for you in our London office.
FAQ
- Do you sponsor visas? Can I apply if I need a visa to work in the UK? Unfortunately, we are unable to sponsor visas.
- What if I'm on a student visa? You need to seek approval from your University (to determine if you are eligible to work full time or part time only).
- How many internship spots are there? We have spots for 2-3 interns in our 2025 class.
- Is housing provided? Unfortunately we cannot provide housing.
- Is health insurance provided? Unfortunately we cannot provide health insurance for interns. Full-time employees receive full health insurance and benefits.
- How many full-time offers are available at the end of the internship? There is no limit. All outstanding performers will be given a full-time offer.
UK Internship Program in London employer: Perplexity
Contact Detail:
Perplexity Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the UK Internship Program in London
✨ Tip Number 1
Network like a pro! Reach out to current or past interns at Perplexity on LinkedIn. Ask them about their experiences and any tips they might have for the interview process. It's all about making connections, and who knows, they might even put in a good word for you!
✨ Tip Number 2
Prepare for technical interviews by brushing up on your coding skills. Use platforms like LeetCode or HackerRank to practice problems related to multi-threaded programming and systems design. The more comfortable you are with these concepts, the better you'll perform!
✨ Tip Number 3
Showcase your projects! If you've worked on any relevant projects, especially those involving ML frameworks or GPU programming, make sure to highlight them during your interviews. Bring along a portfolio or GitHub link to demonstrate your skills in action.
✨ Tip Number 4
Don't forget to apply through our website! It's the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the team at Perplexity. Let's get you that internship!
Some tips for your application 🫡
Show Your Passion: When you're writing your application, let your enthusiasm for AI and computer science shine through. We want to see that you're genuinely excited about the opportunity to work with our AI Inference team and contribute to cutting-edge projects.
Tailor Your CV: Make sure your CV is tailored to highlight relevant experience and skills that match the internship requirements. We love seeing how your background in programming, ML frameworks, or GPU programming aligns with what we do at Perplexity.
Craft a Compelling Cover Letter: Your cover letter is your chance to tell us why you're the perfect fit for this internship. Be specific about your achievements and how they relate to the role. We appreciate a personal touch, so don't be afraid to share your story!
Apply Through Our Website: Don't forget to submit your application through our website! It's the best way for us to receive your materials and ensures you're considered for the position. Plus, it's super easy to do!
How to prepare for a job interview at Perplexity
✨ Know Your Stuff
Make sure you brush up on your knowledge of multi-threaded programming, networking, and systems programming. Be ready to discuss your experience with ML frameworks like Torch or JAX, and don't forget to highlight any GPU programming skills you have with CUDA or Triton.
✨ Showcase Your Projects
Prepare to talk about specific projects you've worked on that relate to high-performance computing or distributed systems. Bring examples that demonstrate your problem-solving skills and how you've optimised performance in previous roles or studies.
✨ Ask Smart Questions
During the interview, ask insightful questions about the AI Inference team's current challenges or the technologies they use. This shows your genuine interest in the role and helps you understand how you can contribute effectively.
✨ Practice Technical Interviews
Since technical interviews are part of the process, practice coding problems and system design questions relevant to the role. Use platforms like LeetCode or HackerRank to sharpen your skills and get comfortable with the types of questions you might face.