AI Inference Engineer: GPU-Driven, Rust & Python in London

AI Inference Engineer: GPU-Driven, Rust & Python in London

London Full-Time 60000 - 80000 € / year (est.) No home office possible
Deepstreamtech

At a Glance

  • Tasks: Enhance AI performance by building GPU kernels and developing a Rust-based server.
  • Company: Deepstreamtech, an innovative tech company in Greater London.
  • Benefits: Competitive salary, flexible working hours, and opportunities for professional growth.
  • Other info: Fast-paced environment with exciting projects and career advancement potential.
  • Why this job: Join a cutting-edge team and make a real impact in AI technology.
  • Qualifications: 3+ years in ML inference, deep learning frameworks, and GPU architecture skills.

The predicted salary is between 60000 - 80000 € per year.

Deepstreamtech, located in Greater London, is seeking an AI Inference Engineer to enhance the performance and optimization of their inference engine. The role involves building and managing GPU kernels, migrating them to NVIDIA's CuTe DSL, and developing a Rust-based server for their growing applications.

Candidates should have over 3 years of experience with ML inference, familiarity with deep learning frameworks, and strong skills in GPU architectures. This role is crucial for delivering high performance under tight latency and cost constraints.

AI Inference Engineer: GPU-Driven, Rust & Python in London employer: Deepstreamtech

Deepstreamtech is an exceptional employer that fosters a dynamic and innovative work culture in the heart of Greater London. With a strong emphasis on employee growth, we offer opportunities for professional development and collaboration on cutting-edge AI technologies. Our commitment to a supportive environment ensures that every team member can thrive while contributing to impactful projects in the rapidly evolving field of machine learning.

Deepstreamtech

Contact Detail:

Deepstreamtech Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land AI Inference Engineer: GPU-Driven, Rust & Python in London

Tip Number 1

Network like a pro! Reach out to folks in the AI and GPU space on LinkedIn or at meetups. We can’t stress enough how personal connections can open doors that applications alone can’t.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving Rust, Python, and GPU optimisations. We love seeing real-world applications of your expertise!

Tip Number 3

Prepare for technical interviews by brushing up on your ML inference knowledge and deep learning frameworks. We recommend doing mock interviews with friends or using online platforms to get comfortable.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we’re always on the lookout for passionate candidates who fit the bill.

We think you need these skills to ace AI Inference Engineer: GPU-Driven, Rust & Python in London

GPU Kernel Development
NVIDIA CuTe DSL
Rust Programming
Python Programming
Machine Learning Inference
Deep Learning Frameworks
GPU Architectures

Some tips for your application 🫡

Tailor Your CV:Make sure your CV highlights your experience with ML inference and GPU architectures. We want to see how your skills align with the role, so don’t be shy about showcasing relevant projects!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Tell us why you’re passionate about AI and how your background in Rust and Python makes you a perfect fit for our team. Keep it engaging and personal!

Showcase Your Projects:If you've worked on any cool projects involving deep learning frameworks or GPU kernels, make sure to mention them. We love seeing practical examples of your work that demonstrate your expertise!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates from our team!

How to prepare for a job interview at Deepstreamtech

Know Your Tech Inside Out

Make sure you’re well-versed in GPU architectures and the deep learning frameworks relevant to the role. Brush up on your Rust and Python skills, as you'll likely be asked to demonstrate your understanding of these languages during the interview.

Showcase Your Experience

Prepare specific examples from your past work that highlight your experience with ML inference and performance optimisation. Be ready to discuss how you've tackled tight latency and cost constraints in previous projects.

Understand Deepstreamtech's Needs

Research Deepstreamtech and their current projects. Understanding their products and challenges will help you tailor your answers and show that you're genuinely interested in contributing to their success.

Ask Insightful Questions

Prepare thoughtful questions about the team, the technology stack, and future projects. This not only shows your enthusiasm but also helps you gauge if the company is the right fit for you.