Machine Learning Infrastructure Engineer
Machine Learning Infrastructure Engineer

Machine Learning Infrastructure Engineer

City of London Full-Time 72000 - 104000 £ / year (est.) No home office possible
Go Premium
Harnham

At a Glance

  • Tasks: Design and deploy scalable ML infrastructure for cutting-edge AI projects.
  • Company: Join a pioneering AI startup redefining 3D content generation.
  • Benefits: Competitive salary, flexible pay for strong candidates, and a dynamic work environment.
  • Why this job: Shape the future of AI and work on groundbreaking technology.
  • Qualifications: Experience in building ML infrastructure and cloud architecture from scratch.
  • Other info: Be part of a small, fast-moving team with significant growth potential.

The predicted salary is between 72000 - 104000 £ per year.

Job Description

ML & Cloud Infrastructure Engineer

London

Up to £150000

About the Role

A cutting-edge AI start-up is pioneering the development of frontier 3D foundation models, pushing the boundaries of computer vision and spatial computing. Their mission is to redefine how industries such as robotics, AR/VR, gaming, and film generate and interact with 3D content.

They are now seeking an ML & Cloud Infrastructure Engineer to join their growing team.

This is a unique opportunity to work at the forefront of AI innovation, building the infrastructure that underpins complex ML workloads and production systems. You’ll play a central role in scaling the company’s platforms and ensuring their pioneering technology reaches its full potential.

Key Responsibilities,

  • Develop and maintain scalable, high-performance cloud-based infrastructure for ML workloads and API deployment.
  • Manage and optimize cloud platforms (AWS, Azure, GCP) and set up ML nodes for local and distributed training.
  • Install, configure, and monitor servers, ensuring system reliability.
  • Design and optimize storage solutions for large-scale ML datasets.
  • Manage containerized applications with Docker, Kubernetes, Terraform, and related tools.
  • Collaborate with ML engineers and researchers to ensure seamless orchestration of training and production environments.
  • Troubleshoot and respond to cloud/production incidents, implementing long-term solutions.

What We’re Looking For

  • At least 3 years of professional experience in a cloud-related engineering role (ML-related experience highly desirable).
  • Strong scripting skills (Bash, PowerShell, Python, etc.) for automation.
  • Proven expertise in at least one major cloud platform (AWS, GCP, or Azure).
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Ability to manage and optimize large-scale cloud infrastructure.
  • Familiarity with Python (Jupyter) and ML frameworks (e.g., PyTorch).
  • Experience with cloud monitoring tools (Prometheus, Grafana).
  • Exposure to cloud-based databases (RDS, Aurora, Spanner, etc.) and data-visualisation tools.
  • Knowledge of CI/CD tools (e.g., CircleCI).

Machine Learning Infrastructure Engineer employer: Harnham

At SpAItial, we offer an exceptional opportunity for a Machine Learning Infrastructure Engineer to join a dynamic and innovative team at the forefront of AI technology. Our collaborative work culture fosters creativity and growth, providing employees with the chance to shape groundbreaking 3D generative AI systems while enjoying competitive salaries and a supportive environment in vibrant London. With significant funding dedicated to compute resources, you will have the tools and freedom to build impactful solutions that redefine industry standards.
Harnham

Contact Detail:

Harnham Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Machine Learning Infrastructure Engineer

✨Tip Number 1

Network like a pro! Reach out to people in the industry, attend meetups, and connect with potential colleagues on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to ML infrastructure. This is your chance to demonstrate your expertise in building cloud systems from scratch, so make it shine!

✨Tip Number 3

Prepare for technical interviews by brushing up on relevant technologies like Docker, Kubernetes, and cloud platforms. Practice coding challenges and system design questions to show you can handle the demands of the role.

✨Tip Number 4

Apply through our website! We love seeing candidates who take the initiative. Tailor your application to highlight your experience with scalable cloud infrastructure and ML systems, and let us know why you're excited about shaping the future of AI at SpAItial.

We think you need these skills to ace Machine Learning Infrastructure Engineer

Cloud Infrastructure Design
ML Workloads Management
GPU Cluster Management
Storage Systems Management
Distributed Training Environments
Containerisation (Docker, Kubernetes)
Infrastructure as Code (Terraform)
Monitoring and Incident Response
CI/CD Practices
Collaboration with Researchers
Python Programming
Experience with AWS/GCP/Azure
Scalability Optimisation
High-Performance System Design

Some tips for your application 🫡

Show Your Passion for AI: When writing your application, let your enthusiasm for AI and machine learning shine through. We want to see that you’re not just looking for a job, but that you’re genuinely excited about shaping the future of 3D generative AI with us.

Highlight Relevant Experience: Make sure to detail your experience in building cloud and ML systems from scratch. We’re keen on seeing specific examples of your work with GPU clusters, containerised workflows, and any other relevant tech you've tackled. This is your chance to show us what you can bring to the table!

Tailor Your Application: Don’t just send a generic application! Tailor your CV and cover letter to reflect the key responsibilities and skills mentioned in our job description. We love it when candidates take the time to connect their experiences directly to what we’re looking for.

Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications better and ensures you don’t miss out on any important updates. Plus, it’s super easy to do!

How to prepare for a job interview at Harnham

✨Know Your Tech Stack

Make sure you’re well-versed in the tech stack mentioned in the job description. Brush up on AWS, GCP, Azure, Kubernetes, Docker, and Terraform. Be ready to discuss your hands-on experience with these tools and how you've used them to build ML infrastructure from scratch.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of challenges you've faced while building cloud and ML systems. Highlight your thought process and the steps you took to overcome obstacles. This will demonstrate your ability to think critically and adapt in a fast-paced environment.

✨Understand the Company’s Vision

Research SpAItial and its mission to redefine 3D content generation. Familiarise yourself with their projects and be ready to discuss how your skills can contribute to their goals. Showing genuine interest in their work can set you apart from other candidates.

✨Prepare for Technical Questions

Expect technical questions that assess your knowledge of scalable cloud infrastructure and ML workloads. Practice explaining complex concepts clearly and concisely. You might also want to prepare for practical tests or scenarios that require you to design a system on the spot.

Machine Learning Infrastructure Engineer
Harnham
Location: City of London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>