At a Glance
- Tasks: Design infrastructure for GPU/TPU strategy and manage cluster deployments.
- Company: Isomorphic Labs in London focuses on cutting-edge AI/ML technologies.
- Benefits: Competitive salary of £54,185 per year with opportunities for collaboration.
- Other info: Significant experience deploying in Kubernetes is required.
- Why this job: Join a diverse team to drive innovation in machine learning infrastructure.
- Qualifications: Experience with large-scale AI/ML workloads and cloud compute design, preferably GCP.
The predicted salary is between 60000 - 80000 £ per year.
Responsibilities
- Focus on end‑to‑end GPU/TPU (accelerator) strategy, designing infrastructure, optimizing performance, and integrating new hardware to leverage advancements.
- Work in partnership with the Machine Learning Platform team to push and support deployments, building, monitoring and managing cluster deployments.
- Support the technical strategy around hardware acquisition and deployment decisions.
- Drive research and efficiency design around the infrastructure up to the point of service to the ML platforms teams.
- Contribute to consistently improving the reliability of ML runs.
- Operate and handle research, development, and production cloud infrastructure and systems.
- Partner and collaborate with a diverse set of teams, including science, research, product, business development and operations.
- Contribute to core technical decisions (e.g., choice of tooling, infrastructure, and architectural design).
Qualifications
- Real world experience with large‑scale AI/ML workloads.
- Experience working in cloud compute infrastructure design, preferably GCP.
- Strong programming skills.
- Significant experience deploying in Kubernetes.
- Familiarity with NVIDIA GPU generations.
Nice to have
- Background in either ML SWE or infrastructure SRE work.
- Experience leading and delivering projects to multidisciplinary stakeholders.
- Familiarity with Google TPU generations.
- Familiarity with workload scheduling, machine learning efficiency research, ML‑driven R&D cycles, and hardware benchmarking.
Location: Isomorphic Labs London
Salary: £54,185 per year (estimated)
Software Engineer (HPC Platform), London in City of Westminster employer: Isomorphic Labs
Isomorphic Labs, located in London, is at the forefront of AI/ML technology. Employees benefit from a competitive salary and the opportunity to collaborate with multidisciplinary teams, enhancing their professional growth.
We think you need these skills to ace Software Engineer (HPC Platform), London in City of Westminster
Python
SQL
Data Engineering
Problem-Solving Skills
Automation
Data Pipeline Development
API Integration