Principal Software Engineer, ML Platform (Stability & Infrastructure) in London
Principal Software Engineer, ML Platform (Stability & Infrastructure)

Principal Software Engineer, ML Platform (Stability & Infrastructure) in London

London Full-Time 80000 - 100000 £ / year (est.) Home office (partial)
Isomorphic Labs Limited

At a Glance

  • Tasks: Lead the reliability of AI systems to cure diseases with cutting-edge technology.
  • Company: Isomorphic Labs, a pioneering biotech firm focused on revolutionary drug discovery.
  • Benefits: Competitive salary, hybrid work model, and opportunities for professional growth.
  • Why this job: Join a mission-driven team making real-world impacts in healthcare through AI.
  • Qualifications: Experience in large-scale AI/ML workloads and cloud infrastructure.
  • Other info: Collaborative culture that values diverse perspectives and encourages innovation.

The predicted salary is between 80000 - 100000 £ per year.

Isomorphic Labs is applying frontier AI to help unlock deeper scientific insights, faster breakthroughs, and life-changing medicines with an ambition to solve all disease. The future is coming. A future enabled and enriched by the incredible power of machine learning. A future in which diseases are curtailed or cured starting with better and faster drug discovery. Come and be part of an interdisciplinary team driving groundbreaking innovation and play a meaningful role in contributing towards us achieving our ambitious goals, while being a part of an inspiring and collaborative culture.

About Iso

Isomorphic Labs (IsoLabs) was launched in 2021 to advance human health by building on and beyond the Nobel-winning AlphaFold system. Since then, our interdisciplinary team of drug discovery experts and machine learning specialists has built powerful new predictive and generative AI models that accelerate scientific discovery at digital speed. Our name comes from the belief that there is an underlying symmetry between biology and information science. By harnessing AI’s powerful capabilities, we can use it to model complex biological phenomena to help design novel molecules, anticipate how drugs will perform and develop innovative medicines to treat and cure some of the world’s most devastating diseases. We have built a world-leading drug design engine comprising AI models that are capable of working across multiple therapeutic areas and drug modalities. We are continually innovating on model architecture and developing cutting-edge capabilities to advance rational drug design. Every day, and with each new breakthrough, we’re getting closer to the promise of digital biology, and achieving our ambitious mission to one day solve all disease with the help of AI.

Your Impact

We are building the largest foundation models in biotech and applying them immediately to cure disease. You will play a pivotal role in ensuring the reliability and scalability of the foundations that make this possible. As a Principal Engineer, you will lead the efforts to harden our systems, ensuring our groundbreaking AI is built on an unshakeable base, working closely with the research team and the Applied ML teams to ensure the infrastructure is stable, reliable and can operate with more data and larger models as we grow.

What You Will Do

  • You will own the end-to-end strategy for platform reliability, with a specific focus on our accelerator (GPU/TPU) infrastructure and workload orchestration.
  • You will move between high-level architectural design and hands-on systems engineering to eliminate friction in the researcher experience.
  • Lead the reliability work for our global job scheduler.
  • You will design and implement a robust "test harness" to safely validate infrastructure upgrades without impacting live research.
  • Architect and optimize our next-generation inference services.
  • You will solve core scaling limits, ensuring high-throughput performance and feature parity across our model serving stack.
  • Overhaul our logging and monitoring systems to provide radical visibility.
  • You will build proactive alerting and telemetry that identifies systemic failures before they impact research workflows.
  • Improve our internal CI/CD stability, targeting a significant reduction in failure rates and significantly faster feedback loops for the engineering organization.
  • Contribute to core technical decisions on tooling and architectural design while partnering with science, product, and operations teams to align infrastructure with biotech R&D cycles.

Skills and Qualifications

  • Proven experience in architecting and managing large-scale AI/ML workloads in a production environment.
  • Expertise in cloud compute design, specifically within Google Cloud Platform (GCP).
  • Orchestration: Significant experience deploying and managing complex workloads within Kubernetes (GKE).
  • Professional familiarity with NVIDIA GPU generations and the intricacies of high-performance compute.
  • Strong programming skills and a "reliability-first" approach to software development.

Nice to Have

  • A career history that spans both ML Software Engineering and Infrastructure SRE roles.
  • Experience leading multi-disciplinary projects and navigating complex stakeholder requirements in a fast-paced environment.
  • Familiarity with workload scheduling, ML efficiency research, and hardware benchmarking.
  • Experience with Google TPU generations and specialized ML-driven R&D cycles.

Culture and values

  • Thoughtful: Thoughtful at Iso is about curiosity, creativity and care. It is about good people doing good, rigorous and future-making science every single day.
  • Brave: Brave at Iso is about fearlessness, but it’s also about initiative and integrity. The scale of the challenge demands nothing less.
  • Determined: Determined at Iso is the way we pursue our goal. It’s a confidence in our hypothesis, as well as the urgency and agility needed to deliver on it. Because disease won’t wait, so neither should we.
  • Together: Together at Iso is about connection, collaboration across fields and catalytic relationships. It’s knowing that transformation is a group project, and remembering that what we’re doing will have a real impact on real people everywhere.

Creating an extraordinary company

We believe that to be successful we need a team with a range of skills and talents. We're building an environment where collaboration is fundamental, learning is shared and every employee feels supported and able to thrive. We value unique experiences, knowledge, backgrounds, and perspectives, and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy or related condition (including breastfeeding) or any other basis protected by applicable law.

If you have a disability or additional need that requires accommodation, please do not hesitate to let us know. It’s hugely important for us to share knowledge and build strong relationships with each other, and we find it easier to do this if we spend time together in person. This is why we follow a hybrid model, and would require you to be able to come into the office 3 days a week (currently Tuesday, Wednesday, and one other day depending on which team you’re in). If you have additional needs that would prevent you from following this hybrid approach, we’d be happy to talk through these if you’re selected for an initial screening call.

Please note that when you submit an application, your data will be processed in line with our privacy policy.

Principal Software Engineer, ML Platform (Stability & Infrastructure) in London employer: Isomorphic Labs Limited

Isomorphic Labs is an exceptional employer, offering a collaborative and innovative work culture that empowers employees to contribute to groundbreaking advancements in drug discovery through AI. With a strong commitment to employee growth, the company provides opportunities for professional development while fostering a supportive environment where diverse perspectives are valued. Located in London, IsoLabs promotes a hybrid work model, ensuring a balance between in-person collaboration and flexibility, making it an ideal place for those seeking meaningful and impactful careers in biotechnology.
Isomorphic Labs Limited

Contact Detail:

Isomorphic Labs Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Principal Software Engineer, ML Platform (Stability & Infrastructure) in London

✨Tip Number 1

Network like a pro! Reach out to people in the industry, attend meetups, and connect with current employees at Isomorphic Labs. A personal connection can make all the difference when it comes to landing that interview.

✨Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repository showcasing your projects, especially those related to AI/ML. This is your chance to demonstrate your expertise and passion for the field.

✨Tip Number 3

Ace the interview by being prepared! Research Isomorphic Labs, understand their mission, and be ready to discuss how your experience aligns with their goals. Practice common technical questions and be ready to showcase your problem-solving skills.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our team at Isomorphic Labs.

We think you need these skills to ace Principal Software Engineer, ML Platform (Stability & Infrastructure) in London

Architecting large-scale AI/ML workloads
Managing production environments
Cloud compute design
Google Cloud Platform (GCP)
Kubernetes (GKE)
NVIDIA GPU generations
High-performance compute
Programming skills
Reliability-first approach to software development
Workload scheduling
ML efficiency research
Hardware benchmarking
Google TPU generations
Multi-disciplinary project leadership
Stakeholder management

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter for the Principal Software Engineer role. Highlight your experience with AI/ML workloads and cloud compute design, especially if you've worked with Google Cloud Platform or Kubernetes.

Showcase Your Impact: When detailing your past experiences, focus on the impact you made in previous roles. Use metrics where possible to demonstrate how your contributions improved system reliability or performance.

Be Authentic: Let your personality shine through in your application. We value curiosity, creativity, and a collaborative spirit, so don’t hesitate to share what drives you and how you align with our culture.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity at Isomorphic Labs!

How to prepare for a job interview at Isomorphic Labs Limited

✨Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, especially around AI/ML workloads and cloud computing. Brush up on your knowledge of Google Cloud Platform and Kubernetes, as these will likely come up during technical discussions.

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific challenges you've faced in previous roles, particularly those related to system reliability and scalability. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight how you tackled complex issues.

✨Align with Their Culture

Isomorphic Labs values thoughtfulness, bravery, determination, and collaboration. Think of examples from your past experiences that demonstrate these qualities. Be ready to explain how you can contribute to their mission of advancing human health through innovative science.

✨Ask Insightful Questions

Prepare thoughtful questions that show your interest in the role and the company’s goals. Inquire about their current projects, team dynamics, or how they measure success in their infrastructure initiatives. This not only shows your enthusiasm but also helps you gauge if the company is the right fit for you.

Principal Software Engineer, ML Platform (Stability & Infrastructure) in London
Isomorphic Labs Limited
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>