At a Glance
- Tasks: Join us to build an AI-driven platform for groundbreaking biological discoveries.
- Company: Sequential, a leader in innovative AI solutions for biology.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Collaborative environment with significant autonomy and career advancement potential.
- Why this job: Make a real impact in healthcare by translating data into actionable insights.
- Qualifications: Strong Python skills and experience with ML systems; passion for biology is a plus.
The predicted salary is between 48000 - 84000 ÂŁ per year.
Sequential is building a next-generation AI-driven discovery platform to identify and design novel functional actives, including peptides and complex ingredient systems. The platform integrates large-scale biological datasets (>50,000 samples and measurements) spanning multi-omics data, microbiome sequencing, clinical and real-world outcomes. Our goal is to translate biological signals into actionable compound discovery and optimisation, powering a pipeline across: Discovery → Prediction → Design → Validation.
We are looking for a Senior Computational Biologist with ML Engineering background to help build, bridge, and functionalise the link between AI-powered biological discovery and real-world clinical outcomes. This role sits across biological discovery and scalable ML engineering. You will own key parts of the end-to-end architecture from data to model to evaluation to deployment, and work closely with ML engineers and software engineers to productise the platform into client-ready outputs. This is a senior role with significant autonomy and technical ownership.
The Data You Will Work With:
- The platform is built on a growing dataset of >50,000 biological samples and measurements, including paired pre- and post-treatment observations.
- The data includes multiple modalities such as microbiome sequencing (16S rRNA sequencing, ITS sequencing, shotgun metagenomics), Multi-omics (proteomics, lipidomics, metabolomics), Clinical and observational data (treatment exposure, formulation and ingredient combinations, clinical outcomes, patient metadata).
- Datasets include longitudinal measurements, enabling analysis of biological response to interventions (e.g., ingredient exposure, treatment, formulation).
1) Build the discovery engine (data → signal → candidate)
- Develop models that identify novel functional actives from multi-omic datasets.
- Detect patterns in biological signatures that correlate with clinical outcomes (e.g., inflammation reduction, microbiome restoration, barrier repair, malodour reduction).
- Create robust feature representations from:
- microbiome sequencing (16S/ITS/shotgun)
- gene expression / transcriptomics
- lipidomics / proteomics / metabolomics
- clinical metadata and response data
- SNP and risk features (where relevant)
2) Predict mechanism + response
- Build predictive models for:
- molecule–microbe interactions
- molecule–host pathway effects
- omics signature prediction
- clinical response forecasting
- safety and developability scoring
3) Design and optimise functional complexes
- Implement multi-objective optimisation and scoring frameworks to balance:
- efficacy / predicted response
- safety and stability constraints
- manufacturability and cost
- regulatory feasibility
- intelligent ingredient complexes
- repurposed peptides
- newly discovered natural peptides
4) Productionise the AI product launch
- Build end-to-end ML pipeline covering ingestion, training, evaluation and deployment.
- Develop APIs/services to serve predictions and ranked candidates into internal tools and client outputs.
- Create evaluation harnesses to compare predicted vs. observed validation outcomes.
- Implement monitoring and governance: drift, data quality checks, model versioning, auditability.
5) Collaborate cross-functionally
- Work closely with biology, formulation, and clinical teams to design experiments and validation loops.
- Partner with product and commercial teams to shape “client-ready” deliverables (e.g., ranked actives, evidence packs, scientific dossiers).
- Lead and/or partner with ML and software teams to define ownership boundaries, code reviews norms, and the path from prototype to a maintained service.
12-Month Mission:
- Month 0–3: Foundation & Proof of Concept
- Establish harmonised datasets and core data pipelines with dataset versioning, documented schemas, and baseline QC checks.
- Deliver feasibility screening models for active discovery with an agreed split strategy (sample, cohort, and time splits) and reporting baseline ranking metrics (e.g., hit-rate@K, NDCG@K).
- Build initial predictive baselines with clear metrics.
- Implement multi-criteria scoring + optimisation with a defined objective, weighting strategy, and ablation plan.
- Extend predictive models and improve candidate ranking performance.
- Develop reproducible experiment tracking and evaluation workflows.
- Compare predictions against real validation outputs and refine models.
- Improve robustness, interpretability, and governance.
- Deliver a performance report suitable for internal and external stakeholders.
- Finalise v1.0 AI product outputs (ranked candidates + evidence summaries).
- Support pilot client projects and accelerate validation turnaround time.
- Contribute to the first commercial-ready “hero” functional complex pipeline.
What we’re looking for:
Essential:
- Strong Python and experience building ML systems end-to-end with evidence (links to shipped projects, tools, or repos).
- Proven ability to work with large, messy real-world datasets and to define leakage-safe validation splits (e.g., sample, time, cohort).
- Practical knowledge of ML evaluation, validation strategy, and failure modes including error analysis and iteration planning.
- Experience with model development using PyTorch / TensorFlow / JAX.
- Ability to communicate clearly across technical + scientific stakeholders.
- Strong experience leading or coordinating cross-functional teams, and thinking strategically across product and science (prioritisation, tradeoffs, and shipping).
- Comfort deploying models (batch + real-time inference) into production environments or owning the handoff to engineering with clear interfaces.
Strongly preferred:
- Experience with biological or high-dimensional scientific datasets (omics, imaging, microbiome, clinical).
- Experience with multi-modal learning, embeddings, or representation learning.
- Familiarity with optimisation / ranking / multi-objective scoring systems.
- Interest in mechanism-driven modelling and scientific interpretability.
- Foundation model experience (LLMs and/or biological foundation models) with proof (fine-tuning/adapters or applied FM pipelines, plus how it was evaluated).
- A strong portfolio of tool or pipeline development (internal platforms, evaluation harnesses, reproducible workflows).
Nice to have:
- Experience building developer-friendly tooling: CLIs, dashboards, APIs, or reusable libraries used by other scientists/engineers.
- Some front-end/product sense (you care about shipping usable tools, not just notebooks).
- Experience with causal inference, Bayesian methods, or mechanistic simulation.
Senior Computational Biologist | ML Engineer in Cambridge employer: SEQUENTIAL
Contact Detail:
SEQUENTIAL Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Computational Biologist | ML Engineer in Cambridge
✨Tip Number 1
Network like a pro! Get out there and connect with people in the industry. Attend meetups, conferences, or even online webinars. You never know who might have the inside scoop on job openings or can refer you to someone looking for your skills.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to ML and computational biology. This is your chance to demonstrate what you can do beyond just a CV. Make sure to include links to any relevant GitHub repos or tools you've developed.
✨Tip Number 3
Prepare for interviews by brushing up on both technical and soft skills. Practice explaining complex concepts in simple terms, as you'll need to communicate effectively with both technical and non-technical stakeholders. Mock interviews can be a great way to get comfortable!
✨Tip Number 4
Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining us at StudySmarter. Tailor your application to highlight how your experience aligns with our mission and the role you're applying for.
We think you need these skills to ace Senior Computational Biologist | ML Engineer in Cambridge
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the role of Senior Computational Biologist | ML Engineer. Highlight your experience with large datasets, ML systems, and any relevant projects you've worked on. We want to see how your skills align with our mission!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about AI-driven biological discovery and how your background makes you a perfect fit for our team. Keep it engaging and personal – we love to see your personality!
Showcase Your Projects: Don’t forget to include links to your previous work or projects that demonstrate your expertise in ML and computational biology. Whether it's GitHub repos or published papers, we want to see what you've accomplished and how it relates to our goals.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re genuinely interested in joining our team at StudySmarter!
How to prepare for a job interview at SEQUENTIAL
✨Know Your Data
Familiarise yourself with the types of biological datasets mentioned in the job description. Be prepared to discuss how you would handle large, messy datasets and your experience with multi-omics data. This shows you understand the core of what the role entails.
✨Showcase Your ML Skills
Bring examples of your previous work with machine learning systems, especially those that demonstrate end-to-end development. Highlight any projects using PyTorch, TensorFlow, or JAX, and be ready to explain your approach to model validation and evaluation.
✨Communicate Clearly
Practice explaining complex technical concepts in simple terms. You’ll need to communicate effectively with both technical and scientific stakeholders, so think about how you can bridge that gap during your interview.
✨Prepare for Cross-Functional Collaboration
Think about your past experiences working with cross-functional teams. Be ready to share specific examples of how you’ve collaborated with biology, formulation, or clinical teams, and how you’ve contributed to shaping client-ready deliverables.