Principal / Senior Data Scientist

Saffron Walden · Full-Time · £48,000 - £84,000 / year (est.) · No home office possible

At a Glance

  • Tasks: Lead machine learning projects to enhance understanding of cellular biology and health.
  • Company: Join the Wellcome Sanger Institute, a top genomics research hub improving global health.
  • Benefits: Enjoy hybrid working, professional development, and a supportive, inclusive culture.
  • Why this job: Make a real impact on human health while collaborating with leading scientists in an innovative environment.
  • Qualifications: MSc/PhD in a quantitative field; experience in machine learning and Python required.
  • Other info: Apply by 6th July 2025; salary based on skills and experience.

The predicted salary is between £48,000 and £84,000 per year.

Do you want to help us improve human health and understand life on Earth? Make your mark by shaping the future to enable or deliver life-changing science to solve some of humanity’s greatest challenges. We seek a Senior or Principal Data Scientist to deliver a project at the intersection of machine learning and biology at the Wellcome Sanger Institute. The position is a two-year fixed-term contract on a project funded by Open Targets, within Mo Lotfollahi's team in our Cellular Genomics Programme.

About the Role: The project aims to leverage datasets generated internally at the Sanger Institute, together with publicly available data from human cells, to create foundational models for cellular biology that predict cellular behaviours in response to different perturbations, enhancing our understanding of life's rules and improving health for all. Your role will involve designing foundational models that leverage multi-modal readouts. This includes integrating and processing data from various sources to develop robust and versatile AI models. To achieve this, you will work with open-source software, proposing, developing, and maintaining new solutions to analyse and interpret large-scale single-cell datasets. We have access to unique data and are also in a position to generate data to train unique models. Additionally, we have substantial computational power and GPU resources to train large models efficiently. Our teams are well-positioned to tackle this problem, with experience in both generating and analysing datasets, including millions of cells across multiple tissues and conditions (e.g., disease, healthy). The role requires a detailed understanding of the training of large-scale ML models and a track record of undertaking large data-science projects.

What you’ll be doing:

  • Independently manage and lead machine learning projects and write up the outcomes as scientific publications for submission to journals or machine learning conferences (ICLR, ICML, CVPR, etc.).
  • Be responsible for the design of machine learning tools according to requirements specified by the scientific leadership of the group.
  • Communicate the results through other relevant means, for example, presenting posters at international scientific meetings and meetings with collaborators.
  • Collaborate with team members to propose, develop, and evaluate new machine learning models that enable understanding single-cell data and its application in drug discovery.
  • Take a full part in the general duties of the team, and pass on skills and knowledge to other team members and visitors.
  • Participate in wider Sanger Institute activities as appropriate.
  • Contribute to problem-solving discussions with team members across the wider Institute, as appropriate, to solve complicated or multifaceted problems and generate novel ideas and new approaches.

Additional responsibilities for Principal Data Scientist:

  • Lead writing scientific papers aimed at biotechnology and biology.
  • Write/support grant applications.
  • Take the lead and collaborate with Ph.D. students and postdocs on developing solutions for interdisciplinary scientific problems in biology, and engage with key stakeholders and collaborators both within and outside the Institute.
  • Maintain a thorough understanding of the research portfolio within the team, and help ensure that grants and key objectives are delivered.

About Us: Open Targets is a public-private partnership involving the Wellcome Sanger Institute, a world-leading genomics institution, EMBL’s European Bioinformatics Institute (EMBL-EBI), a global leader in the management, integration and analysis of public domain life science data, and world-leading pharmaceutical companies GSK, Sanofi, MSD, Pfizer, and Genentech. The Open Targets (OT) research programme generates and analyses data to connect targets to diseases, assess the strength of this evidence, and help identify and prioritise targets for drug discovery. This includes evidence that causally links targets and diseases, as well as foundational data that helps us understand biological processes and disease progression more deeply. You will work within an interdisciplinary team of life scientists and computer/ML scientists, with a shared objective of advancing biological research through these foundational models. This role will sit within the AI/ML Faculty group led by Dr. Mo Lotfollahi, and you will be responsible for delivering your portfolio of scientific research projects as part of the broader team strategy.

About You: You will be supported in your personal and professional development and have the opportunity to lead peer-reviewed publications around using genetics and genomics approaches to guide drug discovery and present them at national and international conferences.

  • MSc and/or Ph.D. or equivalent experience in a relevant quantitative discipline (e.g., Computer Science, Computational Biology, Genetics, Bioinformatics, Physics, Engineering, or Applied Statistics/Mathematics).
  • Proven experience using advanced statistical techniques, machine learning, and modern deep learning techniques.
  • Previous ML work experience in a scientific/academic environment (RA positions and internships are considered work experience).
  • Strong knowledge of Python, including core data science libraries such as Scikit-Learn, SciPy, TensorFlow, and PyTorch.
  • Knowledge of good software development practices and collaboration tools, including git-based version control, Python package management, and code reviews.
  • Excellent communication skills, with the ability to explain complex machine learning algorithms and statistical methods to non-technical stakeholders.
  • Experience working with cloud environments and tools, such as Amazon AWS S3, EC2, etc.
  • Evidence of related work experience as a researcher in the area of machine learning.
  • Strong publication record.
  • Ability to quickly understand scientific, technical, and process challenges and break down complex problems into actionable steps.
  • Ability to work in a frequently changing environment, with the capability to interpret management information to amend plans.
  • Ability to prioritise, manage workload, and deliver agreed activities consistently on time.
  • Good networking, influencing, and relationship-building skills.
  • Strategic thinking: the ability to see the ‘bigger picture’.
  • Ability to build collaborative working relationships with internal and external stakeholders at all levels.
  • Demonstrates inclusivity and respect for all.

Additional essential skills for the Principal Data Scientist:

  • Experience in supervision (PhD students and Postdoctoral Fellows).
  • Experience in writing manuscripts for publication.
  • Experience working with cloud environments and tools, such as Amazon AWS S3, EC2, etc.
  • Relevant solid publication record in either machine learning or application of machine learning in biology.

Other Information:

  • Salary per annum (dependent upon skills and experience):
  • Application Process: Please upload your CV and a cover letter outlining how you meet the criteria set out above.
  • Closing date: 6th July 2025.

We recognise that there are many benefits to hybrid working, including an improved work-life balance with more focused time, as well as the ability to organise working time so that collaborative opportunities and team discussions are facilitated on campus. The hybrid working arrangement will vary for different roles and teams. The nature of your role and the type of work you do will determine if a hybrid working arrangement is possible.

Equality, Diversity and Inclusion: We aim to attract, recruit, retain and develop talent from the widest possible talent pool, thereby gaining insight and access to different markets to generate a greater impact on the world. We have a supportive culture with the following staff networks: LGBTQ+, Parents and Carers, Disability, and Race Equity. These bring people together to share experiences, offer specific support and development opportunities, and raise awareness. The networks are also a place for allies to provide support to others. We want our people to be whoever they want to be because we believe people who bring their best selves to work do their best work. That’s why we’re committed to creating a truly inclusive culture at the Sanger Institute. We will consider all individuals without discrimination and are committed to creating an inclusive environment for all employees, where everyone can thrive.

We are proud to deliver an award-winning campus-wide employee wellbeing strategy and programme. The importance of good health, of adopting a healthier lifestyle, and of the commitment to reduce work-related stress is strongly acknowledged and recognised at the Sanger Institute.

The Sanger Institute became a signatory of the International Technician Commitment initiative in March 2018. The Technician Commitment aims to empower and ensure visibility, recognition, career development and sustainability for technicians working in higher education and research, across all disciplines.

Every year the Wellcome Sanger Institute supports the visits of dozens of overseas researchers from across the globe to the Genome campus to collaborate, share insights and undertake ground-breaking scientific research. Our reliable in-house service provides expert advice and guidance to support current and prospective staff, and visitors, at various points of their journey.

Principal / Senior Data Scientist employer: Wellcome Sanger Institute

At the Wellcome Sanger Institute, we are dedicated to advancing human health and understanding life on Earth through innovative research. As a Senior or Principal Data Scientist, you will thrive in a collaborative and inclusive environment that prioritises personal and professional development, offering unique access to cutting-edge data and computational resources. Our commitment to employee wellbeing, hybrid working arrangements, and a supportive culture ensures that you can make a meaningful impact while enjoying a fulfilling work-life balance.

Contact Detail:

Wellcome Sanger Institute Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Principal / Senior Data Scientist role

✨Tip Number 1

Familiarise yourself with the latest research and publications from the Wellcome Sanger Institute, especially those authored by Mo Lotfollahi. This will not only give you insight into the team's current projects but also help you understand their methodologies and focus areas, which can be beneficial during interviews.

✨Tip Number 2

Network with professionals in the field of data science and biology, particularly those who have experience in machine learning applications within scientific research. Attend relevant conferences or webinars to connect with potential colleagues and gain insights that could enhance your application.

✨Tip Number 3

Prepare to discuss your previous projects in detail, especially those involving large datasets and machine learning models. Be ready to explain your thought process, the challenges you faced, and how you overcame them, as this will demonstrate your problem-solving skills and technical expertise.

✨Tip Number 4

Showcase your ability to communicate complex ideas clearly. Since the role involves collaboration with non-technical stakeholders, practice explaining your work in a way that is accessible to those without a technical background, which will highlight your communication skills during the interview.

We think you need these skills to ace the Principal / Senior Data Scientist role

Advanced Statistical Techniques
Machine Learning
Deep Learning
Python Programming
Data Science Libraries (Scikit-Learn, SciPy, TensorFlow, PyTorch)
Software Development Best Practices
Version Control (Git)
Cloud Computing (Amazon AWS S3, EC2)
Scientific Communication
Project Management
Data Integration and Processing
Collaboration and Teamwork
Problem-Solving Skills
Research Publication Writing
Networking and Relationship Building
Strategic Thinking

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, data science, and biology. Emphasise any projects or roles that demonstrate your ability to manage large datasets and develop AI models.

Craft a Compelling Cover Letter: In your cover letter, clearly articulate how your skills and experiences align with the job requirements. Mention specific projects you've worked on that relate to cellular biology and machine learning, and express your enthusiasm for contributing to the Wellcome Sanger Institute's mission.

Showcase Your Publications: If you have a strong publication record, make sure to include this in your application. Highlight any papers related to machine learning or biology, especially those published in reputable journals, as this will strengthen your candidacy.

Demonstrate Communication Skills: Since excellent communication skills are essential for this role, consider including examples in your application of how you've effectively communicated complex scientific concepts to non-technical audiences, whether through presentations, posters, or collaborative projects.

How to prepare for a job interview at Wellcome Sanger Institute

✨Showcase Your Technical Skills

Make sure to highlight your experience with machine learning and data science tools, especially Python and its libraries like TensorFlow and PyTorch. Be prepared to discuss specific projects where you've applied these skills, as this will demonstrate your capability to handle the technical demands of the role.

✨Prepare for Problem-Solving Questions

Expect to face questions that assess your ability to tackle complex scientific problems. Practice breaking down intricate issues into manageable steps and be ready to explain your thought process clearly. This will show your strategic thinking and problem-solving abilities.

✨Communicate Effectively

Since you'll need to explain complex concepts to non-technical stakeholders, practice simplifying your explanations without losing the essence of the information. Good communication skills are crucial, so consider how you can convey your ideas clearly and concisely.

✨Demonstrate Collaboration Experience

Highlight your experience working in interdisciplinary teams, especially in collaborative environments. Be ready to share examples of how you've worked with others to achieve common goals, as this role involves significant teamwork and collaboration with various stakeholders.
