At a Glance
- Tasks: Lead and grow the customer analytics team, building advanced analytics models.
- Company: Join a boutique firm delivering creative consumer insights to top brands and consultancies.
- Benefits: Work with cutting-edge technologies and enjoy a flexible work environment.
- Why this job: Make an impact by solving real business issues with data-driven solutions.
- Qualifications: Proven experience in data science, machine learning, and client engagement required.
- Other info: Experience with cloud infrastructures like Azure or AWS is a plus.
The predicted salary is between 48000 - 84000 £ per year.
Do you want to help us improve human health and understand life on Earth? Make your mark by shaping the future to enable or deliver life-changing science to solve some of humanity’s greatest challenges.
Senior Data Scientist
We seek a senior machine learning research scientist to join a collaborative project between the Wellcome Sanger Institute and Open Targets. This project aims to leverage datasets internally generated at the Sanger Institute and publicly available data from human cells to create foundational models for biology, enhancing our understanding of life’s rules and improving health for all. You will work within an interdisciplinary team of life scientists and computer/ML scientists, with a shared objective of advancing biological research through these foundational models. This role will sit within the AI/ML Faculty group led by Dr. Mohammad Lotfollahi, and the successful candidates, across different seniority levels (senior and principal), will be responsible for delivering their portfolio of scientific research projects as part of the broader team strategy.
About the role
Your role will involve designing foundational models leveraging multi-modal readouts. This includes integrating and processing data from various sources to develop robust and versatile AI models. To achieve this, you will work with open-source software, proposing, developing, and maintaining new solutions to analyze and interpret large-scale single-cell datasets. We have access to unique data and are also in the position to generate data to train unique models. Additionally, we have substantial computational power and GPU resources to train large models efficiently.
Our teams are well-positioned to tackle this problem with experience in both generating and analyzing datasets, including millions of cells across multiple tissues and conditions (e.g., disease, healthy). This involves a detailed understanding of the training of large-scale ML models and a track record of undertaking large data-science projects.
You will be responsible for:
- Independently managing and leading machine learning research projects and writing outcomes in a scientific publication for submission to journals or machine learning conferences (ICLR, ICML, CVPR, etc).
- Collaborating with team members in proposing, developing, and evaluating new machine learning models that enable understanding single-cell data and its application in drug discovery.
- Working with Ph.D. students and postdocs in collaborating teams on developing solutions for interdisciplinary scientific problems in biology as well as providing supervision and training to junior members of the team.
- Contributing to writing scientific papers on biotechnology and biology.
- Distilling your developed solutions into open-source and easy-to-install packages with documentation that facilitates the usage of your solution for downstream users, including biologists and bioinformaticians.
- Presenting your research and analysis pipelines to internal and external audiences.
About You:
You will be supported in your personal and professional development and have the opportunity to lead peer-reviewed publications around using genetics and genomics approaches to guide drug discovery and present them at national and international conferences.
- Ph.D. or M.Sc. with equivalent research experience in a relevant quantitative discipline (e.g., Computer Science, Computational Biology, Genetics, Bioinformatics, Physics, Engineering, or Applied Statistics/Mathematics).
- Previous ML work experience in scientific/academic environment (RA/Internships are considered as work experience).
- Strong knowledge of Python, including core data science libraries such as Scikit-Learn, SciPy, TensorFlow, and PyTorch.
- Expertise in machine learning algorithms and frameworks, with experience in designing, training, and deploying ML models.
- Proficiency in handling and processing large datasets, including techniques for data cleaning, feature engineering, and data augmentation.
- Experience with high-performance computing environments, including the use of GPUs for training large-scale machine learning models.
- Experience in natural language processing (NLP) and training models based on transformer architectures, such as BERT and GPT.
- Familiarity with generative models such as diffusion models and flow matching.
- Knowledge of software development good practices and collaboration tools, including git-based version control, Python package management, and code reviews.
- Strong problem-solving skills with the ability to analyze complex data and derive actionable insights.
- Excellent communication skills, with the ability to explain complex machine learning algorithms and statistical methods to non-technical stakeholders.
In addition to the above technical skills, you will also have the following:
- Ability to quickly understand scientific, technical, and process challenges and breakdown complex problems into actionable steps.
- Ability to work in a frequently changing environment with the capability to interpret management information to amend plans.
- Ability to prioritize, manage workload, and deliver agreed activities consistently on time.
- Demonstrate good networking, influencing and relationship building skills.
- Strategic thinking is the ability to see the ‘bigger picture.’
- Ability to build collaborative working relationships with internal and external stakeholders at all levels.
- Demonstrates inclusivity and respect for all.
Relevant publication of the groups:
- Lotfollahi, M ., Naghipourfar, M., Luecken, M. D., Khajavi, M., Büttner, M., Wagenstetter, M., Avsec, Ž., Gayoso, A., Yosef, N., Interlandi, M. & Others. Mapping single-cell data to reference atlases by transfer learning. Nature Biotechnology 1–10.
- Lotfollahi, M. , Wolf, F. A. & Theis, F. J. scGen predicts single-cell perturbation responses. Nature Methods 16, 715–721.
- Lotfollahi, M. , Rybakov, S., Hrovatin, K., Hediyeh-Zadeh, S., Talavera-López, C., Misharin, A. V. & Theis, F. J. Biologically informed deep learning to query gene programs in single cell atlases. Nature Cell Biology.
#J-18808-Ljbffr
Senior Data Scientist employer: TN United Kingdom
Contact Detail:
TN United Kingdom Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Data Scientist
✨Tip Number 1
Make sure to showcase your experience with advanced analytical techniques like regression and machine learning during the interview. Be prepared to discuss specific projects where you've successfully applied these skills, as this will demonstrate your technical expertise.
✨Tip Number 2
Familiarize yourself with the company's recent projects, especially those involving customer analytics and predictive modeling. This knowledge will help you engage in meaningful conversations about how you can contribute to their goals.
✨Tip Number 3
Highlight your ability to communicate complex data insights to clients. Prepare examples of how you've effectively advised clients on business objectives, as this is crucial for the role.
✨Tip Number 4
Demonstrate your flexibility and autonomy by discussing situations where you've successfully managed your workload independently. This will show that you can prioritize tasks effectively in a fast-paced environment.
We think you need these skills to ace Senior Data Scientist
Some tips for your application 🫡
Understand the Company: Dive deep into the boutique firm's services and specialisms. Familiarize yourself with their approach to consumer insights, predictive models, and analytics techniques they employ. This knowledge will help you tailor your application.
Highlight Relevant Experience: In your CV and cover letter, emphasize your experience with advanced analytical techniques, machine learning, and customer analytics. Provide specific examples of projects where you've successfully applied these skills.
Showcase Technical Skills: Make sure to detail your proficiency in R or Python, SQL, and any experience with cloud infrastructures like Azure or AWS. Mention any relevant tools or technologies you've used, especially in relation to web scraping or data integration.
Craft a Compelling Cover Letter: Use your cover letter to express your passion for data science and how you can contribute to the new customer analytics team. Discuss your ability to communicate with clients and how you can help them achieve their business objectives.
How to prepare for a job interview at TN United Kingdom
✨Showcase Your Technical Skills
Be prepared to discuss your experience with advanced analytical techniques like regression, machine learning, and NLP. Bring examples of projects where you've successfully applied these skills, especially in R or Python.
✨Demonstrate Client Interaction Experience
Since the role involves client interaction, be ready to share instances where you've confidently discussed business objectives with clients. Highlight how you advised them on deploying results effectively.
✨Discuss Your Flexibility and Autonomy
The company values a flexible approach to workload. Prepare to talk about situations where you've worked autonomously, prioritized tasks, and organized your work efficiently to meet deadlines.
✨Familiarize Yourself with Cloud Technologies
Understanding cloud infrastructures like Azure or AWS is crucial. Be ready to discuss how you've integrated analytical workflows in these environments, as well as any experience with accessing customer databases and data lakes.