At a Glance
- Tasks: Join our ML team to build genomic models and contribute to cutting-edge gene-editing technology.
- Company: We're a pioneering biotech firm revolutionising plant sciences with advanced machine learning techniques.
- Benefits: Enjoy a hybrid work model, collaborative environment, and opportunities for professional growth.
- Why this job: Be part of a dynamic team tackling real-world challenges in genetics and biotechnology.
- Qualifications: Postgraduate experience in ML with a focus on biological applications and proficiency in PyTorch required.
- Other info: Ideal for those passionate about genomics and eager to innovate in a supportive setting.
The predicted salary is between 48000 - 72000 £ per year.
While gene-editing is becoming increasingly efficient, identifying which genes to edit and how remains a significant challenge. To overcome this bottleneck, we use cutting-edge deep learning to accurately and efficiently identify high-value genetic targets for gene-editing. Our approach draws inspiration from recent advancements in the drug discovery space, incorporating large language models (LLMs), transformers, and graph-based technologies to build a best-in-class discovery platform for plant sciences.
Our team is currently composed of 12 members, including ML engineers, data engineers, and bioinformaticians. We also have a remote, part-time intern conducting ML research. The team primarily works together in person at our office in London 4 days per week.
As part of the core ML team, you will help us build genomic foundation models. Your responsibilities could range from model training to data curation to evaluations. We welcome applicants with specific expertise who feel they could uniquely contribute to the training lifecycle of large, complex models. The ideal applicant will have experience using genomic data in a machine learning context. We are particularly interested in individuals with experience working with foundational generative models of DNA or transcriptomic data. However, our modelling efforts have a strong focus on multi-modality, so experience with or interest in other data modalities (e.g., text) is a plus.
Core Responsibilities- Contribution to the development of proprietary -omics models, including model training and evaluation development.
- Recreation of state-of-the-art models from the scientific literature and benchmarking against internal models and evaluations.
- Model deployment to ensure flexible and scalable inference access to the wider Data Science team.
- Collaboration with the bioinformatics team to ingest, standardize, and QC data from multiple sources (internal and external) for use in training pipelines.
- Support for the wider ML team on model development and commercial projects.
- Postgraduate experience (MSc or PhD) in ML with a demonstrated application to a biological domain.
- Experience building modern ML architectures (e.g., transformers, diffusers) from scratch and applying them to real biological datasets.
- Experience working with large-scale transcriptomic datasets, ideally from non-human organisms (though not required).
- Experience with PyTorch, huggingface transformers, and diffusers.
- Experience working with ML accelerators.
- Relevant publications in reputable journals or contributions to open-source projects.
- Exposure to and interest in probabilistic ML, causal ML, or active learning.
- Experience with distributed model training (data and model parallelism).
- Experience working on biological data curation, including data cleansing and preprocessing of -omics datasets.
- Exposure to cloud-based ML orchestration frameworks such as Sagemaker and Vertex AI.
- Experience with model deployment in an enterprise setting.
For immediate consideration please send your most up to date CV to.
Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London employer: Enigma
Contact Detail:
Enigma Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London
✨Tip Number 1
Familiarise yourself with the latest advancements in deep learning, particularly in the context of genomics. Being able to discuss recent breakthroughs or relevant research during your interview can demonstrate your passion and knowledge in the field.
✨Tip Number 2
Network with professionals in the biotechnology and machine learning sectors. Attend industry conferences or webinars where you can meet potential colleagues and learn more about the specific challenges they face, which can help you tailor your approach when applying.
✨Tip Number 3
Showcase any hands-on experience you have with PyTorch and large-scale transcriptomic datasets. If you have personal projects or contributions to open-source initiatives, be ready to discuss these in detail, as practical experience is highly valued.
✨Tip Number 4
Prepare to discuss how you would approach model training and evaluation for genomic data. Think about specific methodologies you would use and be ready to explain your reasoning, as this will highlight your problem-solving skills and technical expertise.
We think you need these skills to ace Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, particularly with genomic data and the technologies mentioned in the job description, such as PyTorch and transformers. Use specific examples to demonstrate your expertise.
Craft a Compelling Cover Letter: Write a cover letter that explains why you are passionate about the role and how your background aligns with the company's mission. Mention any specific projects or experiences that relate to gene-editing and deep learning.
Showcase Relevant Projects: If you have worked on projects involving large-scale transcriptomic datasets or foundational generative models, be sure to include these in your application. Highlight your contributions and the impact of your work.
Highlight Collaboration Skills: Since the role involves working closely with bioinformaticians and other team members, emphasise your teamwork and collaboration skills. Provide examples of how you've successfully worked in multidisciplinary teams in the past.
How to prepare for a job interview at Enigma
✨Showcase Your Technical Skills
Be prepared to discuss your experience with machine learning architectures, especially transformers and PyTorch. Bring examples of projects where you've applied these technologies, particularly in a biological context, to demonstrate your expertise.
✨Understand the Company’s Focus
Research the company’s approach to gene editing and their use of deep learning in identifying genetic targets. Familiarise yourself with their methodologies, such as large language models and graph-based technologies, to show that you’re aligned with their goals.
✨Prepare for Problem-Solving Questions
Expect to face technical questions that assess your problem-solving abilities. Be ready to discuss how you would approach model training, data curation, and evaluation, especially in relation to genomic data and multi-modality.
✨Highlight Collaboration Experience
Since the role involves working closely with bioinformaticians and other team members, share examples of past collaborative projects. Emphasise your ability to work in a team environment and how you’ve contributed to successful outcomes in interdisciplinary settings.