At a Glance
- Tasks: Join our ML team to develop genomic models and enhance gene-editing technologies.
- Company: We're a pioneering biotech firm focused on cutting-edge gene-editing solutions.
- Benefits: Enjoy a hybrid work model, collaborative environment, and opportunities for professional growth.
- Why this job: Be part of a dynamic team tackling real-world challenges in plant sciences with innovative tech.
- Qualifications: Postgraduate experience in ML, ideally with a focus on biological applications and modern architectures.
- Other info: Remote part-time intern support and a vibrant office culture in London.
The predicted salary is between 48000 - 84000 £ per year.
While gene-editing is becoming increasingly efficient, identifying which genes to edit and how remains a significant challenge. To overcome this bottleneck, we use cutting-edge deep learning to accurately and efficiently identify high-value genetic targets for gene-editing. Our approach draws inspiration from recent advancements in the drug discovery space, incorporating large language models (LLMs), transformers, and graph-based technologies to build a best-in-class discovery platform for plant sciences.
Our team is currently composed of 12 members, including ML engineers, data engineers, and bioinformaticians. We also have a remote, part-time intern conducting ML research. The team primarily works together in person at our office in London 4 days per week.
As part of the core ML team, you will help us build genomic foundation models. Your responsibilities could range from model training to data curation to evaluations. We welcome applicants with specific expertise who feel they could uniquely contribute to the training lifecycle of large, complex models.
The ideal applicant will have experience using genomic data in a machine learning context. We are particularly interested in individuals with experience working with foundational generative models of DNA or transcriptomic data. However, our modelling efforts have a strong focus on multi-modality, so experience with or interest in other data modalities (e.g., text) is a plus.
Core Responsibilities- Contribution to the development of proprietary -omics models, including model training and evaluation development.
- Recreation of state-of-the-art models from the scientific literature and benchmarking against internal models and evaluations.
- Model deployment to ensure flexible and scalable inference access to the wider Data Science team.
- Collaboration with the bioinformatics team to ingest, standardize, and QC data from multiple sources (internal and external) for use in training pipelines.
- Support for the wider ML team on model development and commercial projects.
- Postgraduate experience (MSc or PhD) in ML with a demonstrated application to a biological domain.
- Experience building modern ML architectures (e.g., transformers, diffusers) from scratch and applying them to real biological datasets.
- Experience working with large-scale transcriptomic datasets, ideally from non-human organisms (though not required).
- Experience with PyTorch, huggingface transformers, and diffusers.
- Experience working with ML accelerators.
- Relevant publications in reputable journals or contributions to open-source projects.
- Exposure to and interest in probabilistic ML, causal ML, or active learning.
- Experience with distributed model training (data and model parallelism).
- Experience working on biological data curation, including data cleansing and preprocessing of -omics datasets.
- Exposure to cloud-based ML orchestration frameworks such as Sagemaker and Vertex AI.
- Experience with model deployment in an enterprise setting.
For immediate consideration please send your most up to date CV to jason@enigma-rec.ai
Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London (Greater London) employer: Enigma
Contact Detail:
Enigma Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London (Greater London)
✨Tip Number 1
Familiarise yourself with the latest advancements in deep learning, particularly in the context of genomics. Understanding how large language models and transformers are applied in gene-editing will give you a significant edge during discussions with our team.
✨Tip Number 2
Engage with the bioinformatics community by attending relevant conferences or webinars. Networking with professionals in the field can provide insights into current challenges and innovations, which could be beneficial when discussing your potential contributions to our projects.
✨Tip Number 3
Showcase any personal projects or contributions to open-source initiatives related to machine learning and genomics. This demonstrates your passion and practical experience, making you a more attractive candidate for our team.
✨Tip Number 4
Prepare to discuss your experience with model deployment and cloud-based ML orchestration frameworks. Being able to articulate your hands-on experience with tools like Sagemaker or Vertex AI will highlight your readiness to contribute effectively to our ML team.
We think you need these skills to ace Machine Learning Engineer | Omics | RNA | DNA | PyTorch | Hybrid, London (Greater London)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, particularly with genomic data and the technologies mentioned in the job description, such as PyTorch and transformers. Use specific examples to demonstrate your expertise.
Craft a Compelling Cover Letter: Write a cover letter that explains why you are interested in this position and how your background aligns with the company's goals. Mention any specific projects or experiences that relate to the development of -omics models or large-scale datasets.
Showcase Relevant Skills: In your application, emphasise your skills in model training, evaluation, and deployment. If you have experience with multi-modality data or cloud-based ML frameworks, make sure to include that as well.
Highlight Collaborative Experience: Since the role involves collaboration with bioinformatics teams, mention any past experiences where you worked in a team setting, especially in interdisciplinary environments. This will show your ability to work effectively with others.
How to prepare for a job interview at Enigma
✨Showcase Your Technical Skills
Be prepared to discuss your experience with machine learning architectures, especially transformers and PyTorch. Bring examples of projects where you've applied these technologies, particularly in a biological context.
✨Understand the Company’s Focus
Research the company’s approach to gene editing and their use of deep learning in identifying genetic targets. This will help you align your answers with their goals and demonstrate your genuine interest in their work.
✨Prepare for Problem-Solving Questions
Expect technical questions that assess your problem-solving skills. Be ready to explain how you would approach model training and evaluation, especially with genomic data. Practice articulating your thought process clearly.
✨Highlight Collaboration Experience
Since the role involves working closely with bioinformaticians and other team members, share examples of past collaborations. Emphasise your ability to work in a team and how you’ve contributed to collective goals in previous roles.