At a Glance
- Tasks: Join us to develop AI-driven systems for groundbreaking biological discoveries.
- Company: Deep Genomics, a leader in AI for drug discovery.
- Benefits: Competitive pay, stock options, health coverage, and flexible work hours.
- Other info: Collaborative environment with a focus on learning and growth.
- Why this job: Make a real impact in genomics with cutting-edge AI technology.
- Qualifications: 3+ years in LLMs, MSc or PhD in relevant fields, strong Python skills.
The predicted salary is between 48000 - 72000 € per year.
About Us
Deep Genomics is at the forefront of using artificial intelligence to transform drug discovery. Our proprietary AI platform decodes the complexity of RNA biology to identify novel drug targets, mechanisms, and therapeutics inaccessible through traditional methods. With expertise spanning machine learning, bioinformatics, data science, engineering, and drug development, our multidisciplinary team in Toronto and Cambridge, MA is revolutionizing how new medicines are created.
About the Role
Join us in building the future of AI-driven therapeutic design as a (Senior) Research Scientist specializing in large language models for genomics within our Systems and Target Biology group. You will develop and implement systems for using large language models to discover and characterize evidence for new biological discoveries using Deep Genomics’ foundation models. As a part of this role, you will interact closely with the machine learning team developing foundation models, the engineering and infrastructure teams to build scalable systems, and the statistical genetics and experimental groups to synthesize evidence for therapeutic actionability.
Ideal Candidate
We are looking for someone with 3+ years of experience in using and developing solutions using LLMs for complex, multi-agent, workflows. The ideal candidate will be passionate about leveraging AI and foundation models to disrupt therapeutic design workflows and is adept at translating complex scientific requirements into robust computational solutions. A background in genomics or computational biology is highly desirable, as well as clear experience in MLOps and architecting agentic solutions from the ground up.
Key Responsibilities
- Design and implement multi-agent workflows that integrate internal foundation models (e.g., BigRNA, REPRESS, FlashRNA) and external tools to identify new biological hypotheses.
- Develop systems that leverage Retrieval Augmented Generation (RAG) by connecting LLMs to internal scientific documents, SOPs, and structured biological databases.
- Collaborate with the machine learning team on model distillation strategies to create smaller, faster models suitable for a real-time, interactive chat interface.
- Build out and maintain the infrastructure for the LLM agent, including databases and model context protocol (MCP) endpoints.
- Work closely with end-users in therapeutic design, target discovery, and experimental biology to identify key use cases, gather feedback, and rapidly iterate on the product.
- Ensure the system is transparent and trustworthy by building \"explainable AI\" features that help users understand and verify the AI\'s outputs and decisions.
Basic Qualifications
- MSc or PhD in Computer Science, Computational Biology, Bioinformatics, or a related field.
- 3+ years of hands-on experience architecting and building complex applications using Large Language Models.
- Expert knowledge of Python and modern MLOps frameworks and tools; experience with agentic frameworks like LangChain is essential.
- Demonstrated experience in building multi-agent systems that can plan, execute tasks, and interact with external tools and APIs.
- Familiarity with high-performance computing environments and cloud services (e.g., AWS, GCP).
- Excellent communication skills and the ability to work effectively in a multidisciplinary team, translating the needs of biologists and drug developers into technical solutions.
- Intellectual curiosity, critical thinking, and a commitment to innovation and scientific rigor.
Preferred Qualifications
- A strong background in genomics, computational biology, or bioinformatics, including experience with NGS data analysis or large-scale biological datasets.
- Prior experience in the biotech or pharmaceutical industry, particularly in a drug discovery context.
- Experience with model distillation or creating smaller, specialized models from larger foundation models.
- Familiarity with scientific workflow management systems and tools (e.g., Docker, Conda).
What we offer
- A collaborative and innovative environment at the frontier of computational biology, machine learning, and drug discovery.
- Highly competitive compensation, including meaningful stock ownership.
- Comprehensive benefits - including health, vision, and dental coverage for employees and families, employee and family assistance program.
- Flexible work environment - including flexible hours, extended long weekends, holiday shutdown, unlimited personal days.
- Maternity and parental leave top-up coverage, as well as new parent paid time off.
- Focus on learning and growth for all employees - learning and development budget & lunch and learns.
- Facilities located in the heart of Toronto - the epicenter of machine learning and AI research and development, and in Kendall Square, Cambridge, Mass. - a global center of biotechnology and life sciences.
Deep Genomics welcomes and encourages applications from people with disabilities. Accommodations are available on request for candidates taking part in all aspects of the selection process.
Deep Genomics thanks all applicants, however only those selected for an interview will be contacted.
(Senior) Research Scientist - Large Language Models for Genomics in Cambridge employer: Deep Genomics Inc.
Deep Genomics is an exceptional employer, offering a collaborative and innovative environment at the cutting edge of computational biology and AI-driven drug discovery. With a strong focus on employee growth, competitive compensation, and comprehensive benefits, including flexible work arrangements and a commitment to diversity and inclusion, our Toronto location provides a vibrant hub for those passionate about transforming healthcare through technology.
StudySmarter Expert Advice🤫
We think this is how you could land (Senior) Research Scientist - Large Language Models for Genomics in Cambridge
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to large language models and genomics. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and soft skills. Practice explaining complex concepts in simple terms, as you'll need to communicate effectively with multidisciplinary teams.
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team at Deep Genomics.
We think you need these skills to ace (Senior) Research Scientist - Large Language Models for Genomics in Cambridge
Some tips for your application 🫡
Show Your Passion:When writing your application, let your enthusiasm for AI and genomics shine through! We want to see how your experience aligns with our mission at Deep Genomics, so don’t hold back on sharing your excitement about transforming drug discovery.
Tailor Your CV:Make sure your CV is tailored to highlight your experience with large language models and any relevant projects. We love seeing specific examples of how you've tackled complex workflows or developed innovative solutions in your previous roles.
Craft a Compelling Cover Letter:Your cover letter is your chance to tell us why you’re the perfect fit for this role. Be sure to connect your background in computational biology or bioinformatics to the responsibilities outlined in the job description. Let’s see that connection!
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re serious about joining our team at Deep Genomics!
How to prepare for a job interview at Deep Genomics Inc.
✨Know Your LLMs Inside Out
Make sure you’re well-versed in large language models, especially how they apply to genomics. Brush up on your understanding of Deep Genomics’ foundation models and be ready to discuss how you’ve used LLMs in past projects.
✨Showcase Your Collaboration Skills
This role involves working closely with various teams. Prepare examples that highlight your experience collaborating with machine learning, engineering, and experimental biology teams. Emphasise your ability to translate complex scientific needs into technical solutions.
✨Demonstrate Your Problem-Solving Approach
Be ready to discuss specific challenges you've faced in developing multi-agent workflows or MLOps solutions. Use the STAR method (Situation, Task, Action, Result) to structure your answers and showcase your critical thinking skills.
✨Ask Insightful Questions
Prepare thoughtful questions about the company’s AI-driven therapeutic design processes and their future direction. This shows your genuine interest in the role and helps you assess if it’s the right fit for you.