At a Glance
- Tasks: Lead AI/ML projects to revolutionise drug discovery using cutting-edge machine learning models.
- Company: Join Chemify, a pioneering company transforming chemistry with AI and robotics.
- Benefits: Competitive salary, collaborative environment, and opportunities for impactful work.
- Other info: Dynamic team atmosphere with excellent growth and learning opportunities.
- Why this job: Make a real-world impact by applying AI to discover new medicines.
- Qualifications: MSc or PhD in relevant fields with strong expertise in machine learning.
The predicted salary is between 70000 - 90000 £ per year.
About Chemify
Chemify is revolutionising chemistry. We are creating a future where the synthesis of previously unimaginable molecules, drugs, and materials is instantly accessible. By combining AI, robotics, and the world’s largest continually expanding database of chemical programs, we are accelerating chemical discovery to improve quality of life and extend the reach of humanity.
We seek a talented and motivated Senior AI/ML Data Scientist to pioneer the development and application of cutting-edge machine learning models for computer-aided drug design (CADD) and small molecule discovery. You will be joining a dynamic, cross-disciplinary team of computational scientists, medicinal chemists, and engineers. Your primary focus will be on architecting, training, and deploying sophisticated models to predict molecular properties, generate novel models, and ultimately accelerate our drug discovery pipelines.
To be successful in this role, you will need deep expertise in modern machine learning, particularly generative AI (Transformers, Diffusion Models), Graph Neural Networks, and predictive modeling. You will leverage your skills to tackle complex scientific challenges, working with vast and diverse chemical and biological datasets. If you are passionate about applying state-of-the-art AI to solve fundamental challenges in chemistry and are driven to see your work make a real-world impact on discovering new medicines, we’d love to have you join our team.
Key Responsibilities:
- Design, develop, and optimize state-of-the-art generative models (e.g., Transformers, GNNs, Diffusion Models) for robotic tasks synthetic routes.
- Architect and implement scalable MLOps pipelines for preprocessing large-scale chemical and biological datasets, model training, and rigorous evaluation.
- Translate cutting-edge research in AI/ML into practical solutions that address critical challenges in our drug discovery projects, such as property prediction (ADMET/QSAR), reaction prediction, and binding affinity prediction.
- Collaborate closely with computational chemists, medicinal chemists, and software engineers to define project goals, interpret model outputs, and integrate AI-driven insights into our discovery platform.
- Design and execute robust experiments to evaluate model performance, focusing on chemical validity, novelty, synthesizability, and predictive accuracy against experimental data.
- Clearly communicate complex technical concepts, model results, and strategic recommendations to both technical and non-technical stakeholders.
- Stay at the forefront of AI for drug discovery, foundation models for science, and multimodal learning, continuously identifying and championing opportunities to enhance our capabilities.
What you’ll bring:
- MSc or PhD with 5+ years of industry or academic experience in Computer Science, Machine Learning, Computational Chemistry/Biology, or a closely related field.
- Demonstrated proficiency in Python and deep learning frameworks such as PyTorch or TensorFlow.
- Deep theoretical and practical knowledge of modern machine learning architectures, including Transformers, Graph Neural Networks (GNNs), and generative models (VAEs, GANs, Diffusion Models) as applied to scientific problems.
- Proven ability to lead complex AI/ML projects from concept to deployment in a scientific or drug discovery context.
- Extensive experience working with large-scale molecular datasets (e.g., SMILES, 3D conformations), biological data (e.g., protein sequences, assay data), and other scientific data formats.
- Experience with efficient model training and fine-tuning techniques, such as LoRA, quantization, distillation, and model pruning.
- Strong background or hands-on experience applying ML to problems involving protein structures, small molecule interactions, or related biological data.
- Familiarity with scalable computing environments, GPU acceleration, and distributed training.
- Excellent communication and interpersonal skills for effective collaboration in a multidisciplinary team.
- A collaborative mindset, strong communication skills, and the ability to work effectively within a cross-disciplinary team.
- Excellent problem-solving skills and a proactive, can-do attitude.
- An eagerness to learn new scientific concepts, computational methods, and software engineering practices from experienced mentors.
- Good understanding of version control with Git.
Beneficial Skills:
- Hands-on experience with cheminformatics toolkits such as RDKit.
- Experience with Retrieval-Augmented Generation (RAG) systems, including vector databases (e.g., Redis, FAISS, Milvus, Pinecone) for querying large chemical or biological databases.
- Experience with Protein/DNA language models (e.g., ProtBERT, ESM, Evo) or protein structure prediction models (e.g., AlphaFold-like approaches).
- Experience with evaluation frameworks for reaction and synthetic route design, including human-in-the-loop assessment and metrics for novelty, diversity, and feasibility of synthetic pathways.
- Strong experience with relational and non-relational databases (SQL/NoSQL), including data modeling and efficient querying for large-scale AI workflows.
- A portfolio of projects or open-source contributions (e.g., a GitHub profile) that demonstrates your skills and passion for AI/ML development.
Senior Data Scientist - AI/ML (CADD) in Glasgow employer: Chemify
Contact Detail:
Chemify Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Data Scientist - AI/ML (CADD) in Glasgow
✨Tip Number 1
Network like a pro! Reach out to people in the industry, attend meetups, and connect with professionals on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to AI/ML and drug discovery. This will give potential employers a taste of what you can do and set you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice explaining complex concepts in simple terms, as you'll need to communicate effectively with both technical and non-technical team members.
✨Tip Number 4
Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining our mission at Chemify. Plus, it shows you're proactive and keen to be part of our innovative team.
We think you need these skills to ace Senior Data Scientist - AI/ML (CADD) in Glasgow
Some tips for your application 🫡
Show Off Your Skills: Make sure to highlight your expertise in AI/ML, especially with generative models and predictive modelling. We want to see how your skills can directly contribute to our mission of revolutionising chemistry!
Tailor Your Application: Don’t just send a generic CV! Tailor your application to reflect the specific requirements mentioned in the job description. Show us how your experience aligns with the role of Senior Data Scientist in CADD.
Be Clear and Concise: When writing your application, keep it clear and to the point. We appreciate straightforward communication, so make sure your technical concepts are easy to understand for both technical and non-technical folks.
Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications better and ensures you don’t miss out on any important updates from us!
How to prepare for a job interview at Chemify
✨Know Your Models Inside Out
Make sure you can discuss the generative models mentioned in the job description, like Transformers and Graph Neural Networks. Be prepared to explain how you've applied these in past projects, as well as their strengths and weaknesses in drug discovery.
✨Showcase Your Data Skills
Since you'll be working with large-scale molecular datasets, come ready to talk about your experience with data preprocessing and model training. Bring examples of how you've tackled challenges with datasets in previous roles, especially in a scientific context.
✨Collaborate Like a Pro
This role involves working closely with chemists and engineers, so highlight your teamwork skills. Prepare anecdotes that demonstrate your ability to communicate complex ideas clearly to both technical and non-technical team members.
✨Stay Current with AI Trends
Chemify is looking for someone who stays at the forefront of AI for drug discovery. Be ready to discuss recent advancements in AI/ML and how they could apply to the role. Showing your passion for continuous learning will definitely impress the interviewers.