At a Glance
- Tasks: Design and scale experimental datasets for machine learning in biomolecular research.
- Company: Join Microsoft Research AI for Science, a leader in innovative biomedical research.
- Benefits: Collaborative environment, impactful work, and opportunities for professional growth.
- Other info: Work with top researchers and contribute to high-impact projects in a dynamic setting.
- Why this job: Make a real difference in drug discovery and molecular understanding with cutting-edge technology.
- Qualifications: PhD or equivalent experience in science/engineering; strong Python skills required.
The predicted salary is between 50000 - 70000 £ per year.
At Microsoft Research AI for Science, we seek highly motivated Postdoctoral Researchers for experimental data integration into the next Biomolecular Emulator (BioEmu) model. Microsoft Research AI for Science focuses on the development of machine learning and artificial intelligence methods for transforming molecular simulation and discovery of novel materials, drugs, and chemical reactions. The BioEmu project aims to model the dynamics and function of proteins, how they change shape, bind to each other, and bind small molecules. This approach will help us to understand biological function and dysfunction on a structural level and lead to more effective and targeted drug discovery.
The successful candidate will have the opportunity to work on the following:
- Design and scale experimental datasets for ML
- Bridge models with real-world biological measurements (e.g., cryo-em, binding assays)
- Develop workflows that connect noisy experimental signals to actionable model insights
Responsibilities:
- Bridging Models with Real-World Experimental Signals: Develop methods to connect ML models with experimental observables, such as cryo-em density maps, binding affinity/kinetics assays, and proteomics/sequencing data. Enable model inference conditioned on or steered by experimental data. Interpret discrepancies between model predictions and experimental outcomes to guide iteration. Integrate heterogeneous datasets into coherent representations for modeling.
- Experimental Data Strategy & Dataset Development: Design high-quality, ML-ready experimental datasets (e.g., protein interactions, conformational dynamics, binding measurements, cryo-em density). Translate research questions into scalable experimental campaigns with clear success criteria. Define dataset standards, metadata, and quality metrics for downstream modeling. Identify gaps in existing datasets and propose novel data generation strategies.
- Model-Aware Experimental Design: Establish closed-loop workflows where experimental results refine models and vice versa. Define evaluation metrics that reflect real-world biological utility, not just benchmarks.
- Scalable Data Processing & Automation: Build automated, reproducible pipelines for data ingestion, processing, and analysis (Python-based). Develop systems for data curation, QC, and uncertainty estimation on noisy experimental data. Leverage modern tooling (databases, distributed compute, LLM-assisted workflows) to scale beyond manual analysis.
- Collaboration & External Coordination: Partner with ML researchers, computational biologists, and experimental collaborators (academic + CROs). Provide technical guidance on experimental design, data quality, and iteration cycles. Translate between disciplines to ensure alignment between model needs and experimental outputs.
- Independent Research & Impact: Contribute to novel methods at the model–experiment interface. Publish research, release datasets/software, and shape internal research direction. Drive projects from ambiguous ideas to high-impact, usable artifacts.
Qualifications:
Required Qualifications:
- Completed or nearly complete PhD or equivalent experience in a science or engineering discipline.
- Deep expertise in at least one relevant area, such as machine learning for biomolecular systems, molecular modeling and simulation, structural biology, experimental protein assays, or statistical mechanics.
- Strong Python skills and experience building data analysis, modeling, or machine learning pipelines.
- Experience working with real-world biological, structural, experimental, or molecular datasets.
- Ability to work across disciplines and communicate complex ideas clearly.
- Track record of independently owning and delivering research projects.
Preferred Qualifications:
- Experience connecting computational models to experimental data, such as cryo-EM, X-ray, NMR, SPR, mass spectrometry, NGS, or other assay readouts.
- Background in generative models, diffusion models, representation learning, molecular dynamics, or statistical mechanics for biomolecular systems.
- Experience with large-scale dataset generation, curation, or automated analysis workflows.
- Familiarity with experimental workflows such as protein expression, purification, interaction assays, or high-throughput systems.
- Interest in closing the loop between modeling and experiment.
- Experience or interest in drug discovery, therapeutics, or real-world biomedical applications.
- Ability to collaborate with external partners and align research goals with practical health challenges.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled. Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations, and ordinances.
If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
AI for Science Postdoctoral Researcher - Biomolecular AI & Experimental Data Integration in Cambridge employer: Microsoft
At Microsoft Research AI for Science, we pride ourselves on fostering a dynamic and inclusive work culture that encourages innovation and collaboration. As a Postdoctoral Researcher, you will have access to cutting-edge resources and the opportunity to contribute to impactful research in biomolecular AI, all while working alongside leading experts in the field. Our commitment to employee growth is evident through our supportive environment, where you can develop your skills and advance your career in a meaningful way.
StudySmarter Expert Advice🤫
We think this is how you could land AI for Science Postdoctoral Researcher - Biomolecular AI & Experimental Data Integration in Cambridge
✨Tip Number 1
Network like a pro! Reach out to people in your field on LinkedIn or at conferences. A friendly chat can lead to opportunities that aren’t even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to ML and biomolecular systems. This gives potential employers a taste of what you can do.
✨Tip Number 3
Prepare for interviews by practising common questions and scenarios specific to AI and experimental data integration. We recommend doing mock interviews with friends or mentors to boost your confidence.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive about their job search.
We think you need these skills to ace AI for Science Postdoctoral Researcher - Biomolecular AI & Experimental Data Integration in Cambridge
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter for the role. Highlight your relevant experience in machine learning, biomolecular systems, and any specific projects that align with the BioEmu model. We want to see how your skills fit into our exciting research!
Showcase Your Skills:Don’t just list your qualifications; demonstrate them! Include examples of your Python skills and any data analysis or modelling pipelines you've built. We love seeing practical applications of your expertise, so make it shine!
Be Clear and Concise:When writing your application, keep it straightforward. Use clear language to explain complex ideas, especially if you're bridging disciplines. We appreciate clarity, and it helps us understand your thought process better!
Apply Through Our Website:We encourage you to submit your application through our website. It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re keen on joining our team at Microsoft Research AI for Science!
How to prepare for a job interview at Microsoft
✨Know Your Stuff
Make sure you have a solid understanding of the key concepts related to biomolecular AI and experimental data integration. Brush up on your knowledge of machine learning methods, molecular simulation, and any relevant experimental techniques. This will help you answer technical questions confidently.
✨Showcase Your Experience
Prepare to discuss your previous research projects in detail, especially those that involved connecting computational models with experimental data. Highlight specific challenges you faced and how you overcame them. This will demonstrate your problem-solving skills and ability to work independently.
✨Ask Insightful Questions
Come prepared with thoughtful questions about the BioEmu project and the team’s current challenges. This shows your genuine interest in the role and helps you understand how you can contribute effectively. It also gives you a chance to assess if the team is the right fit for you.
✨Practice Collaboration Scenarios
Since this role involves working with ML researchers and experimental collaborators, think of examples where you've successfully collaborated across disciplines. Be ready to discuss how you communicate complex ideas clearly and ensure alignment between model needs and experimental outputs.