At a Glance
- Tasks: Design and build AI solutions for scientific discovery using machine learning and NLP.
- Company: Join a forward-thinking tech company focused on research intelligence and knowledge discovery.
- Benefits: Enjoy flexible working hours, wellbeing initiatives, and opportunities for professional growth.
- Other info: Collaborative environment with a commitment to work/life balance and equal opportunity.
- Why this job: Make a real impact by turning complex challenges into practical AI solutions for researchers.
- Qualifications: Experience in data science, machine learning, and strong Python skills required.
The predicted salary is between 50000 - 70000 £ per year.
Are you excited by the opportunity to use machine learning, NLP, and generative AI to help researchers discover knowledge faster and make better decisions? Would you enjoy turning complex scientific and business challenges into practical, production‑ready AI solutions that create real user value?
About the Role
In this role, you will design and build machine learning, NLP, and generative AI solutions that support scientific discovery, knowledge extraction, decision support, and intelligent content understanding. You will work with large‑scale scientific content and data, applying the right techniques to solve complex problems and deliver reliable, production‑ready systems. Working closely with cross‑functional partners, you will help turn ambiguous challenges into measurable outcomes that improve how researchers discover and use knowledge.
Responsibilities
- Design and build machine learning, NLP, and generative AI systems for scientific discovery, knowledge extraction, decision support, and intelligent content understanding.
- Work with large‑scale, complex, and heterogeneous data, including scientific publications, research datasets, knowledge graphs, ontologies, taxonomies, citations, metadata, and content from every scientific discipline.
- Apply the right technique to each problem, using approaches such as classification, regression, clustering, ranking, feature engineering, deep learning, embeddings, LLMs, retrieval, and generative AI.
- Develop capabilities for semantic search, information retrieval, entity extraction, content classification, recommendation, ranking, summarization, question answering, and evidence‑grounded generation.
- Build, evaluate, fine‑tune, prompt, and integrate models into robust production systems, while continuously improving quality, relevance, reliability, and user value.
- Write clean, tested, production‑quality Python and contribute reusable data science components, packages, and scalable data pipelines for preprocessing, inference, experimentation, monitoring, and continuous improvement.
- Support deployment, monitoring, model maintenance, drift detection, automated retraining, and ongoing optimization of data science systems.
- Collaborate with engineering, product, UX, analytics, research, and domain experts, and communicate technical concepts, model behavior, insights, trade‑offs, and recommendations clearly to technical and non‑technical audiences.
Requirements
- Experience in data science, machine learning, artificial intelligence, NLP, statistics, applied mathematics, computer science, or a related quantitative area.
- Experience working with frontier LLMs such as OpenAI’s GPTs, Anthropic’s Claude, and Google’s Gemini, including fine‑tuning LLMs and/or SLMs.
- Strong Python skills and a habit of writing clean, maintainable, well‑tested code.
- A solid grasp of machine learning fundamentals, including supervised and unsupervised learning, feature engineering, model evaluation, model selection, and performance measurement.
- Experience working with structured, semi‑structured, or unstructured data, especially large‑scale text or content datasets.
- Familiarity with common data science and machine learning tools such as Pandas, NumPy, SciPy, Scikit‑learn, PyTorch, TensorFlow, or Matplotlib.
- The ability to translate complex and ambiguous requirements into practical, measurable, data‑driven solutions, with strong analytical thinking, problem‑solving skills, and attention to quality.
- Clear communication skills, a collaborative approach to working with engineering, product, and business stakeholders, and a genuine interest in building production‑ready systems that deliver real user value.
Working Pattern
Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive.
Benefits
We promote a healthy work/life balance across the organisation. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance, and sabbaticals, we will help you meet your immediate responsibilities and your long‑term goals.
Equal Opportunity Employer
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know.
Data Scientist II employer: RELX INC
Join a forward-thinking organisation that champions innovation and collaboration, where as a Data Scientist II, you will leverage cutting-edge machine learning and AI technologies to drive scientific discovery. Our commitment to employee wellbeing is reflected in our flexible working hours, extensive support for professional development, and a culture that values work-life balance, ensuring you thrive both personally and professionally. With a focus on meaningful contributions and a diverse, inclusive environment, we empower you to turn complex challenges into impactful solutions.