Data Curation Developer
Data Curation Developer

Data Curation Developer

Gravesend Full-Time 36000 - 60000 £ / year (est.) No home office possible
Go Premium
Gsk

At a Glance

  • Tasks: Curate and harmonise data for impactful R&D analysis in a collaborative environment.
  • Company: Join GSK, a leader in healthcare innovation with a focus on inclusivity.
  • Benefits: Enjoy competitive salary, bonuses, healthcare, and flexible hybrid working options.
  • Why this job: Make a real difference by transforming data into valuable insights for healthcare advancements.
  • Qualifications: BSc/MSc/PhD in relevant fields and experience with scientific clinical data.
  • Other info: Dynamic team culture with opportunities for professional growth and development.

The predicted salary is between 36000 - 60000 £ per year.

421498 Data Curation Developer

This role focuses on the technical experience required to curate (e.g. pre-process, harmonize, wrangle and contextualise) data to produce high-quality data assets for R&D analysis. The aim is to support GSK’s Disease Area Strategies and other key R&D priority areas by making data analysis-ready, enabling efficient and effective decision-making across various therapeutic areas.

Please note that depending on experience level, candidates may be considered at either the G6 or G7 level.

We create a place where people can grow, be their best, be safe, and feel welcome, valued and included. We offer a competitive salary, an annual bonus based on company performance, healthcare and wellbeing programmes, pension plan membership, and shares and savings programme.

We embrace modern work practises; our Performance with Choice programme offers a hybrid working model, empowering you to find the optimal balance between remote and in-office work.

Discover more about our company wide benefits and life at GSK on our webpage Life at GSK | GSK

In this role you will

  • Lead the development of business requirements for data curation through collaboration with R&D business and data platform teams.
  • Maintain strong connections with analytical groups and R&D Data Platform teams to ensure seamless data integration and usage.
  • Deliver pre-packaged, curated (e.g. pre-process, harmonize, wrangle, contextualise and/or anonymise) datasets aligned to business requirements for analytics, which includes documenting data specification that clearly describes the required processing steps to generate analysis-ready datasets ensuring providence, lineage and privacy requirements is maintained.
  • Integrate diverse datasets (e.g., clinical trials, real-world data, omics) into a unified format for consistent analysis.
  • Ensure all datasets meet analysis-ready and privacy requirements by performing necessary data curation activities (e.g. pre-process, contextualise and/or anonymise).
  • Provide coaching and peer review to ensure that the team’s work reflects industry best practices for data curation activities, including data privacy and anonymization standards.
  • Ensure that datasets are processed to meet conditions mentioned in the approved data re-use request (e.g., remove subjects from countries that do not allow re-use). Write clean, readable code.
  • Ensure that deliverables are appropriately quality controlled, documented, and when required, can be handed over to R&D Tech team for production pipeline implementation.

Why you?

Basic Qualifications & Skills

We are looking for professionals with these required skills to achieve our goals:

  • BSc/MSc/PhD (or equivalent) in Computer Science, Mathematics, Statistics, or related subject
  • Proven experience of handling various modalities of scientific clinical data such as clinical trial data (including biomarkers), real world data (RWD), omics etc.
  • Experience in Python, Databricks, Delta Lake, PySpark, Pandas, other data engineering frameworks and applying them to achieve industry standards-compliant datasets
  • Proven ability to handle and process large structured, semi-structured, and unstructured datasets efficiently
  • Strong communication skills and expertise to translate business needs into technical data requirements and processes
  • Ability to quantify and provide insights to business impact and value creation from data curation activities
  • Experience with at least one of the industry data standards such as CDISC(ODM: CDASH, SDTM, ADaM), HL7 FHIR, OMOP(CDM) etc.

Preferred Qualifications & Skills

Please note the following skills are not necessary, just preferred, if you do not have them, please still apply:

  • Experience in R
  • Agile mindset with the ability to deliver prototypes quickly and iterate improvements based on stakeholder feedback
  • Experience with digital clinical trials protocol and Unified Study Definition Model (USDM)Experience in data modelling

Closing Date for Applications – 26th of October, 2025 (COB)

Please take a copy of the Job Description, as this will not be available post closure of the advert.

When applying for this role, please use the ‘cover letter’ of the online application or your CV to describe how you meet the competencies for this role, as outlined in the job requirements above. The information that you have provided in your cover letter and CV will be used to assess your application.

Why GSK?

Uniting science, technology, and talent to get ahead of disease together.

GSK is an Equal Opportunity Employer. This ensures that all qualified applicants will receive equal consideration for employment without regard to race, color, religion, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, genetic information (including family medical history), military service or any basis prohibited under federal, state or local law.

Should you require any adjustments to our process to assist you in demonstrating your strengths and capabilities contact UKRecruitment.Adjustments@gsk.com or 0808 234 4391. The helpline is available from 8.30am to 12.00 noon Monday to Friday, during bank holidays these times and days may vary.

For more information, please visit the Centers for Medicare and Medicaid Services (CMS) website at https://openpaymentsdata.cms.gov/

#J-18808-Ljbffr

Data Curation Developer employer: Gsk

GSK is an exceptional employer that prioritises employee growth and well-being, offering a competitive salary, annual bonuses, and comprehensive healthcare programmes. With a strong commitment to inclusivity and modern work practices, including a hybrid working model, GSK fosters a collaborative environment where data professionals can thrive while contributing to impactful R&D initiatives in a dynamic and supportive culture.
Gsk

Contact Detail:

Gsk Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Data Curation Developer

✨Network Like a Pro

Get out there and connect with folks in the industry! Attend meetups, webinars, or even just grab a coffee with someone who works at GSK. Building relationships can open doors that a CV just can't.

✨Show Off Your Skills

When you get the chance to chat with potential employers, don’t hold back! Share specific examples of how you've tackled data curation challenges in the past. This is your moment to shine and show them what you can bring to the table.

✨Tailor Your Approach

Make sure you understand GSK’s mission and values. When you’re networking or interviewing, weave in how your skills and experiences align with their goals. It shows you’re not just looking for any job, but you’re genuinely interested in being part of their team.

✨Apply Through Our Website

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re serious about joining GSK and ready to contribute to our mission.

We think you need these skills to ace Data Curation Developer

Data Curation
Data Pre-processing
Data Harmonisation
Data Wrangling
Data Contextualisation
Python
Databricks
Delta Lake
PySpark
Pandas
Data Integration
Data Privacy
Anonymisation Standards
Communication Skills
Industry Data Standards

Some tips for your application 🫡

Tailor Your Cover Letter: Make sure to customise your cover letter for the Data Curation Developer role. Highlight your relevant experience with data curation and how it aligns with GSK’s goals. We want to see your passion for data and how you can contribute to our team!

Showcase Your Technical Skills: In your CV, don’t hold back on showcasing your technical skills! Mention your experience with Python, Databricks, and any other tools you've used. We’re looking for candidates who can handle various data modalities, so make sure to highlight that experience.

Be Clear and Concise: When writing your application, clarity is key! Use straightforward language and avoid jargon where possible. We appreciate a well-structured application that gets straight to the point, making it easy for us to see your qualifications.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way to ensure your application gets to us directly. Plus, you’ll find all the details about the role and our company culture there, which can help you tailor your application even more.

How to prepare for a job interview at Gsk

✨Know Your Data Inside Out

Make sure you’re well-versed in the types of data you'll be working with, like clinical trial data and real-world data. Brush up on your knowledge of data curation processes such as pre-processing, harmonising, and anonymising datasets. This will help you confidently discuss how you can contribute to making data analysis-ready.

✨Showcase Your Technical Skills

Be prepared to talk about your experience with Python, Databricks, and other data engineering frameworks. Bring examples of projects where you've successfully handled large datasets and applied industry standards. If you can, demonstrate your coding skills during the interview to show you can write clean, readable code.

✨Communicate Clearly

Strong communication is key in this role. Practice explaining complex technical concepts in simple terms, as you’ll need to translate business needs into technical requirements. Think of examples where you’ve successfully collaborated with teams to ensure seamless data integration and usage.

✨Prepare for Scenario Questions

Expect questions that assess your problem-solving abilities. Prepare for scenarios where you might need to integrate diverse datasets or ensure compliance with data privacy standards. Think through your approach to these challenges and be ready to share your thought process during the interview.

Data Curation Developer
Gsk
Location: Gravesend
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>