Data Engineer - Science

Data Engineer - Science

Full-Time 50000 - 70000 € / year (est.) No home office possible
Qureight Ltd

At a Glance

  • Tasks: Design and implement data infrastructure for large-scale machine learning workflows.
  • Company: Join Qureight, a pioneering company accelerating clinical trials with AI technology.
  • Benefits: Enjoy competitive benefits, including private medical insurance and 25 days annual leave.
  • Other info: Diverse team environment with opportunities for continuous learning and growth.
  • Why this job: Make a real impact in healthcare by working with cutting-edge imaging data.
  • Qualifications: Experience as a Data Engineer with strong Python skills and cloud infrastructure knowledge.

The predicted salary is between 50000 - 70000 € per year.

Qureight’s mission is to accelerate clinical trials and ensure breakthroughs in lung and heart disease reach patients without delay. Our AI-powered data and imaging curation platform enables the analysis of clinical imaging and other healthcare data, helping our customers bring treatments to market, faster. We’re looking for talented people who want their work to matter. With offices in Cambridge and London, you’ll join our multidisciplinary team of clinicians, scientists, and engineers. What unites us is our open culture, continuous learning mindset, and a shared mission to help biopharma run faster, smarter trials.

About the role

As Qureight scales its AI-driven imaging platform and advances development of foundation models and disease-specific AI models, we are building the data engineering capability required to support large-scale data preparation for machine learning. We are looking for a Data Engineer to focus on preparing and managing large imaging datasets (including CT scans and DICOM metadata) for use in machine learning workflows. This role sits within the Science function and works closely with Machine Learning Scientists as well as other Data Engineers to ensure that data is delivered in a consistent, high-quality, and efficient format ready for model development. It will focus on designing and implementing the next iteration of our data infrastructure to accelerate our integration of machine learning into clinical trials.

What you will do

  • Collaborate on designing and implementing new data infrastructure and pipelines preparing data for large-scale ML workflows
  • Care about data quality, and ensuring the pipelines you build are robust, scalable, and maintainable
  • Work with DICOM data to feed into foundation model and disease-specific imaging model development
  • Collaborate closely with Machine Learning Scientists, DevOps Engineers, and other Data Engineers to create a tight feedback loop and ensure the end-to-end process is effective and efficient
  • Ensure that our data processes have quality and compliance designed in from the start to make reproducibility, lineage tracking, and data quality painless
  • Scale pipelines to handle millions of scans – ingesting the imaging data, transforming it, filtering and structuring ready for foundation model development.

What we need

  • Proven experience as a Data Engineer in complex, data-rich environments
  • Strong programming skills in Python
  • Experience building and maintaining production ML data pipelines, including orchestration tools such as Dagster and cloud infrastructure on AWS
  • Experience with Docker and Kubernetes based infrastructure
  • Experience working with large datasets
  • Understanding of data preprocessing and quality control for machine learning
  • Strong collaboration skills with machine learning or technical teams

Even better if you have experience of

  • Medical imaging data such as CT, MRI, or DICOM
  • Large-scale datasets or foundation model workflows
  • Deployment tooling (Helm and familiarity with Gitops tooling such as Flux and Kustomize)
  • Data versioning and reproducibility frameworks
  • Database design and data modelling
  • Working in regulated or GxP or ISO 13485 environments
  • Experience with ML experiment tracking or metadata management (MLFlow)

Benefits

  • A comprehensive benefits package that includes an annual bonus plan, private medical insurance, life insurance, and a contributory pension scheme
  • 25 days annual leave, plus bank holidays and enhanced maternity leave
  • A diverse work environment that brings together experts in many fields, including software engineering, devops, data science, machine learning, quality assurance, regulatory affairs, and clinical operations.

Everyone is welcome at Qureight. We are an equal opportunities employer and encourage applications from all suitably qualified candidates regardless of age, disability, ethnicity, sex, gender reassignment, religion or belief, sexual orientation, marriage and civil partnership, or pregnancy and maternity. Women and other underrepresented groups may be less likely to apply for a role unless they meet all or nearly all of the requirements. If this applies to you, we still encourage you to apply – you may be a great fit, even if you don’t meet every qualification. We’d love to hear from you. If you require any adjustments to the application or selection process, please let us know. We will be happy to support you.

Data Engineer - Science employer: Qureight Ltd

At Qureight, we pride ourselves on being an exceptional employer, offering a dynamic work environment in the heart of Cambridge and London. Our commitment to continuous learning and collaboration fosters a culture where every team member's contributions are valued, and our comprehensive benefits package ensures that you are well-supported both personally and professionally. Join us in our mission to revolutionise clinical trials and make a real impact in healthcare, while enjoying opportunities for growth and development within a diverse and inclusive team.

Qureight Ltd

Contact Detail:

Qureight Ltd Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Data Engineer - Science

Tip Number 1

Network like a pro! Reach out to people in the industry, especially those at Qureight. A friendly chat can open doors and give you insights that a job description just can't.

Tip Number 2

Show off your skills! If you've got a portfolio or GitHub with projects related to data engineering or machine learning, make sure to highlight them. It’s a great way to demonstrate your expertise beyond just words.

Tip Number 3

Prepare for the interview by brushing up on relevant technologies like Python, DICOM, and cloud infrastructure. Be ready to discuss how you've tackled challenges in past projects – real examples go a long way!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our mission at Qureight.

We think you need these skills to ace Data Engineer - Science

Data Engineering
Python Programming
Machine Learning Pipelines
DICOM Data Management
Cloud Infrastructure (AWS)
Docker
Kubernetes

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match the Data Engineer role. Highlight your programming skills in Python and any experience with DICOM data or large datasets. We want to see how your background aligns with our mission!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share why you’re passionate about accelerating clinical trials and how your expertise can contribute to our AI-driven imaging platform. Let us know what makes you tick and why you want to join our team.

Showcase Collaboration Skills:Since we value teamwork, mention any past experiences where you collaborated with machine learning scientists or technical teams. We love seeing how you’ve worked together to solve problems and improve processes!

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our culture and values.

How to prepare for a job interview at Qureight Ltd

Know Your Data Inside Out

Before the interview, make sure you’re well-versed in the types of data Qureight works with, especially DICOM and imaging datasets. Brush up on your experience with large-scale data pipelines and be ready to discuss specific projects where you ensured data quality and compliance.

Showcase Your Collaboration Skills

Since this role involves working closely with Machine Learning Scientists and other engineers, prepare examples that highlight your teamwork. Think of times when you collaborated on a project, how you communicated effectively, and how you contributed to a successful outcome.

Demonstrate Your Technical Proficiency

Be ready to talk about your programming skills, particularly in Python, and your experience with tools like Dagster, Docker, and Kubernetes. You might even want to prepare a mini-case study or example of a pipeline you built, focusing on its scalability and robustness.

Prepare Questions That Matter

Interviews are a two-way street! Prepare thoughtful questions about Qureight’s data infrastructure and how they integrate machine learning into clinical trials. This shows your genuine interest in their mission and helps you assess if it’s the right fit for you.