Data Engineer - Science in Cambridge

Data Engineer - Science in Cambridge

Cambridge Full-Time 50000 - 70000 € / year (est.) No home office possible
Qureight Ltd

At a Glance

  • Tasks: Design and implement data infrastructure for large-scale machine learning workflows.
  • Company: Join Qureight, a pioneering company accelerating clinical trials with AI technology.
  • Benefits: Enjoy a comprehensive benefits package, including private medical insurance and 25 days annual leave.
  • Other info: Diverse team environment with opportunities for continuous learning and growth.
  • Why this job: Make a real impact in healthcare by working with cutting-edge imaging data.
  • Qualifications: Experience as a Data Engineer with strong Python skills and knowledge of ML pipelines.

The predicted salary is between 50000 - 70000 € per year.

Qureight’s mission is to accelerate clinical trials and ensure breakthroughs in lung and heart disease reach patients without delay. Our AI-powered data and imaging curation platform enables the analysis of clinical imaging and other healthcare data, helping our customers bring treatments to market, faster. We’re looking for talented people who want their work to matter. With offices in Cambridge and London, you’ll join our multidisciplinary team of clinicians, scientists, and engineers. What unites us is our open culture, continuous learning mindset, and a shared mission to help biopharma run faster, smarter trials.

About the role

As Qureight scales its AI-driven imaging platform and advances development of foundation models and disease-specific AI models, we are building the data engineering capability required to support large-scale data preparation for machine learning. We are looking for a Data Engineer to focus on preparing and managing large imaging datasets (including CT scans and DICOM metadata) for use in machine learning workflows. This role sits within the Science function and works closely with Machine Learning Scientists as well as other Data Engineers to ensure that data is delivered in a consistent, high-quality, and efficient format ready for model development. It will focus on designing and implementing the next iteration of our data infrastructure to accelerate our integration of machine learning into clinical trials.

What you will do

  • Collaborate on designing and implementing new data infrastructure and pipelines preparing data for large-scale ML workflows
  • Care about data quality, and ensuring the pipelines you build are robust, scalable, and maintainable
  • Work with DICOM data to feed into foundation model and disease-specific imaging model development
  • Collaborate closely with Machine Learning Scientists, DevOps Engineers, and other Data Engineers to create a tight feedback loop and ensure the end-to-end process is effective and efficient
  • Ensure that our data processes have quality and compliance designed in from the start to make reproducibility, lineage tracking, and data quality painless
  • Scale pipelines to handle millions of scans – ingesting the imaging data, transforming it, filtering and structuring ready for foundation model development.

What we need

  • Proven experience as a Data Engineer in complex, data-rich environments
  • Strong programming skills in Python
  • Experience building and maintaining production ML data pipelines, including orchestration tools such as Dagster and cloud infrastructure on AWS
  • Experience with Docker and Kubernetes based infrastructure
  • Experience working with large datasets
  • Understanding of data preprocessing and quality control for machine learning
  • Strong collaboration skills with machine learning or technical teams

Even better if you have experience of

  • Medical imaging data such as CT, MRI, or DICOM
  • Large-scale datasets or foundation model workflows
  • Deployment tooling (Helm and familiarity with Gitops tooling such as Flux and Kustomize)
  • Data versioning and reproducibility frameworks
  • Database design and data modelling
  • Working in regulated or GxP or ISO 13485 environments
  • Experience with ML experiment tracking or metadata management (MLFlow)

Benefits

  • A comprehensive benefits package that includes an annual bonus plan, private medical insurance, life insurance, and a contributory pension scheme
  • 25 days annual leave, plus bank holidays and enhanced maternity leave
  • A diverse work environment that brings together experts in many fields, including software engineering, devops, data science, machine learning, quality assurance, regulatory affairs, and clinical operations.

Everyone is welcome at Qureight. We are an equal opportunities employer and encourage applications from all suitably qualified candidates regardless of age, disability, ethnicity, sex, gender reassignment, religion or belief, sexual orientation, marriage and civil partnership, or pregnancy and maternity. Women and other underrepresented groups may be less likely to apply for a role unless they meet all or nearly all of the requirements. If this applies to you, we still encourage you to apply – you may be a great fit, even if you don’t meet every qualification. We’d love to hear from you. If you require any adjustments to the application or selection process, please let us know. We will be happy to support you.

Data Engineer - Science in Cambridge employer: Qureight Ltd

At Qureight, we pride ourselves on being an exceptional employer, fostering a collaborative and inclusive work culture that empowers our employees to make a meaningful impact in the healthcare sector. With offices in the vibrant cities of Cambridge and London, we offer a comprehensive benefits package, including private medical insurance and generous annual leave, alongside ample opportunities for professional growth within our multidisciplinary team of experts. Join us in our mission to revolutionise clinical trials and bring life-saving treatments to patients faster, all while working in an environment that values continuous learning and innovation.

Qureight Ltd

Contact Detail:

Qureight Ltd Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Data Engineer - Science in Cambridge

Tip Number 1

Network like a pro! Reach out to people in the industry, especially those at Qureight. A friendly chat can open doors and give you insights that a job description just can't.

Tip Number 2

Show off your skills! If you've got a portfolio or GitHub with projects related to data engineering or machine learning, make sure to highlight them. It’s a great way to demonstrate your expertise beyond the CV.

Tip Number 3

Prepare for the interview by brushing up on relevant technologies like Python, DICOM, and cloud infrastructure. Be ready to discuss how you've tackled challenges in past projects – real-world examples go a long way!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in being part of our mission at Qureight.

We think you need these skills to ace Data Engineer - Science in Cambridge

Data Engineering
Python Programming
Machine Learning Pipelines
DICOM Data Management
Cloud Infrastructure (AWS)
Docker
Kubernetes

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match the Data Engineer role. Highlight your programming skills in Python and any experience with DICOM data or large datasets. We want to see how your background aligns with our mission!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share why you’re passionate about accelerating clinical trials and how your expertise can contribute to our AI-driven imaging platform. Let us know what makes you tick and why you want to join our team.

Showcase Collaboration Skills:Since we value teamwork, mention any past experiences where you collaborated with machine learning scientists or engineers. We’re looking for someone who can work closely with others to create effective data pipelines, so don’t hold back on those examples!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our awesome team at Qureight!

How to prepare for a job interview at Qureight Ltd

Know Your Data Inside Out

Make sure you’re familiar with the types of data Qureight works with, especially DICOM and imaging datasets. Brush up on your understanding of how these datasets are used in machine learning workflows, as this will show your genuine interest and readiness for the role.

Showcase Your Programming Skills

Be prepared to discuss your experience with Python and any orchestration tools like Dagster. Bring examples of past projects where you built or maintained ML data pipelines, as this will demonstrate your hands-on experience and technical prowess.

Collaboration is Key

Highlight your teamwork skills by sharing experiences where you collaborated with machine learning scientists or engineers. Qureight values a tight feedback loop, so showing that you can work effectively in a multidisciplinary team will set you apart.

Prepare for Technical Questions

Expect questions about data quality, preprocessing, and compliance. Brush up on best practices in these areas and be ready to discuss how you ensure robust and scalable data pipelines. This will help you convey your expertise and alignment with Qureight’s mission.