Data Engineer in Cambridge

Data Engineer in Cambridge

Cambridge Full-Time 39000 - 52000 ÂŁ / year (est.) No home office possible
C

At a Glance

  • Tasks: Build and optimise data pipelines for cutting-edge healthcare diagnostics.
  • Company: Join Cyted Health, a leader in gastrointestinal health innovation.
  • Benefits: Competitive salary, 25 days holiday, medical insurance, and learning budget.
  • Why this job: Make a real impact on patient outcomes with your data engineering skills.
  • Qualifications: Experience in data engineering, Python, and cloud platforms required.
  • Other info: Collaborative culture with opportunities for growth and innovation.

The predicted salary is between 39000 - 52000 ÂŁ per year.

This range is provided by Cyted Health. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

As a Data Engineer at Cyted, you’ll build the data infrastructure that powers our diagnostics and research. You’ll transform experimental workflows into reliable, production‑grade data pipelines, implementing reproducible ingestion and analysis processes (primarily using Nextflow) and developing automation and orchestration for both operational and research workloads. You’ll establish strong data governance and observability practices, ensuring datasets are versioned, catalogued, and fully traceable from source to output. Security and compliance will be embedded in everything you design, meeting the standards required for regulated healthcare and diagnostics environments. You’ll work closely with computational biologists in R&D and software engineers in the Technology team to translate scientific and product requirements into scalable, maintainable solutions. Alongside delivery, you’ll maintain clear technical documentation, contribute to code reviews, and help raise engineering standards across the team.

The role is a full‑time position with a standard 37.5‑hour working week. The role holder may be required to work flexibly. The Data Engineer will be based at Cyted’s Head Office, Ground Floor Building 3 Old Swiss, 149 Cherry Hinton Road, Cambridge, United Kingdom, CB1 7BX.

What You Will Be Doing

  • Pipeline Design and Development: Build, maintain, and optimise scalable data ingestion and analysis pipelines using workflow engines such as Nextflow. Translate scientific and analytical prototypes into robust, reproducible, and automated workflows suitable for production use. Create modular, testable components and establish clear versioning to ensure reproducibility across environments.
  • Data Architecture and Governance: Design and maintain data models, storage solutions, and metadata catalogues that support efficient querying and lineage tracking. Implement and enforce data governance practices, including data classification, retention policies, and access control frameworks. Maintain comprehensive lineage tracking (e.g., with OpenLineage or equivalent) and ensure auditability of all datasets.
  • Automation, Monitoring, and Reliability: Develop orchestration and scheduling frameworks to automate both operational and R&D pipelines. Implement observability practices — monitoring, alerting, and automated recovery — to ensure high reliability and performance. Drive continuous improvement in efficiency, scalability, and cost optimisation of data workflows across AWS/GCP/Azure.
  • Security and Compliance: Embed security‑by‑design principles into all data handling, including encryption, authentication, and secrets management. Ensure all pipelines and data stores comply with regulatory requirements relevant to diagnostics and healthcare (e.g., ISO27001, ISO13485, CLIA/CAP, GDPR). Contribute to technical documentation and evidence for audits and certification processes.
  • Collaboration and Communication: Partner with computational biologists and product engineers to define data requirements and shape infrastructure decisions. Provide technical mentorship and guidance to team members on data engineering best practices. Document systems and processes through runbooks, design specifications, and operational guides. Contribute to code reviews, internal knowledge‑sharing sessions, and cross‑functional project planning.
  • Innovation and Continuous Improvement: Evaluate and integrate new technologies to improve data processing, observability, and scalability. Identify and remove bottlenecks in the data lifecycle — from ingestion to reporting — to accelerate insight generation. Support the adoption of modern DevOps and MLOps approaches for scientific and product data pipelines.

How We Work

At Cyted, how we work is just as important as what we’re building. Our values shape how we collaborate, innovate, and deliver for patients and partners. As our Data Engineer, you’ll bring these values to life from day one. We care deeply about data integrity, patient outcomes, and the clinicians who rely on our insights. In this role, care means building systems that are accurate, traceable, and resilient—because real people depend on the results we generate. You’ll take pride in clean code, reproducible pipelines, and the knowledge that every dataset you shape contributes to earlier, better diagnosis. We expect you to own the work and contributions to your functions with confidence and curiosity. You’ll be responsible for designing and maintaining the infrastructure that connects our science, operations, and technology. You’ll take initiative, move with purpose, and be trusted to make critical decisions that keep our data ecosystem secure, scalable, and compliant. We aim high. We’re scaling fast, working across complex regulated environments, and pushing boundaries in how data accelerates diagnostics. You’ll be empowered to build with ambition—optimising workflows, streamlining automation, and helping define what great data engineering looks like in healthcare. You’ll be expected to dive deep into the science, the systems, and the standards. You’ll understand the technical and regulatory nuance behind every workflow, and you’ll be just as comfortable debugging a Nextflow pipeline as you are explaining architecture decisions to cross‑functional teams. You won’t just maintain systems, you’ll actively improve them. We encourage everyone to challenge and commit. You’ll help shape how we work as a data‑led company, questioning assumptions, sharing ideas, and being open to better ways. But once we align, you’ll deliver with clarity, ownership, and precision. And most of all, we deliver. This is a role for someone who thrives on progress, who builds with intent and sees impact in every successful workflow run, every insight delivered, and every patient outcome improved.

Person Specification

  • A degree in Computer Science, Bioinformatics, Computational Biology, or a related field—or equivalent practical experience.
  • 2–3 years of industry experience working in a regulated data environment (e.g., biotech, healthtech, or clinical diagnostics).
  • Proven experience designing and maintaining reliable data pipelines on AWS, GCP, or Azure.
  • Strong proficiency in Python, with solid Linux/Bash fundamentals.
  • Hands‑on experience with at least one workflow engine (e.g., Nextflow, Snakemake).
  • Familiarity with version control systems (Git, GitHub) and CI/CD best practices.
  • Working knowledge of regulated frameworks (CLIA, CAP, IVD, ISO27001, ISO13485) and audit readiness requirements.
  • Understanding of NGS data, associated tools, and standard QC practices.
  • Experience with data cataloging and governance platforms (e.g., DataHub), lineage tracking (e.g., OpenLineage), and access control management.
  • Knowledge of Infrastructure‑as‑Code (e.g., Terraform), identity and secrets management (IAM), and cloud cost optimisation at scale.
  • Exposure to the R programming language and genomics workflows such as RNAseq, single‑cell, or structural variant/CNV pipelines.
  • A strong focus on testing, monitoring, and observability to ensure data integrity and reliability.
  • Clear, concise communication and a collaborative approach to problem‑solving.

Benefits

  • Salary in the range of ÂŁ45,000 – ÂŁ65,000 per annum depending on your skills and experience.
  • 25 days holiday per holiday year, plus public holidays.
  • Pension scheme.
  • An annual learning and development budget.
  • Medical insurance including dental and optical cover.
  • Life/critical illness cover.
  • Social events including Christmas and summer parties.
  • Cycle to work scheme.
  • Electric Vehicle Scheme.
  • Sabbatical: 4 years of service.

About Us

We are a leading gastrointestinal health company delivering minimally invasive diagnostics to transform access to esophageal care. Our EndoSign test combines a simple, swallowable device with cutting‑edge laboratory biomarkers and analytics to detect esophageal cancer and its precursor, Barrett’s esophagus. Operating across the US and UK life‑science hub, with hybrid, remote and onsite teams, we are expanding our pipeline to address new high‑impact targets across gastroenterology and related fields. You’ll join a close‑knit team of experts who collaborate daily to translate breakthrough ideas into real‑world solutions. At Cyted Health, every voice matters. Whether you’re in R&D, Commercialisation, Medical Affairs or Operations, you’ll have the chance to lead projects, influence strategy, and broaden your skill set across the company. We champion diverse backgrounds and perspectives, fostering an inclusive culture where everyone can thrive and innovate. If you’re inspired by purpose, motivated by challenge, and eager to make a meaningful impact on patient lives, we’d love to hear from you. We usually recruit on a rolling basis.

Data Engineer in Cambridge employer: Cyted Health

Cyted Health is an exceptional employer that prioritises innovation and collaboration in the healthcare sector. Located in the vibrant city of Cambridge, employees benefit from a supportive work culture that encourages professional growth through continuous learning and development opportunities. With a strong focus on data integrity and patient outcomes, Cyted fosters an inclusive environment where every team member's contributions are valued, making it an ideal place for those looking to make a meaningful impact in diagnostics and research.
C

Contact Detail:

Cyted Health Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Data Engineer in Cambridge

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

✨Tip Number 2

Prepare for those interviews! Research Cyted Health, understand their values, and be ready to discuss how your skills align with their mission. Practise common interview questions and have examples ready that showcase your experience in data engineering.

✨Tip Number 3

Show off your projects! If you've built any data pipelines or worked on relevant projects, make sure to highlight them during interviews. Having tangible examples of your work can really set you apart from other candidates.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team at Cyted Health.

We think you need these skills to ace Data Engineer in Cambridge

Data Pipeline Design
Nextflow
Data Governance
Data Architecture
AWS
GCP
Azure
Python
Linux/Bash
Version Control (Git, GitHub)
CI/CD Best Practices
Regulatory Frameworks (CLIA, CAP, ISO27001, ISO13485)
Data Cataloging and Governance Platforms
Infrastructure-as-Code (Terraform)
Monitoring and Observability

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Data Engineer role. Highlight your experience with data pipelines, Nextflow, and any relevant projects that showcase your skills in a regulated environment. We want to see how your background aligns with what we do!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about data engineering and how you can contribute to our mission at Cyted. Be sure to mention specific experiences that relate to the job description.

Showcase Your Technical Skills: Don’t forget to highlight your technical skills, especially in Python, cloud platforms, and workflow engines. We love seeing hands-on experience, so include any relevant projects or achievements that demonstrate your expertise.

Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and ensure it reaches the right people. Plus, it shows you’re serious about joining our team!

How to prepare for a job interview at Cyted Health

✨Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, especially Nextflow, Python, and cloud platforms like AWS, GCP, or Azure. Brush up on your knowledge of data governance and compliance standards relevant to healthcare, as these will likely come up during the interview.

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've designed and optimised data pipelines or tackled challenges in a regulated environment. Use the STAR method (Situation, Task, Action, Result) to structure your answers and clearly demonstrate your impact.

✨Ask Insightful Questions

Interviews are a two-way street! Prepare thoughtful questions about the team dynamics, the projects you'll be working on, and how they measure success in this role. This shows your genuine interest in the position and helps you assess if it's the right fit for you.

✨Emphasise Collaboration

Since the role involves working closely with computational biologists and software engineers, highlight your teamwork experiences. Share examples of how you’ve collaborated on projects, contributed to code reviews, or mentored others, showcasing your ability to communicate effectively across disciplines.

Data Engineer in Cambridge
Cyted Health
Location: Cambridge

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

C
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>