Data Engineer

Full-Time, £39,636 / year (est.), hybrid working

At a Glance

  • Tasks: Optimise data pipelines and collaborate with bioinformaticians to enhance data processing.
  • Company: Join an international team at a leading macromolecular structure database organisation.
  • Benefits: Enjoy hybrid working, generous leave, and comprehensive health insurance.
  • Other info: Dynamic work environment with excellent career growth and international collaboration.
  • Why this job: Make a real impact in the life sciences while working with cutting-edge technologies.
  • Qualifications: MSc in computer science or bioinformatics, with strong SQL and Python skills.

The predicted salary is £39,636 per year.

About The Team

The Velankar team maintains macromolecular structure databases that are essential resources for biologists and other life scientists worldwide. The Protein Data Bank in Europe (PDBe) is a founding partner of the Worldwide Protein Data Bank (wwPDB) organisation, which maintains the global archive of 3D structural data on macromolecules, the Protein Data Bank (PDB). The PDBe team also develops the PDBe Knowledge Base (PDBe-KB) and the AlphaFold Protein Structure Database (AFDB). The PDBe team is international and interdisciplinary, consisting of expert data curators, bioinformaticians, scientific software developers and IT specialists.

Role Overview

We seek a skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in optimising and enhancing our data pipelines, ensuring efficient data processing, storage and retrieval. You will work closely with cross‑functional teams to analyse requirements, propose new data pipeline architectures, and implement solutions to improve performance and scalability.

Responsibilities

  • Analyse existing data pipelines and identify areas for improvement, optimisation, and scalability.
  • Work closely with bioinformaticians and annotators to integrate data pipelines with existing systems and applications.
  • Monitor data pipeline performance, troubleshoot issues, and implement solutions to ensure reliability and efficiency.
  • Stay current with industry trends and best practices in data engineering and recommend new technologies or tools to enhance data infrastructure.
  • Document data pipelines, processes, and workflows for internal reference and knowledge sharing.

Required Qualifications

  • MSc in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise.
  • Expertise in data modelling and advanced SQL.
  • Proficiency in Python programming.
  • Proficiency in ETL (Extract, Transform, Load) processes and tools for large‑scale data processing.
  • Strong understanding of relational databases with hands‑on experience across multiple RDBMS platforms (PostgreSQL, Oracle, MySQL/MariaDB).
  • Experience with database migration, including Oracle to PostgreSQL projects and associated tools.
  • Proficiency in data warehousing (Redshift, BigQuery).
  • Strong communication and collaboration skills, with the ability to work effectively in a team environment.
  • Proficiency in oral and written English.

Preferred Qualifications

  • PhD in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise.
  • Experience in big data technologies and frameworks such as Apache Spark, Hadoop or similar platforms.
  • Hands‑on experience with CI/CD (GitLab CI/GitHub Actions).
  • Familiarity with Java.
  • Familiarity with Google Cloud Platform or AWS.
  • Familiarity with data modelling techniques for AI and ML applications.
  • Familiarity with Neo4j or other graph databases.
  • Familiarity with data visualisation tools such as Tableau or Power BI.
  • Knowledge of, or affinity with, structural biology and bioinformatics.
  • Experience working in international teams.

Other Helpful Information

  • Hybrid Working: We offer hybrid working options. A dedicated desk will be available every day; the team works two days on site and three from home.
  • Interviews: First‑round technical interviews will be held remotely starting mid‑April 2026.
  • Contract length: Grant‑based contract for 3 years.
  • Salary: Grade 5, starting at £3,303 per month after tax (excluding pension and insurance contributions), plus generous benefits.

Benefits

  • Financial incentives: Monthly family, child and non‑resident allowances, annual salary review, pension scheme, death benefit, and long‑term care, accident‑at‑work and unemployment insurance.
  • Flexible working arrangements – including hybrid working patterns.
  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover).
  • Generous time off: 30 days of annual leave, plus public holidays.
  • Relocation package including installation grant (if required).
  • Campus life: Free shuttle bus to and from work, on‑site library, subsidised on‑site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely).
  • Family benefits: On‑site nursery, 10 days of child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances.
  • Benefits for non‑UK residents: Visa exemption, education grant for private schooling, financial support to travel back to your home country every second year and a monthly non‑resident allowance.

What else you need to know

  • International applicants: We recruit internationally and successful candidates are offered visa exemptions.
  • Diversity and inclusion: At EMBL, we believe that diverse teams drive innovation and scientific excellence. We encourage applications from candidates of all genders, identities, nationalities and any other diverse backgrounds.
  • How to apply: To apply, please submit a cover letter and a CV through our online system. Applications will close at 23:59 CET on the date shown below. We aim to respond within two weeks of the closing date.

Closing Date 06/05/2026

Data Engineer employer: European Bioinformatics Institute | EMBL-EBI

At EMBL, we pride ourselves on being an exceptional employer, offering a collaborative and innovative work culture that fosters professional growth and development. As a Data Engineer in our international team, you will benefit from flexible hybrid working arrangements, generous annual leave, and comprehensive family benefits, all while contributing to groundbreaking research in structural biology. Our commitment to diversity and inclusion ensures a supportive environment where every team member can thrive and make a meaningful impact.

Contact Details:

European Bioinformatics Institute | EMBL-EBI Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the Data Engineer role

Network Like a Pro

Get out there and connect with people in the industry! Attend meetups, webinars, or even just grab a coffee with someone who works in data engineering. Building relationships can lead to job opportunities that aren't even advertised.

Show Off Your Skills

Create a portfolio showcasing your projects and skills. Whether it's a GitHub repository or a personal website, having tangible examples of your work can really impress potential employers. Don't forget to highlight any experience with data pipelines and ETL processes!

Ace the Interview

Prepare for technical interviews by practising common data engineering questions and problems. Brush up on your SQL and Python skills, and be ready to discuss your past projects. Remember, they want to see how you think and solve problems!

Apply Through Our Website

Don't forget to apply directly through our website! It's the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the team at EMBL-EBI!

We think you need these skills to ace the Data Engineer role

Data Modelling
Advanced SQL
Python Programming
ETL Processes
Large-Scale Data Processing
Relational Databases
PostgreSQL

Some tips for your application 🫡

Craft a Tailored Cover Letter: Make sure your cover letter speaks directly to the role of Data Engineer. Highlight your relevant experience and how it aligns with the responsibilities mentioned in the job description. We want to see your passion for data engineering shine through!

Showcase Your Skills in Your CV: Your CV should clearly outline your qualifications, especially your expertise in Data Modelling, SQL, and Python. Use bullet points to make it easy for us to spot your key skills and achievements that relate to the position.

Be Clear and Concise: When writing your application, keep it straightforward. Avoid jargon unless it's relevant to the role. We appreciate clarity and brevity, so make every word count!

Apply Through Our Website: Don’t forget to submit your application through our online system! It’s the best way for us to receive your materials and ensures you’re considered for the role. Plus, it’s super easy to do!

How to prepare for a job interview at European Bioinformatics Institute | EMBL-EBI

Know Your Data Pipelines

Before the interview, brush up on your understanding of data pipelines, especially in the context of bioinformatics. Be ready to discuss how you've optimised or enhanced data processing in previous roles, and think about specific examples that showcase your skills in ETL processes.

Showcase Your Technical Skills

Make sure you can confidently talk about your expertise in SQL, Python, and any big data technologies you've worked with. Prepare to answer technical questions or even solve problems on the spot, as this will demonstrate your proficiency and problem-solving abilities.

Collaborate and Communicate

Since the role involves working closely with cross-functional teams, be prepared to discuss your experience in collaboration. Share examples of how you've effectively communicated complex technical concepts to non-technical team members, highlighting your strong communication skills.

Stay Current with Industry Trends

Research the latest trends in data engineering and bioinformatics. Be ready to discuss new tools or technologies you believe could enhance data infrastructure, showing that you're proactive and engaged with the field. This will impress the interviewers and demonstrate your commitment to continuous learning.