At a Glance
- Tasks: Optimise data pipelines and collaborate with bioinformaticians to enhance data processing.
- Company: Join an international team at a leading macromolecular structure database.
- Benefits: Enjoy hybrid working, generous leave, and comprehensive health insurance.
- Other info: Dynamic environment with excellent career growth and international collaboration opportunities.
- Why this job: Make a real impact in life sciences while working with cutting-edge technologies.
- Qualifications: MSc in computer science or bioinformatics, with strong SQL and Python skills.
The predicted salary is £39,636 per year.
About The Team
The Velankar team maintains macromolecular structure databases that form essential resources for biologists and other life scientists worldwide. PDBe is a founding partner of the Worldwide Protein Data Bank organisation, which maintains the global archive of 3D structural data on macromolecules, the Protein Data Bank (PDB). The PDBe team also develops the PDBe Knowledge Base (PDBe-KB) and the AlphaFold Protein Structure Database (AFDB). The PDBe team is international and interdisciplinary, and consists of expert data curators, bioinformaticians, scientific software developers and IT specialists.
Role Overview
We seek a skilled and motivated Data Engineer to join our dynamic team. As a Data Engineer, you will play a crucial role in optimising and enhancing our data pipelines, ensuring efficient data processing, storage and retrieval. You will work closely with cross‑functional teams to analyse requirements, propose new data pipeline architectures, and implement solutions to improve performance and scalability.
Responsibilities
- Analyse existing data pipelines and identify areas for improvement, optimisation, and scalability.
- Work closely with bioinformaticians and annotators to integrate data pipelines with existing systems and applications.
- Monitor data pipeline performance, troubleshoot issues, and implement solutions to ensure reliability and efficiency.
- Stay current with industry trends and best practices in data engineering and recommend new technologies or tools to enhance data infrastructure.
- Document data pipelines, processes, and workflows for internal reference and knowledge sharing.
Required Qualifications
- MSc in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise.
- Expertise in data modelling and advanced SQL.
- Proficiency in Python programming.
- Proficiency in ETL (Extract, Transform, Load) processes and tools for large‑scale data processing.
- Strong understanding of relational databases with hands‑on experience across multiple RDBMS platforms (PostgreSQL, Oracle, MySQL/MariaDB).
- Experience with database migration, including Oracle to PostgreSQL projects and associated tools.
- Proficiency in data warehousing (Redshift, BigQuery).
- Strong communication and collaboration skills, with the ability to work effectively in a team environment.
- Proficiency in oral and written English.
Preferred Qualifications
- PhD in computer science, IT or a related field, or in bioinformatics with demonstrated IT expertise.
- Experience in big data technologies and frameworks such as Apache Spark, Hadoop or similar platforms.
- Hands‑on experience with CI/CD (GitLab CI/GitHub Actions).
- Familiarity with Java.
- Familiarity with Google Cloud Platform or AWS.
- Familiarity with data modelling techniques for AI and ML applications.
- Familiarity with Neo4j or other graph databases.
- Familiarity with data visualisation tools such as Tableau or Power BI.
- Knowledge of, or affinity with, structural biology and bioinformatics.
- Experience working in international teams.
Other Helpful Information
- Hybrid Working: We offer hybrid working options. A dedicated desk will be available every day; the team works two days on site and three from home.
- Interviews: First‑round technical interviews will be held remotely starting mid‑April 2026.
- Contract length: Grant‑based contract for 3 years.
- Salary: Grade 5 monthly salary starting at £3,303 per month after tax (excluding pension and insurance contributions) plus generous benefits.
Benefits
- Financial incentives: Monthly family, child and non‑resident allowances, annual salary review, pension scheme, death benefit, and long‑term care, accident‑at‑work and unemployment insurance.
- Flexible working arrangements – including hybrid working patterns.
- Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover).
- Generous time off: 30 days annual leave per year, plus public holidays.
- Relocation package including installation grant (if required).
- Campus life: Free shuttle bus to and from work, on‑site library, subsidised on‑site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely).
- Family benefits: On‑site nursery, 10 days of child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances.
- Benefits for non‑UK residents: Visa exemption, education grant for private schooling, financial support to travel back to your home country every second year and a monthly non‑resident allowance.
What else you need to know
- International applicants: We recruit internationally and successful candidates are offered visa exemptions.
- Diversity and inclusion: At EMBL, we believe that diverse teams drive innovation and scientific excellence. We encourage applications from candidates of all genders, identities, nationalities and any other diverse backgrounds.
- How to apply: To apply, please submit a cover letter and a CV through our online system. Applications will close at 23:59 CET on the date shown below. We aim to provide a response within two weeks of the closing date.
Closing Date 06/05/2026
Data Engineer employer: European Bioinformatics Institute | EMBL-EBI
Contact Detail:
European Bioinformatics Institute | EMBL-EBI Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Data Engineer role
✨Tip Number 1
Network like a pro! Reach out to people in the industry, attend meetups or webinars, and connect with current employees at the company. A friendly chat can sometimes lead to job opportunities that aren't even advertised!
✨Tip Number 2
Prepare for those interviews! Research common data engineering questions and practice your answers. We recommend doing mock interviews with friends or using online platforms to get comfortable with the format.
✨Tip Number 3
Show off your skills! Create a portfolio showcasing your projects, especially those related to data pipelines or bioinformatics. This gives you a chance to demonstrate your expertise beyond just your CV.
✨Tip Number 4
Apply through the employer's online system! It's the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining the team at EMBL-EBI!
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Data Engineer role. Highlight your experience with data pipelines, SQL, and Python. We want to see how your skills match what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about data engineering and how you can contribute to our team. Keep it engaging and relevant to the job description.
Showcase Your Projects: If you've worked on any relevant projects, make sure to mention them! Whether it's a personal project or something from your previous job, we love seeing practical examples of your skills in action.
Apply Through Our Website: Don't forget to apply through our online system! It's the easiest way for us to keep track of your application and ensures you get considered for the role. We can't wait to hear from you!
How to prepare for a job interview at European Bioinformatics Institute | EMBL-EBI
✨Know Your Data Pipelines
Before the interview, brush up on your understanding of data pipelines, especially in the context of bioinformatics. Be ready to discuss how you've optimised or enhanced data processing in previous roles, and think about specific examples that showcase your skills in ETL processes.
✨Showcase Your Technical Skills
Make sure you can confidently talk about your expertise in SQL, Python, and any big data technologies you've worked with. Prepare to answer technical questions or even solve problems on the spot, as this will demonstrate your proficiency and problem-solving abilities.
✨Collaborate and Communicate
Since the role involves working closely with cross-functional teams, be prepared to discuss your experience in collaboration. Share examples of how you've effectively communicated complex technical concepts to non-technical team members, highlighting your strong communication skills.
✨Stay Current with Industry Trends
Research the latest trends in data engineering and bioinformatics. Be ready to discuss new tools or technologies you believe could enhance data infrastructure. This shows your passion for the field and your commitment to continuous learning, which is crucial for a Data Engineer.