Bioinformatics Data Engineer (RNA Resources)

Bioinformatics Data Engineer (RNA Resources)

Full-Time 39636 - 39636 £ / year (est.) Home office (partial)
E

At a Glance

  • Tasks: Develop and maintain databases for RNA biology, optimising data pipelines and workflows.
  • Company: Join a leading bioinformatics team at the forefront of RNA research.
  • Benefits: Competitive salary, hybrid working, generous leave, and family-friendly perks.
  • Other info: Diverse and inclusive environment with excellent career growth opportunities.
  • Why this job: Make a real impact in RNA science while working with cutting-edge technologies.
  • Qualifications: Master’s degree in a relevant field and proficiency in Python and bioinformatics tools.

The predicted salary is between 39636 - 39636 £ per year.

About The Team

Rfam and RNAcentral are key resources for RNA biology, serving tens of thousands of users every year and widely cited in the scientific literature. We are recruiting a Bioinformatics Data Engineer to develop and maintain both the Rfam and RNAcentral databases. They are funded by the BBSRC and Wellcome. The RNA Resources team is part of the Sequence Families group led by Alex Bateman. The role reports to the Project Leader for RNA Resources and works closely with an RNA bioinformatician, two full-stack software developers, and an Rfam biocurator.

Responsibilities

  • Run, maintain, and optimise data pipelines to ensure efficient processing, storage, and retrieval for Rfam and RNAcentral.
  • Analyse requirements and propose new data pipeline architectures that improve performance and scalability.
  • Analyse existing data curation and data production pipelines and identify areas for improvement, optimisation, and scalability.
  • Modernise and containerise Rfam curation pipelines, and implement human‑in‑the‑loop, AI‑assisted agentic curation.
  • Develop and scale LLM pipelines used in RNAcentral for literature summarisation and curation.
  • Develop scalable workflows for ncRNA annotation in genomes.
  • Document data pipelines, processes, and workflows for internal reference and knowledge sharing.
  • Participate in RNAcentral and Rfam data releases.
  • Outreach to the scientific community through presentations at major conferences and consortium meetings.
  • Keep up to date with the latest developments in RNA science to ensure the resources provide valuable data and analysis.

Qualifications

  • Master’s level or equivalent qualification in a computational, biological or related scientific discipline.
  • Proficiency in Python and other relevant languages for bioinformatics tool development.
  • Experience with relational databases (PostgreSQL, MySQL) and SQL: knowledge of database architecture, performance tuning, partitioning strategies, indexing techniques, and query optimisation.
  • Track record of developing and maintaining production bioinformatics pipelines with workflow management systems such as Nextflow or Snakemake.
  • Experience building applications with LLMs and other AI technologies.
  • Familiarity with Docker or other containerisation technologies, such as Singularity.
  • Comfortable using Git/GitHub, Unix, and Bash.
  • Experience with AI assisted coding tools.
  • Ability to apply best‑practice software development methodologies.
  • Strong communication skills.

Preferred Qualifications

  • Knowledge of RNA biology and practical experience with Rfam, Infernal, R‑scape, and tools for secondary structure prediction.
  • Familiarity with gene annotation or genome feature representation.
  • Experience with high‑performance computing environments such as Slurm.
  • Experience planning and executing data migration projects, including downtime management, data consistency verification, and rollback strategies.
  • Experience with AI workflow libraries such as LangChain and LangGraph.
  • Experience with Kubernetes and cloud infrastructure platforms such as OpenStack.
  • Experience with the Rust programming language.

Other Helpful Information

  • Hybrid Working: two days from the office in Hinxton per week, with flexibility for onsite work.
  • Contract length: 3 years (grant‑based contract).
  • Salary: Grade 5 monthly salary starting at £3,303 per month after tax (excluding pension and insurance contributions).
  • Benefits: monthly family, child, and non‑resident allowances; annual salary review; pension scheme; death benefit; long‑term care; accident‑at‑work and unemployment insurances; private medical insurance for you and your immediate family; 30 days annual leave plus public holidays; relocation package with installation grant if required; campus life facilities; family benefits including on‑site nursery and generous parental leave; benefits for non‑UK residents such as visa exemption and monthly non‑resident allowance.

Diversity & Inclusion

We believe diverse teams drive innovation and scientific excellence. We encourage applications from candidates of all genders, identities, nationalities, and any other diverse backgrounds.

Closing Date 28/06/2026

Bioinformatics Data Engineer (RNA Resources) employer: European Bioinformatics Institute | EMBL-EBI

As a leading employer in the field of bioinformatics, we offer a dynamic work environment where innovation thrives. Our team is dedicated to advancing RNA biology through cutting-edge research and technology, providing employees with opportunities for professional growth and collaboration with experts in the field. Located in Hinxton, our hybrid working model promotes a healthy work-life balance, complemented by comprehensive benefits including generous leave, family support, and a commitment to diversity and inclusion.

E

Contact Details:

European Bioinformatics Institute | EMBL-EBI Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Bioinformatics Data Engineer (RNA Resources)

Tip Number 1

Network like a pro! Reach out to folks in the RNA biology and bioinformatics community. Attend conferences, webinars, or local meetups to connect with potential colleagues and learn about job openings that might not be advertised.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to bioinformatics pipelines or AI technologies. This will give you an edge when chatting with hiring managers and demonstrate your hands-on experience.

Tip Number 3

Prepare for interviews by brushing up on common bioinformatics questions and coding challenges. Practice explaining your past projects and how they relate to the role of a Bioinformatics Data Engineer. Confidence is key!

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive and engaged with our resources.

We think you need these skills to ace Bioinformatics Data Engineer (RNA Resources)

Python
Bioinformatics Tool Development
Relational Databases (PostgreSQL, MySQL)
SQL
Database Architecture
Performance Tuning
Workflow Management Systems (Nextflow, Snakemake)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Bioinformatics Data Engineer role. Highlight your experience with data pipelines, Python, and any relevant projects that showcase your skills in bioinformatics. We want to see how you fit into our team!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about RNA biology and how your background makes you a great fit for Rfam and RNAcentral. Let us know what excites you about this opportunity!

Showcase Your Technical Skills:Don’t forget to mention your proficiency in tools like PostgreSQL, Docker, and any experience with AI technologies. We’re looking for someone who can hit the ground running, so make sure we see your technical chops!

Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to keep track of your application and ensure it gets the attention it deserves. We can’t wait to hear from you!

How to prepare for a job interview at European Bioinformatics Institute | EMBL-EBI

Know Your Pipelines

Make sure you understand the data pipelines relevant to Rfam and RNAcentral. Brush up on your knowledge of workflow management systems like Nextflow or Snakemake, as well as how to optimise and modernise these pipelines. Being able to discuss specific examples of your past work with bioinformatics pipelines will really impress.

Showcase Your Coding Skills

Since proficiency in Python and other programming languages is key, be prepared to demonstrate your coding abilities. You might be asked to solve a problem on the spot, so practice coding challenges beforehand. Familiarity with AI-assisted coding tools can also give you an edge, so don’t forget to mention any experience you have.

Communicate Clearly

Strong communication skills are essential for this role. Be ready to explain complex bioinformatics concepts in simple terms, especially when discussing your previous projects. Practising how you present your ideas can help you convey your thoughts more effectively during the interview.

Stay Updated on RNA Science

Keep yourself informed about the latest developments in RNA biology. This not only shows your passion for the field but also helps you engage in meaningful discussions during the interview. Mention any recent papers or conferences you've attended that relate to RNA resources, as this demonstrates your commitment to staying current.