Bioinformatics Data Engineer (RNA Resources)

Full-Time, £39,636 per year (est.), partial home office

At a Glance

  • Tasks: Develop and maintain RNA databases, optimise data pipelines, and implement AI-assisted curation.
  • Company: Join a leading bioinformatics team at EMBL with a focus on RNA biology.
  • Benefits: Hybrid working, competitive salary, generous leave, and private medical insurance.
  • Other info: Dynamic work environment with opportunities for professional growth and community outreach.
  • Why this job: Make a real impact in RNA science while working with cutting-edge technologies.
  • Qualifications: Master’s degree in a relevant field and proficiency in Python and bioinformatics tools.

The predicted salary is approximately £39,636 per year.

Rfam and RNAcentral are key resources for RNA biology, serving tens of thousands of users yearly and cited widely in the literature. This role is funded by the BBSRC and Wellcome and is part of the Sequence Families group led by Alex Bateman. The Bioinformatics Data Engineer will develop and maintain both the Rfam and RNAcentral databases, reporting to the Project Leader for RNA Resources and collaborating with bioinformaticians, developers, and curators.

Responsibilities

  • Run, maintain and optimise data pipelines, ensuring efficient data processing, storage and retrieval for the RNA resources.
  • Analyse existing curation and production pipelines, identifying opportunities for improvement, optimisation and scalability.
  • Modernise and containerise Rfam curation pipelines, implementing human‑in‑the‑loop AI‑assisted curation.
  • Develop and scale large‑language‑model pipelines used in RNAcentral for literature summarisation and curation.
  • Define scalable workflows for ncRNA annotation in genomes.
  • Document pipelines, processes and workflows for internal reference and knowledge sharing.
  • Participate in RNAcentral and Rfam data releases.
  • Provide outreach to the scientific community through presentations at major conferences (e.g., RNA Society Annual Meeting, ISMB) and convene regular feedback sessions with consortium members.
  • Keep updated with the latest developments in RNA science to ensure the resources continue to serve user needs.

Qualifications

  • Master’s level or equivalent qualification in a computational, biological or related scientific discipline.
  • Proficiency in Python and other relevant bioinformatics programming languages.
  • Experience with relational databases (PostgreSQL, MySQL); understanding of database architecture, performance tuning, partitioning, indexing and query optimisation.
  • Track record of developing and maintaining production bioinformatics pipelines using workflow management systems such as Nextflow or Snakemake.
  • Experience building applications that use large‑language‑model and other AI technologies.
  • Familiarity with containerisation (Docker, Singularity) and cloud infrastructure such as OpenStack.
  • Comfortable using Git/GitHub, Unix shell and Bash.
  • Experience with AI‑assisted coding tools.
  • Strong communication skills and ability to apply best‑practice software development methodologies.
  • Knowledge of RNA biology and practical experience with Rfam, Infernal, R‑scape or secondary‑structure prediction tools (optional but desirable).
  • Familiarity with gene annotation or genome feature representation.
  • Experience in high‑performance computing environments such as Slurm.
  • Experience planning and executing data‑migration projects, including downtime management, data consistency verification and rollback strategies.
  • Experience with AI workflow libraries such as LangChain or LangGraph.
  • Experience with Kubernetes and advanced cloud platforms.
  • Experience with the Rust programming language.

Benefits & Compensation

  • Hybrid working: 2 days per week from the office in Hinxton (Monday and Tuesday), with flexibility to come on site more often.
  • Contract length: 3 years (grant‑based).
  • Salary: Grade 5, starting at £3,303 per month after tax, excluding pension and insurance contributions.
  • Monthly family, child and non‑resident allowances; annual salary review; pension scheme; death benefit; long‑term care; accident‑at‑work and unemployment insurances.
  • Private medical insurance for employee and immediate family (includes prescriptions and dental & optical cover).
  • Generous time off: 30 days of annual leave plus public holidays; additional family leave (child sick, parental, holiday clubs).
  • Free shuttle bus to/from work, on‑site library and subsidised gym and cafeteria.
  • Family benefits: on‑site nursery, 10 days child sick leave, generous parental leave.
  • Visa exemption and educational grant for private schooling for non‑UK residents.
  • Regular social club and sports activities on campus and remotely.

Legal & Equal Opportunity Statement

EMBL is a signatory of DORA and encourages applications from candidates of all genders, identities, nationalities and backgrounds. We offer visa exemptions to international applicants.

Closing Date

Applications will close at 23:59 CET on 28 June 2026.

Bioinformatics Data Engineer (RNA Resources) employer: European Molecular Biology Laboratory

At EMBL, we pride ourselves on being an exceptional employer, offering a collaborative and innovative work culture that fosters professional growth and development. As a Bioinformatics Data Engineer in Hinxton, you will benefit from hybrid working arrangements, generous annual leave, and comprehensive family support, all while contributing to groundbreaking RNA research that impacts the scientific community. Our commitment to employee well-being is reflected in our extensive benefits package, including private medical insurance and access to on-site facilities, making EMBL a truly rewarding place to advance your career.

Contact Details:

European Molecular Biology Laboratory Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Bioinformatics Data Engineer (RNA Resources)

Tip Number 1

Network like a pro! Reach out to folks in the bioinformatics community, especially those who work with RNA resources. Attend conferences and engage in discussions; you never know who might be looking for someone just like you!

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving Python, databases, and AI technologies. This will give potential employers a taste of what you can bring to the table.

Tip Number 3

Don’t just apply anywhere—apply through our website! Tailor your application to highlight your experience with data pipelines and bioinformatics tools. Make it clear how you can contribute to Rfam and RNAcentral.

Tip Number 4

Prepare for interviews by brushing up on your knowledge of RNA biology and the latest trends in bioinformatics. Be ready to discuss how you’ve optimised workflows and tackled challenges in past projects.

We think you need these skills to ace Bioinformatics Data Engineer (RNA Resources)

Python
Bioinformatics Programming Languages
Relational Databases (PostgreSQL, MySQL)
Database Architecture
Performance Tuning
Workflow Management Systems (Nextflow, Snakemake)
Containerisation (Docker, Singularity)

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Bioinformatics Data Engineer role. Highlight your experience with Python, relational databases, and any relevant bioinformatics projects. We want to see how your skills match what we're looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about RNA biology and how your background makes you a great fit for our team. Keep it engaging and personal – we love to see your personality come through!

Showcase Your Projects: If you've worked on any relevant projects, especially those involving data pipelines or AI technologies, make sure to mention them. We’re keen to see real examples of your work and how you’ve tackled challenges in the past.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way to ensure your application gets to us directly. Plus, you’ll find all the details you need about the role and our team there.

How to prepare for a job interview at European Molecular Biology Laboratory

Know Your Pipelines

Make sure you understand the data pipelines relevant to RNA resources. Be ready to discuss your experience with workflow management systems like Nextflow or Snakemake, and how you've optimised or modernised similar pipelines in the past.

Showcase Your Coding Skills

Brush up on your Python and any other programming languages mentioned in the job description. Prepare to demonstrate your proficiency through coding challenges or by discussing past projects where you used these skills effectively.

Familiarise Yourself with Databases

Since relational databases are key for this role, be prepared to talk about your experience with PostgreSQL or MySQL. Highlight any specific instances where you improved database performance or implemented query optimisation.

Communicate Clearly

Strong communication skills are essential. Practice explaining complex bioinformatics concepts in simple terms, as you may need to present your ideas to non-technical stakeholders or during outreach activities.