Bioinformatics Data Engineer (RNA Resources) in Saffron Walden

Bioinformatics Data Engineer (RNA Resources) in Saffron Walden

Saffron Walden Full-Time 39636 - 39636 £ / year (est.) No working from home possible
E

At a Glance

  • Tasks: Develop and optimise data pipelines for RNA databases, enhancing performance and scalability.
  • Company: Join a leading bioinformatics team at EMBL-EBI, shaping RNA research globally.
  • Benefits: Enjoy hybrid working, generous leave, private medical insurance, and family allowances.
  • Other info: Diverse and inclusive workplace with excellent career growth opportunities.
  • Why this job: Make a real impact in RNA science while collaborating with top experts in the field.
  • Qualifications: Masters in a relevant field, Python proficiency, and experience with bioinformatics pipelines.

The predicted salary is between 39636 - 39636 £ per year.

About the Team

Rfam and RNAcentral are key resources for RNA biology, serving tens of thousands of users every year and widely cited in the scientific literature. We are recruiting a Bioinformatics Data Engineer to develop and maintain both the Rfam and RNAcentral databases. They are currently funded by the BBSRC and Wellcome. The RNA Resources team is part of the Sequence Families group led by Alex Bateman. You will be reporting to the Project Leader for RNA Resources, and working closely with an RNA bioinformatician, two full-stack software developers, and an Rfam biocurator.

Your role

As a Bioinformatics Data Engineer, you will run, maintain and optimise our data pipelines, ensuring efficient data processing, storage and retrieval for Rfam and RNAcentral. You will work closely with cross-functional teams to analyse requirements, propose new data pipeline architectures, and implement solutions to improve performance and scalability.

The tasks will include:

  • Analysing existing data curation and data production pipelines and identifying areas for improvement, optimisation, and scalability.
  • Modernising and containerising Rfam curation pipelines, and implementing human-in-the-loop, AI-assisted agentic curation.
  • Developing and scaling LLM pipelines used in RNAcentral for literature summarisation and curation.
  • Developing scalable workflows for ncRNA annotation in genomes.
  • Documenting data pipelines, processes, and workflows for internal reference and knowledge sharing.
  • Participating in RNAcentral and Rfam data releases.

You will also be responsible for outreach to the scientific community through presentations at major conferences such as the RNA Society Annual Meeting and ISMB. Additionally, you will present at the RNAcentral consortium meetings and Scientific Advisory Board meetings, gathering regular feedback from community members. Finally, you are expected to keep up to date with the latest developments in RNA science to ensure the resources continue to provide our diverse users with valuable data and analysis.

You should be passionate about RNA science and want to help the scientific community. RNAcentral and Rfam are widely used resources, and this role offers the opportunity to shape the work of many RNA researchers worldwide.

You have

  • Masters level or equivalent qualification in a computational, biological or related scientific discipline.
  • Proficiency in Python and other relevant languages for bioinformatics tool development.
  • Experience with relational databases (PostgreSQL, MySQL) and SQL: knowledge of PostgreSQL and MySQL database architecture, performance tuning, partitioning strategies, indexing techniques, and query optimisation.
  • Demonstrated track record of developing and maintaining production bioinformatics pipelines with workflow management systems such as Nextflow or Snakemake.
  • Experience building applications with LLMs and other AI technologies.
  • Familiarity with Docker or other containerisation technologies, such as Singularity.
  • Comfortable using Git/GitHub, Unix, and Bash.
  • Experience of using AI assisted coding tools.
  • Ability to apply best-practice software development methodologies.
  • Strong communication skills.

You may also have

  • Knowledge of RNA biology and/or demonstrable practical experience with Rfam, Infernal, R-scape and tools for secondary structure prediction.
  • Familiarity with gene annotation or genome feature representation.
  • Experience with high-performance computing environments such as Slurm.
  • Experience in planning and executing data migration projects, including downtime management, data consistency verification, and rollback strategies.
  • Experience with AI workflow libraries such as LangChain and LangGraph.
  • Experience with Kubernetes and cloud infrastructure platforms such as OpenStack.
  • Experience with the Rust programming language.

Other helpful information

Hybrid Working: At EMBL-EBI we are pleased to offer hybrid working options for all our employees. You would be required to work 2 days from the office in Hinxton (currently this is Monday and Tuesday), with the flexibility to come on site more often if preferred.

Interviews: We plan to hold introductory meetings with selected candidates remotely starting in Early July, following this we plan to hold panel interviews remotely in Mid July.

Apply now!

Benefits and Contract Information

  • Financial incentives: depending on circumstances, monthly family/marriage allowance of £278 monthly child allowance of £336 per child. Non resident allowance up to £569 per month. Annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurances.
  • Hybrid working arrangements.
  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover).
  • Generous time off: 30 days annual leave per year, in addition to eight bank holidays.
  • Relocation package including installation grant (as applicable).
  • Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely).
  • Family benefits: On-site nursery, child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances.
  • Contract duration: This position is a 3 year contract.
  • Salary: Monthly salary starting at £3,303 after tax (but excl. pension & insurances) + benefits (Total package will be dependent on family circumstances).
  • International applicants: We recruit internationally and successful candidates are offered visa exemptions. Read more on our page for international applicants.
  • Diversity and inclusion: At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ+ and individuals from all nationalities.
  • Job location: This role is based in Hinxton, near Cambridge, UK. You will be required to relocate if you are based overseas and you will receive a generous relocation package to support you.

How to apply: To apply please submit a cover letter and a CV through our online system before the closing date. Applications will close on 28/06/2026.

Bioinformatics Data Engineer (RNA Resources) in Saffron Walden employer: EMBL-EBI

At EMBL-EBI, we pride ourselves on being an exceptional employer, offering a collaborative and innovative work culture that fosters professional growth in the field of bioinformatics. Our hybrid working model allows for flexibility while our generous benefits package, including private medical insurance, extensive annual leave, and family support initiatives, ensures a healthy work-life balance. Located in Hinxton near Cambridge, you will be part of a vibrant scientific community dedicated to advancing RNA research, with opportunities for outreach and engagement at major conferences.

E

Contact Details:

EMBL-EBI Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Bioinformatics Data Engineer (RNA Resources) in Saffron Walden

Tip Number 1

Network like a pro! Reach out to folks in the RNA and bioinformatics community on LinkedIn or at conferences. A friendly chat can lead to opportunities that aren’t even advertised yet.

Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving Python, databases, and bioinformatics pipelines. This gives potential employers a taste of what you can do.

Tip Number 3

Prepare for interviews by brushing up on common bioinformatics questions and coding challenges. Practice explaining your past projects clearly, focusing on how you improved data pipelines or worked with AI technologies.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are genuinely interested in joining our team.

We think you need these skills to ace Bioinformatics Data Engineer (RNA Resources) in Saffron Walden

Python
Bioinformatics Tool Development
Relational Databases (PostgreSQL, MySQL)
SQL
Data Pipeline Development
Workflow Management Systems (Nextflow, Snakemake)
AI Technologies

Some tips for your application 🫡

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Make sure to highlight your passion for RNA science and how your skills align with the role. We want to see your personality, so don’t be afraid to let it show!

Tailor Your CV:Customise your CV to reflect the specific requirements of the Bioinformatics Data Engineer position. Emphasise your experience with Python, databases, and bioinformatics pipelines. We love seeing relevant examples that showcase your expertise!

Showcase Your Projects:If you've worked on any cool projects related to RNA biology or bioinformatics, make sure to mention them! We’re interested in your hands-on experience, especially with tools like Nextflow or Snakemake.

Apply Through Our Website:Don’t forget to submit your application through our online system! It’s the easiest way for us to keep track of your application and ensure it gets the attention it deserves. We can’t wait to hear from you!

How to prepare for a job interview at EMBL-EBI

Know Your RNA

Make sure you brush up on your RNA biology knowledge before the interview. Familiarise yourself with Rfam and RNAcentral, as well as any recent developments in RNA science. This will not only show your passion for the field but also help you engage in meaningful discussions with the interviewers.

Showcase Your Technical Skills

Be prepared to discuss your experience with Python, SQL, and bioinformatics tools. Bring examples of past projects where you've developed or maintained data pipelines. If you've worked with Docker or AI technologies, highlight those experiences too, as they are crucial for this role.

Prepare for Problem-Solving Questions

Expect questions that assess your ability to analyse and optimise data pipelines. Think about specific challenges you've faced in previous roles and how you overcame them. Practising these scenarios can help you articulate your thought process clearly during the interview.

Communicate Effectively

Strong communication skills are essential for this role, especially since you'll be working with cross-functional teams. Practice explaining complex technical concepts in simple terms. This will demonstrate your ability to collaborate and share knowledge effectively with others.