Senior Cloud Reliability Engineer in Cambridge

Cambridge | Full-Time | £60,000 - £80,000 / year (est.) | No working from home possible
Jagex

At a Glance

  • Tasks: Partner with teams to enhance cloud-native architectures and improve service reliability.
  • Company: Join a leading gaming company in Cambridge with a focus on innovation.
  • Benefits: Enjoy private healthcare, flexible hours, generous leave, and a performance bonus.
  • Other info: Inclusive workplace committed to equal opportunities and career growth.
  • Why this job: Make a real impact on cloud services while working in a dynamic environment.
  • Qualifications: Experience in cloud reliability, AWS expertise, and strong Linux skills required.

The predicted salary is between £60,000 and £80,000 per year.

Location: Cambridge, UK – Applicants should be based (or willing to relocate) within a comfortable commuting distance of our office to attend onsite as required.

What you’ll be doing:

  • Partner with game and development teams to move services toward cloud-native architectures, improving resilience, security and cost efficiency across live environments.
  • Support the migration of workloads from managed VPS environments onto Jagex’s cloud platform, helping teams modernise safely without compromising uptime.
  • Define, embed and improve SLIs, SLOs and error‑budget thinking so service reliability is measurable and better understood across teams.
  • Design and enhance observability and alerting across logs, metrics and traces, giving teams faster insight into issues and reducing time to detection.
  • Automate operational tasks such as scaling, failover and deployments, while building self‑healing mechanisms that reduce toil and improve recovery.
  • Contribute hands‑on reliability improvements across Linux-based production systems, reusable IaC modules and team codebases, while helping raise engineering standards across Cloud Tech.

What we’re looking for:

  • Proven experience owning reliability for large-scale, internet-facing services in production.
  • Demonstrable AWS expertise across services such as VPC, EC2, ECS/EKS, ELB, ECR, Route53, KMS, IAM and Systems Manager.
  • Proven capability in cloud-native design, workload modernisation and Infrastructure as Code delivery.
  • Strong practical experience with SLIs, SLOs, incident response, root cause analysis and resilient system design.
  • Demonstrable production experience with Debian-based Linux environments, virtual machine fleet management and configuration management tooling.
  • Hands‑on experience with observability platforms, CI/CD, containerisation and programming or scripting in Python or Java.

What we offer:

  • Private Healthcare, including Dental Plan.
  • Discretionary annual performance bonus.
  • Minimum 6% Pension contributions.
  • Life Insurance.
  • Enhanced family leave policies from day 1.
  • Flexible working hours.
  • 25 days annual leave plus bank holidays, with the option to buy or sell holidays, and much more.

Inclusion & Accessibility Statement: We are committed to providing equal opportunities and creating an environment where everyone can thrive. We welcome applications from all backgrounds, and we recruit, develop, and promote based on merit and ability. If you require any reasonable adjustments to support you during the recruitment process, please let us know when you’re invited to interview.

Right to Work Statement: This role is only open to applicants who have the permanent right to work in the UK. We are unable to provide or take over visa sponsorship for this position, either now or in the future. Applicants must therefore be able to demonstrate their ongoing eligibility to work in the UK without the need for employer sponsorship.

About the employer: Jagex

At Jagex, we pride ourselves on being an exceptional employer, offering a vibrant work culture in the heart of Cambridge. Our commitment to employee growth is evident through our flexible working hours, comprehensive benefits including private healthcare and enhanced family leave policies, and a focus on inclusivity that ensures everyone can thrive. Join us to be part of a forward-thinking team dedicated to innovation and excellence in cloud reliability engineering.

Contact Details: Jagex Recruitment Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Senior Cloud Reliability Engineer role in Cambridge

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those already working at Jagex. A friendly chat can give you insider info and maybe even a referral!

Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repo showcasing your cloud-native projects and automation scripts. This will help us see your hands-on experience in action.

Tip Number 3

Practice makes perfect! Get ready for technical interviews by brushing up on your knowledge of AWS services and incident response strategies. We love candidates who can think on their feet!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team!

We think you need these skills to ace the Senior Cloud Reliability Engineer role in Cambridge

Cloud-Native Architecture
AWS Expertise
VPC
EC2
ECS/EKS
ELB
ECR

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Senior Cloud Reliability Engineer role. Highlight your experience with cloud-native architectures and any relevant AWS expertise. We want to see how your skills align with what we’re looking for!

Showcase Your Projects: Include specific projects where you've improved service reliability or migrated workloads to the cloud. We love seeing real-world examples of your work, so don’t hold back on the details!

Be Clear and Concise: When writing your application, keep it clear and to the point. Use bullet points for key achievements and avoid jargon unless it’s relevant. We appreciate straightforward communication that gets to the heart of your experience.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy to do!

How to prepare for a job interview at Jagex

Know Your Cloud Stuff

Make sure you brush up on your AWS knowledge, especially services like EC2, ECS/EKS, and VPC. Be ready to discuss how you've used these in past projects, particularly in relation to cloud-native design and workload modernisation.

SLIs and SLOs are Key

Familiarise yourself with Service Level Indicators (SLIs) and Service Level Objectives (SLOs). Be prepared to share examples of how you've defined and improved these metrics in previous roles, as they’re crucial for demonstrating service reliability.
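
If it helps to have a concrete example ready, the arithmetic behind an availability SLI and its error budget can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the request counts and the 99.9% target are assumptions made up for the example, not anything Jagex has published.

```python
def availability_sli(good_events: int, total_events: int) -> float:
    """Fraction of events that succeeded; an empty window counts as fully available."""
    if total_events == 0:
        return 1.0
    return good_events / total_events


def error_budget_remaining(sli: float, slo_target: float) -> float:
    """Share of the error budget still unspent (1.0 = untouched, 0 or below = exhausted)."""
    allowed_failure = 1.0 - slo_target
    if allowed_failure <= 0:
        raise ValueError("slo_target must be below 1.0")
    actual_failure = 1.0 - sli
    return 1.0 - actual_failure / allowed_failure


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 500 of which failed, against a 99.9% SLO.
    sli = availability_sli(999_500, 1_000_000)
    remaining = error_budget_remaining(sli, slo_target=0.999)
    print(f"SLI: {sli:.4%}, error budget remaining: {remaining:.0%}")
```

With these made-up numbers, 500 failures against an allowance of 1,000 leaves roughly half of the budget unspent, which is the kind of figure you can use to talk about when to slow releases and when to keep shipping.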

Show Off Your Automation Skills

Think about the operational tasks you've automated in the past. Whether it's scaling or deployments, have specific examples ready to showcase how you've built self-healing mechanisms that enhance system resilience.
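
One simple pattern you might walk through is a health check that triggers a restart with exponential backoff. The Python sketch below is a hypothetical illustration only: `heal`, `check` and `restart` are placeholder names, and in a real system the check and restart actions would usually be delegated to an orchestrator or a cloud provider's health checks rather than hand-rolled.

```python
import time
from typing import Callable


def heal(
    check: Callable[[], bool],
    restart: Callable[[], None],
    max_attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True once the check passes, restarting with exponential backoff in between."""
    for attempt in range(max_attempts):
        if check():
            return True
        restart()
        # Wait 1x, 2x, 4x ... base_delay so a struggling service is not hammered.
        sleep(base_delay * 2 ** attempt)
    # Final verdict after the last restart; False means a human should be paged.
    return check()


if __name__ == "__main__":
    # Simulated service that becomes healthy after two restarts.
    state = {"restarts": 0}
    healthy = heal(
        check=lambda: state["restarts"] >= 2,
        restart=lambda: state.update(restarts=state["restarts"] + 1),
        sleep=lambda seconds: None,  # skip real waiting in the demo
    )
    print(healthy, state["restarts"])
```

Capping the number of attempts matters as much as the restart itself: a loop that retries forever hides the failure instead of escalating it.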

Get Hands-On with Observability

Be ready to discuss your experience with observability platforms and how you've designed alerting systems. Highlight any tools you've used to improve issue detection and response times, as this will show your practical understanding of maintaining reliable systems.