Senior Cloud Reliability Engineer

Full-Time | 60000 - 80000 € / year (est.) | No home office possible
Jagex

At a Glance

  • Tasks: Partner with teams to enhance cloud-native services and improve system reliability.
  • Company: Join a leading gaming company in Cambridge, UK, known for innovation.
  • Benefits: Enjoy private healthcare, flexible hours, and generous annual leave.
  • Other info: Dynamic work culture with opportunities for professional growth and development.
  • Why this job: Make a real impact on game reliability while working with cutting-edge cloud technologies.
  • Qualifications: Experience in cloud services, incident response, and Linux environments required.

The predicted salary is between 60000 and 80000 € per year.

Location: Cambridge, UK – Applicants should be based (or willing to relocate) within a comfortable commuting distance of our office to attend onsite as required.

What you’ll be doing:

  • Partner with game and development teams to move services toward cloud-native architectures, improving resilience, security and cost efficiency across live environments.
  • Support the migration of workloads from managed VPS environments onto Jagex’s cloud platform, helping teams modernise safely without compromising uptime.
  • Define, embed and improve SLIs, SLOs and error‑budget thinking so service reliability is measurable and better understood across teams.
  • Design and enhance observability and alerting across logs, metrics and traces, giving teams faster insight into issues and reducing time to detection.
  • Automate operational tasks such as scaling, failover and deployments, while building self‑healing mechanisms that reduce toil and improve recovery.
  • Contribute hands‑on reliability improvements across Linux-based production systems, reusable IaC modules and team codebases, while helping raise engineering standards across Cloud Tech.

What we’re looking for:

  • Proven experience owning reliability for large-scale, internet-facing services in production.
  • Demonstrable AWS expertise across services such as VPC, EC2, ECS/EKS, ELB, ECR, Route53, KMS, IAM and Systems Manager.
  • Proven capability in cloud-native design, workload modernisation and Infrastructure as Code delivery.
  • Strong practical experience with SLIs, SLOs, incident response, root cause analysis and resilient system design.
  • Demonstrable production experience with Debian-based Linux environments, virtual machine fleet management and configuration management tooling.
  • Hands‑on experience with observability platforms, CI/CD, containerisation and programming or scripting in Python or Java.

What we offer:

  • Private Healthcare, including Dental Plan.
  • Discretionary annual performance bonus.
  • Minimum 6% Pension contributions.
  • Life Insurance.
  • Enhanced family leave policies from day 1.
  • Flexible working hours.
  • 25 days annual leave + Bank holidays.

Senior Cloud Reliability Engineer employer: Jagex

At Jagex, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration in the heart of Cambridge. Our commitment to employee growth is evident through our comprehensive benefits package, including private healthcare, flexible working hours, and enhanced family leave policies from day one, ensuring that our team members thrive both personally and professionally. Join us to be part of a forward-thinking company where your contributions directly impact the success of our cloud-native architectures and live environments.

Contact Details:

Jagex Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Senior Cloud Reliability Engineer role

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups or webinars, and connect with current employees at Jagex. You never know who might give you the inside scoop on job openings or even refer you directly!

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your cloud-native projects, especially those involving AWS services. This will not only demonstrate your expertise but also give you something tangible to discuss during interviews.

Tip Number 3

Prepare for technical interviews by brushing up on your knowledge of SLIs, SLOs, and incident response strategies. Practise explaining complex concepts in simple terms, as this will help you communicate effectively with both technical and non-technical team members.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Jagex!

We think you need these skills to ace the Senior Cloud Reliability Engineer role

Cloud-Native Architecture
AWS Expertise
VPC
EC2
ECS/EKS
ELB
ECR

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Senior Cloud Reliability Engineer role. Highlight your experience with cloud-native architectures and any relevant AWS expertise. We want to see how your skills align with what we’re looking for!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you’re passionate about cloud reliability and how your past experiences make you a great fit for our team. Let us know what excites you about working at Jagex!

Showcase Your Technical Skills: Don’t forget to showcase your technical skills in your application. Mention your hands-on experience with Linux environments, Infrastructure as Code, and any programming or scripting languages you’re comfortable with. We love seeing practical examples!

Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we can’t wait to hear from you!

How to prepare for a job interview at Jagex

Know Your Cloud Stuff

Make sure you brush up on your AWS knowledge, especially the services mentioned in the job description like EC2, ECS, and Route53. Be ready to discuss how you've used these tools in past projects and how they can improve service reliability.

Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled issues related to SLIs, SLOs, and incident response. Think about a time when you improved system resilience or reduced downtime, and be ready to explain your thought process.

Demonstrate Your Automation Expertise

Since automation is key for this role, come equipped with examples of operational tasks you've automated. Discuss any self-healing mechanisms you've implemented and how they benefited the team or project.

Be Ready for Technical Questions

Expect some hands-on technical questions or scenarios during the interview. Practise explaining your approach to designing observability and alerting systems, as well as your experience with IaC and Linux environments.