Lead Site Reliability Engineer in London

Lead Site Reliability Engineer in London

London Full-Time 43200 - 72000 £ / year (est.) Home office (partial)
G

At a Glance

  • Tasks: Maintain and evolve our platform for reliability, security, and scalability.
  • Company: Join FutureLearn, a leader in lifelong learning and career empowerment.
  • Benefits: 28 days annual leave, health plan, cycle scheme, and access to courses.
  • Why this job: Shape the future of education with cutting-edge technology and a passionate team.
  • Qualifications: Experience in cloud infrastructure, automation, and collaboration with software engineers.
  • Other info: Diversity and inclusion are at the heart of our culture.

The predicted salary is between 43200 - 72000 £ per year.

At FutureLearn, we are passionate about the power of lifelong learning. We help learners from all over the world progress in their careers and invest in their futures. We truly believe that up-skilling is a worthy investment, and we hope to empower our learners to take control of their careers through personalised learning pathways.

Partnering with 260+ world-class educational partners, including prestigious universities, global brands and industry partners, we offer our 20 million-strong learner community the opportunity to discover and access flexible, high-quality online courses and degrees. We want to help transform lives.

You will play a key role in maintaining and evolving FutureLearn's platform to ensure it is highly available, reliable, secure, and scalable as the business grows. Working closely with the Lead Technical Architect, SREs, and software engineers, you will help shape the technical direction of our infrastructure while fostering a strong DevOps culture.

What does success look like:

  • Partner with the Lead Technical Architect to set and evolve the technical direction of our infrastructure.
  • Take responsibility for a platform that is secure, resilient, scalable, and cost-efficient.
  • Develop deep expertise in FutureLearn's technology stack and its practical application, including AWS (RDS, ECS, EC2, S3, Lambda), Cloudflare, Redis, DNS, Docker, and the wider infrastructure platform.
  • Use, maintain, and continuously improve observability tooling such as Datadog and AWS CloudWatch.
  • Respond to incidents affecting the platform, including participation in the on-call rota.
  • Ensure disaster recovery and incident response processes are regularly tested and improved.
  • Act as an expert in the tools used to manage infrastructure and CI/CD systems, including Terraform, GitHub Actions, and scripting languages.
  • Own and continuously improve the developer experience.
  • Champion CI/CD best practices.
  • Empower software engineers to understand how to get their code into production.
  • Support engineers through pairing, teaching, mentoring, coaching, and code reviews.
  • Act as a subject matter expert for infrastructure and operational concerns across FutureLearn.

What you bring to the table:

  • Experience architecting and supporting cloud-native web application infrastructure.
  • Hands-on experience with containers and schedulers (Amazon ECS).
  • Experience using automated configuration management and infrastructure-as-code tools (Terraform).
  • A deep understanding of Linux, networking, and security.
  • Experience supporting database administration and performance.
  • A strong interest in automation and improving the developer experience.
  • Experience working closely with software engineers in an agile environment.
  • A solid understanding of Git and version control best practices.

Desirable:

  • Programming experience in Ruby, JavaScript, or Go.
  • Experience managing relationships with external suppliers such as AWS or Cloudflare.

What we offer you:

  • 28 days of Annual Leave plus UK Public Holidays.
  • Roll over up to 5 days Holiday.
  • Access to FutureLearn courses.
  • Westfield Health Cash Plan.
  • Cycle to Work scheme.
  • Season Ticket Loan.
  • Charity work – 1 day dedicated to a charity of your choice.
  • Calm Premium Subscription.

Diversity Statement: We value all the great benefits that diversity brings and encourage everyone to bring their whole self to work. We are committed to Equal Employment Opportunity regardless of race, colour, national origin, ethnicity, gender, age, disability, sexual orientation, gender identity or religion.

Lead Site Reliability Engineer in London employer: Global University Systems (GUS)

At FutureLearn, we are dedicated to fostering a culture of lifelong learning and career empowerment, making us an exceptional employer for those passionate about education and technology. Our collaborative environment encourages personal and professional growth, offering generous benefits such as 28 days of annual leave, access to our courses, and a commitment to diversity and inclusion. Join us in shaping the future of online learning while enjoying unique perks like a charity work day and health plans, all within a vibrant and innovative setting.
G

Contact Detail:

Global University Systems (GUS) Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Lead Site Reliability Engineer in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with FutureLearn employees on LinkedIn. A friendly chat can open doors that applications alone can't.

✨Tip Number 2

Show off your skills! If you’ve got a portfolio or GitHub showcasing your projects, make sure to share it during interviews. It’s a great way to demonstrate your expertise and passion for tech.

✨Tip Number 3

Prepare for the interview by understanding FutureLearn's mission and values. Tailor your answers to reflect how your experience aligns with their goals of lifelong learning and career empowerment.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the FutureLearn team.

We think you need these skills to ace Lead Site Reliability Engineer in London

Cloud-Native Infrastructure Architecture
AWS (RDS, ECS, EC2, S3, Lambda)
Docker
Terraform
Linux
Networking
Security
Database Administration
CI/CD Best Practices
Observability Tooling (Datadog, AWS CloudWatch)
Incident Response
Agile Methodologies
Version Control (Git)
Programming (Ruby, JavaScript, Go)
DevOps Culture Building

Some tips for your application 🫡

Show Your Passion for Learning: When you write your application, let your enthusiasm for lifelong learning shine through. We want to see how you connect with our mission of empowering learners and transforming lives.

Tailor Your CV and Cover Letter: Make sure to customise your CV and cover letter to highlight your relevant experience and skills. We love seeing how your background aligns with the role of Lead Site Reliability Engineer and our goals at FutureLearn.

Be Clear and Concise: Keep your writing clear and to the point. We appreciate straightforward communication, so avoid jargon and focus on what makes you a great fit for the team.

Apply Through Our Website: Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity at FutureLearn.

How to prepare for a job interview at Global University Systems (GUS)

✨Know Your Tech Stack

Make sure you have a solid understanding of FutureLearn's technology stack, especially AWS services like RDS, ECS, and EC2. Brush up on your knowledge of Docker and Terraform too, as these will likely come up in discussions.

✨Show Your DevOps Spirit

Demonstrate your passion for building a strong DevOps culture. Be ready to share examples of how you've fostered collaboration between teams and improved developer experiences in your previous roles.

✨Prepare for Incident Scenarios

Think about past incidents you've managed and be prepared to discuss how you approached troubleshooting and incident response. Highlight your experience with observability tools like Datadog and AWS CloudWatch.

✨Emphasise Lifelong Learning

Since FutureLearn values lifelong learning, express your eagerness to learn and grow within the role. Share any recent courses or certifications you've completed that relate to site reliability engineering or cloud infrastructure.

Lead Site Reliability Engineer in London
Global University Systems (GUS)
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

G
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>