Site Reliability Engineer in London

London Full-Time 36000 - 60000 £ / year (est.) No home office possible

At a Glance

  • Tasks: Elevate platform reliability and performance while optimising AWS infrastructure.
  • Company: Join Fanvue, the fastest-growing creator monetisation platform.
  • Benefits: Competitive salary, equity, unlimited holiday, and remote work options.
  • Why this job: Be a key player in redefining the creator economy with cutting-edge technology.
  • Qualifications: Experience with AWS, Aurora PostgreSQL, and a passion for reliability engineering.
  • Other info: Diverse teams welcome; we value potential as much as experience.

The predicted salary is between 36000 - 60000 £ per year.

Join us in redefining the creator economy with Fanvue, the fastest-growing creator monetisation platform. We are the leading AI-powered creator-first platform, designed to empower creators worldwide to directly monetise their audience.

The Role

We are hiring a Site Reliability Engineer (SRE) to elevate the reliability, scalability, and performance of the core platform that powers Fanvue. You will be the technical specialist who ensures our infrastructure is predictable, resilient, and capable of supporting rapid product development across multiple teams.

This role sits at the heart of the platform: improving the health of our Aurora PostgreSQL estate, developing robust AWS infrastructure, enabling engineering teams with deep technical expertise, and driving the reliability culture required to support a fast-scaling product.

What You'll Do

  • Own and optimise Aurora PostgreSQL (Serverless v2) clusters that power Fanvue's core systems, ensuring performance, availability, and scalability.
  • Oversee the reliability of AWS-managed data infrastructure across Aurora, ElastiCache Redis, DynamoDB, and RDS.
  • Develop and maintain Infrastructure as Code using AWS CDK (TypeScript), establishing automated, reusable patterns and best practices.
  • Reduce operational toil through automation and build self-service tooling that empowers engineering teams.
  • Implement and maintain robust monitoring, observability, and alerting using AWS CloudWatch.
  • Ensure CI/CD pipelines are reliable, safe, and performant, enabling frequent and high-confidence deployments.
  • Act as the escalation point for complex infrastructure and database issues, supporting teams when deep expertise is required.
  • Lead incident response, run post-mortems, and deliver actionable improvements to avoid repeat failures.
  • Partner closely with stream teams to understand their infrastructure needs and provide technical guidance without slowing their velocity.
  • Mentor engineers across the Platform team, raising reliability standards and improving operational maturity.

Who You Are

A highly experienced reliability engineer with deep hands‐on expertise in AWS‐managed database systems, distributed systems, and infrastructure automation. You bring:

  • Extensive experience operating, scaling, and tuning Aurora PostgreSQL (preferably Serverless v2).
  • Strong proficiency across AWS database services: Aurora PostgreSQL, ElastiCache Redis, DynamoDB, and RDS.
  • Expertise with Infrastructure as Code, especially AWS CDK (TypeScript).
  • Proven ability to identify, measure, and eliminate toil through automation.
  • Experience applying SRE principles: SLIs, SLOs, error budgets, gradual rollouts, and reliability‐focused system design.
  • Strong architectural thinking, with the ability to design fault‐tolerant, scalable infrastructure.
  • Deep expertise with monitoring, observability, and performance tuning using AWS CloudWatch.
  • Excellent communication skills and the ability to guide teams without creating bottlenecks.
  • A high‐ownership mindset aligned with Amazon Leadership Principles: Ownership, Dive Deep, Think Big, Deliver Results.

Nice‐to‐haves

  • Experience supporting ECS Fargate workloads or containerised environments.
  • Background in building internal platform tools or developer enablement systems.
  • Familiarity with microservice vs centralised architecture trade‐offs.

You'll Thrive Here If

  • You enjoy being the deep technical expert teams rely on.
  • You love optimising systems for performance and reliability.
  • You are motivated by solving hard technical problems and making infrastructure invisible, stable, and scalable.
  • You take pride in raising engineering standards and creating leverage for others.

You'll Struggle Here If

  • You prefer reactive operations over proactive engineering.
  • You are uncomfortable owning large technical surfaces with autonomy.
  • You avoid hands‐on investigation, deep dives, or operational responsibility.

Why Join Fanvue?

  • Own and strengthen the most mission‐critical systems at one of the fastest‐growing creator platforms.
  • Competitive salary, equity, and benefits package.
  • A culture that values innovation, ownership, transparency, and speed.
  • Unlimited holiday.
  • Remote working.
  • Flexible hours to support how you perform best.
  • Budget for growth and wellbeing.

Fanvue is for Everyone

We know that diverse teams build better products. Even if you do not meet every single requirement, we encourage you to apply. Many great people grow into parts of a role, and we value potential just as much as experience.

Site Reliability Engineer in London employer: Fanvue LLC

At Fanvue, we pride ourselves on being a leading AI-powered creator monetisation platform that champions innovation and ownership. As a Site Reliability Engineer, you'll be at the forefront of enhancing our core systems, with access to competitive salaries, unlimited holiday, and a flexible remote working culture that prioritises your growth and wellbeing. Join us in a dynamic environment where diverse teams collaborate to redefine the creator economy and empower creators worldwide.

Contact Details:

Fanvue LLC Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by practising common SRE scenarios and technical questions. Mock interviews with friends or using online platforms can help you feel more confident and ready to impress.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our mission!

We think you need these skills to ace the Site Reliability Engineer role in London

Aurora PostgreSQL
AWS
Infrastructure as Code
AWS CDK (TypeScript)
Automation
Monitoring and Observability
AWS CloudWatch
CI/CD Pipelines
SRE Principles
Distributed Systems
Performance Tuning
Communication Skills
Architectural Thinking
Problem-Solving Skills
Mentoring

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience with AWS, PostgreSQL, and SRE principles. We want to see how your skills align with our mission at Fanvue!

Showcase Your Technical Expertise: Don’t hold back on detailing your hands-on experience with infrastructure automation and monitoring tools. We love seeing specific examples of how you've optimised systems in the past!

Be Clear and Concise: When writing your application, keep it straightforward. Use bullet points for key achievements and avoid jargon that might confuse us. We appreciate clarity just as much as technical prowess!

Apply Through Our Website: We encourage you to submit your application directly through our website. It’s the best way for us to receive your details and ensures you’re considered for the role without any hiccups!

How to prepare for a job interview at Fanvue LLC

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of Aurora PostgreSQL and AWS services. Be ready to discuss your hands-on experience with these technologies, as well as any challenges you've faced and how you overcame them.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled complex infrastructure issues in the past. Highlight your approach to incident response and how you’ve implemented improvements to avoid repeat failures.

✨Demonstrate Your Automation Expertise

Since reducing operational toil is key for this role, be ready to talk about your experience with Infrastructure as Code, particularly using AWS CDK. Discuss any self-service tooling you've built and how it empowered engineering teams.

✨Communicate Clearly and Confidently

Strong communication skills are essential. Practice explaining technical concepts in a way that’s easy to understand. Show that you can guide teams without creating bottlenecks, and be prepared to discuss how you foster collaboration.

Fanvue LLC (company size: 50-100)