Lead Site Reliability Engineer - Consulting & Strategy in London

Lead Site Reliability Engineer - Consulting & Strategy in London

London Full-Time 80000 - 100000 £ / year (est.) No working from home possible
Ticketmaster

At a Glance

  • Tasks: Lead reliability consulting, mentor teams, and drive improvements across multiple domains.
  • Company: Join Ticketmaster's innovative Central SRE Consulting team in London.
  • Benefits: Permanent role with competitive salary and opportunities for professional growth.
  • Other info: Inclusive environment promoting personal and professional development.
  • Why this job: Make a real impact on system reliability while collaborating with diverse teams.
  • Qualifications: Deep understanding of SRE principles and experience in distributed systems.

The predicted salary is between 80000 - 100000 £ per year.

Location: London, United Kingdom

Division: Ticketmaster UK Limited

Line Manager: Engagement Lead, CSRE Consulting

Contract Terms: Permanent, 40 hours per week

The Team

You will be part of the Central SRE Consulting team, which partners with product and platform engineering teams throughout Ticketmaster to improve reliability, resilience, and sustainable engineering practices. The team’s remit is to increase the adoption and maturity of SRE principles across Ticketmaster and ensure our services are appropriately scaled and reliable.

The Job

As a Lead Site Reliability Engineer in CSRE Consulting, you will lead reliability consulting work across multiple teams or a domain, aligning stakeholders on priorities and driving delivery of sustained improvements. You will translate reliability goals into sequenced workstreams, align dependencies, and ensure teams can maintain the mechanisms after you move on. You will mentor other consultants, codify reusable patterns, and influence shared platforms so reliability improvements propagate beyond any single team or engagement.

What You Will Be Doing

  • Lead consulting work from discovery through delivery by aligning stakeholders on priorities, sequencing work, and communicating measurable outcomes.
  • Establish working cadence and facilitate decision forums to surface risks, map dependencies, and drive clear ownership and timelines.
  • Align product, platform, and engineering stakeholders on reliability targets and trade‑offs using SLOs and error budgets.
  • Partner regularly with Engineering Managers, product managers, Staff and Principal engineers, and platform leads to keep dependencies, decisions, and delivery aligned.
  • Identify systemic risks across shared dependencies and coordinate remediation across multiple teams to reduce recurring incidents.
  • Drive change adoption by embedding reliability mechanisms into partner team routines such as planning, PRRs, and on‑call practices.
  • Design and implement reusable reliability mechanisms, templates, and tooling that can be adopted across teams.
  • Establish and evolve production readiness review practices with partner teams to improve launch quality and change safety.
  • Drive observability strategy for partner domains by improving signal quality, alerting philosophy, and operational dashboards.
  • Lead complex incident investigations and ensure learnings translate into durable fixes with clear owners and verification.
  • Lead reliability‑focused design and code reviews and guide teams toward simpler, safer architectures.
  • Mentor Senior engineers and other consultants through pairing, reviews, and structured coaching to multiply impact.
  • Partner with internal platform engineering to influence roadmaps and deliver shared capabilities that accelerate SRE adoption.
  • Improve CSRE Consulting playbooks and operating practices based on repeated patterns observed across teams.

What You Need to Know (Technical Skills)

  • Deep practical understanding of SRE principles, including SLO governance and error‑budget policy in practice.
  • Proven ability to lead cross‑team technical work and influence without authority.
  • Strong experience designing and troubleshooting distributed systems with cross‑service failure modes.
  • Experience shaping observability and alerting strategy and improving operational signal quality.
  • Strong Kubernetes and AWS experience, including governance and cost trade‑offs.
  • Ability to design reliability automation and tooling that is reusable and adopted by multiple teams.
  • Experience leading production readiness and resilience practices, including DR validation and controlled testing.
  • Strong software engineering fundamentals with the ability to deliver and review high‑quality changes in enterprise codebases.
  • Advanced incident analysis skills focused on systemic risk reduction and organizational learning.
  • Excellent communication skills, including exec‑ready summaries and clear technical diagrams.

You (Behavioural Skills)

  • Lead with service and humility, creating clarity and momentum without relying on authority.
  • Build relationships across teams and functions, and set clear expectations for how you partner and deliver.
  • Facilitate alignment by framing problems, surfacing trade‑offs, and running working sessions that end in decisions.
  • Persuade with evidence and empathy, adapting your narrative for engineers, product, and senior stakeholders.
  • Coach and mentor deliberately, helping others grow in reliability thinking and consulting craft.
  • Maintain psychological safety while raising standards, giving direct feedback with respect.
  • Stay persistent and patient in complex organizations, keeping work moving despite slow dependencies.
  • Hold ambiguity comfortably and turn messy inputs into clear plans, options, and next steps.
  • Favor simple mechanisms that scale adoption, not bespoke one‑offs that require you to maintain them.
  • Operate at a sustainable pace and discourage hero culture by designing systems that do not need it.
  • Take pride in quality, including documentation and decision records that help teams sustain the work.
  • Remain adaptable, switching between hands‑on debugging, stakeholder management, and planning as needed.

Equal Opportunities

Live Nation Entertainment is committed to equal opportunity. We encourage applications from people irrespective of gender, race, sexual orientation, religion, age, disability status, or caring responsibilities. We are passionate and committed to creating an inclusive environment and encouraging professional and personal growth. Live Nation Entertainment will never request payment or equipment purchases as part of the hiring process. Recruiters will only contact candidates from official Live Nation or affiliated brand email domains.

Lead Site Reliability Engineer - Consulting & Strategy in London employer: Ticketmaster

At Ticketmaster UK Limited, we pride ourselves on fostering a collaborative and inclusive work culture that empowers our employees to thrive. As a Lead Site Reliability Engineer in London, you will have the opportunity to mentor others, drive impactful change across teams, and contribute to the evolution of our engineering practices while enjoying a supportive environment that prioritises professional growth and work-life balance.

Ticketmaster

Contact Details:

Ticketmaster Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Lead Site Reliability Engineer - Consulting & Strategy in London

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Ticketmaster or similar companies. A friendly chat can open doors and give you insights that job descriptions just can't.

Tip Number 2

Prepare for interviews by practising common SRE scenarios. Think about how you'd tackle reliability issues or lead a team through an incident. We want you to show off your skills and experience!

Tip Number 3

Don’t forget to showcase your soft skills! Being able to communicate effectively and build relationships is key in this role. Share examples of how you've influenced teams or facilitated decision-making.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining our team at StudySmarter.

We think you need these skills to ace Lead Site Reliability Engineer - Consulting & Strategy in London

SRE Principles
SLO Governance
Error-Budget Policy
Cross-Team Technical Leadership
Distributed Systems Design
Observability Strategy
Kubernetes

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience with SRE principles and cross-team collaboration. We want to see how your skills align with the role, so don’t hold back on showcasing your relevant achievements!

Showcase Your Technical Skills:When detailing your experience, focus on your deep understanding of distributed systems, Kubernetes, and AWS. We’re looking for candidates who can demonstrate their technical prowess, so include specific examples of how you've tackled challenges in these areas.

Communicate Clearly:Your written application should reflect your excellent communication skills. Use clear, concise language and structure your thoughts logically. Remember, we value clarity and the ability to convey complex ideas simply, so make it easy for us to understand your points.

Apply Through Our Website:We encourage you to submit your application through our official website. This ensures that your application is processed smoothly and allows us to keep track of all candidates effectively. Plus, it’s the best way to stay updated on your application status!

How to prepare for a job interview at Ticketmaster

Know Your SRE Principles

Make sure you have a solid grasp of SRE principles, especially SLO governance and error-budget policies. Be ready to discuss how you've applied these concepts in past roles, as this will show your deep understanding and practical experience.

Showcase Your Leadership Skills

Prepare examples that highlight your ability to lead cross-team technical work and influence without authority. Think about specific situations where you aligned stakeholders on priorities or drove delivery of improvements, as this is crucial for the role.

Communicate Clearly and Effectively

Practice summarising complex technical concepts into clear, concise narratives. You’ll need to adapt your communication style for different audiences, so be prepared to explain your ideas to both engineers and senior stakeholders.

Demonstrate Your Problem-Solving Skills

Be ready to discuss how you've tackled systemic risks and coordinated remediation across teams. Share specific examples of how you’ve turned messy inputs into clear plans, as this will showcase your ability to handle ambiguity and drive change.