At a Glance
- Tasks: Lead reliability engineering initiatives and collaborate with global teams to enhance system performance.
- Company: Join a leading global financial markets infrastructure provider with a focus on innovation.
- Benefits: Enjoy tailored benefits, healthcare, retirement planning, and paid volunteering days.
- Other info: Be part of a diverse team that values individuality and encourages new ideas.
- Why this job: Make a significant impact in a dynamic environment while shaping the future of reliability engineering.
- Qualifications: 10+ years in SRE or related roles, strong AWS and Kubernetes experience required.
The predicted salary is between 80000 - 100000 € per year.
We are evolving our Site Reliability Engineering capabilities to strengthen reliability, observability, security, and operational excellence across our Markets and Risk Intelligence division. As a Technical Lead SRE, you will be a senior hands-on technical person helping shape the foundations of reliability across both new and existing platforms. You will collaborate with Architecture, Engineering, Security, and Platform teams to ensure reliability is built into systems from day one. While this is not a people-management or shift-based role, you will work closely with global teams and may occasionally be called upon for major incidents or critical issues. This position requires a highly proactive, hard-working expert with strong leadership presence and ownership of platform reliability outcomes.
Key Responsibilities
- Lead the establishment of SRE foundations for new projects building environments, monitoring, alerting, and ensuring operational readiness from day one.
- Collaborate with Architecture and Engineering teams to embed reliability, scalability, security, and observability into system design.
- Define, implement, and champion observability standards, tooling, and guidelines across metrics, logs, traces, and SLIs/SLOs.
- Design and evolve monitoring and alerting solutions that improve visibility, reduce toil, and strengthen system health.
- Continuously drive reliability improvements across our environments through incident reduction, performance tuning, and building resilient patterns.
- Partner with Security teams to ensure our platforms meet compliance, security, and risk-management expectations.
- Lead seamless handovers from project delivery into BAU SRE operations by ensuring documentation, readiness, and strong operational practices.
- Influence architectural and design decisions through data-driven cloud cost optimization and efficiency initiatives.
- Be a technical leader and mentor supporting engineers, shaping engineering standards, and fostering a culture of learning and development.
Person Specification
Education: Bachelor’s Degree in Computer Science or related field
Required skills and experience
- 10+ years of hands-on technical experience in SRE, Platform Engineering, Infrastructure, or related roles
- Strong experience with AWS, including services such as EKS, ECS, EC2, networking, IAM, and managed services
- Deep hands-on experience with Kubernetes and containerised platforms
- Strong background in Linux systems administration.
- Proven experience designing and operating observability platforms, including monitoring, logging, and alerting
- Hands-on experience with Datadog for metrics, logs, APM, and alerting
- Strong understanding of SRE principles, including SLOs, error budgets, incident management, and reliability engineering
- Experience working closely with architecture and engineering teams on system design and delivery
- Solid understanding of cloud security principles and experience collaborating with security teams
- Experience with cloud cost optimisation strategies and tooling
- Hands-on experience integrating AI with observability stacks (Prometheus, Grafana, ELK, OpenTelemetry) for proactive issue detection.
Good to have Skills
- Experience or working knowledge of Microsoft Azure
- Experience supporting multi-cloud or hybrid environments
- Exposure to Infrastructure as Code (e.g., Terraform, CloudFormation)
- Experience in large-scale, complex, or regulated environments
- Knowledge of vector databases and RAG architectures for building internal SRE knowledge assistants.
- Knowledge of Generative AI and LLM platforms (e.g., Claude, Amazon Bedrock)
Person Specification
- Strong technical authority with the ability to influence design and operational decisions
- Highly collaborative, comfortable working across architecture, engineering, security, and operations teams
- Calm and methodical under pressure, especially during incidents and critical issues
- Pragmatic problem-solver who balances reliability, security, cost, and delivery speed
- Clear communicator, able to explain complex technical concepts to diverse audiences
Join us and be part of a team that values innovation, quality, and continuous improvement. If you're ready to take your career to the next level and make a significant impact, we'd love to hear from you. LSEG is a leading global financial markets infrastructure and data provider. Our purpose is driving financial stability, empowering economies and enabling customers to create sustainable growth.
Our purpose is the foundation on which our culture is built. Our values of Integrity, Partnership, Excellence and Change underpin our purpose and set the standard for everything we do, every day. They go to the heart of who we are and guide our decision making and everyday actions.
Working with us means that you will be part of a dynamic organisation of 25,000 people across 65 countries. However, we will value your individuality and enable you to bring your true self to work so you can help enrich our diverse workforce.
We are proud to be an equal opportunities employer. This means that we do not discriminate on the basis of anyone’s race, religion, colour, national origin, gender, sexual orientation, gender identity, gender expression, age, marital status, veteran status, pregnancy or disability, or any other basis protected under applicable law.
You will be part of a collaborative and creative culture where we encourage new ideas. We are committed to sustainability across our global business and we are proud to partner with our customers to help them meet their sustainability objectives.
Our charity, the LSEG Foundation provides charitable grants to community groups that help people access economic opportunities and build a secure future with financial independence. Colleagues can get involved through fundraising and volunteering.
LSEG offers a range of tailored benefits and support, including healthcare, retirement planning, paid volunteering days and wellbeing initiatives.
Lead Site Reliability Engineer in Nottingham employer: London Stock Exchange Group
At LSEG, we pride ourselves on being an exceptional employer that fosters a culture of innovation, collaboration, and continuous improvement. As a Lead Site Reliability Engineer in London, you will have the opportunity to work with cutting-edge technologies while contributing to a diverse and inclusive environment that values your individuality. With tailored benefits, a commitment to employee growth, and a focus on sustainability, LSEG is dedicated to empowering you to make a meaningful impact in the financial markets infrastructure sector.
Contact Detail:
London Stock Exchange Group Recruiting Team
StudySmarter Expert Advice🤫
We think this is how you could land Lead Site Reliability Engineer in Nottingham
✨Tip Number 1
Network like a pro! Reach out to your connections in the industry, attend meetups, and engage with online communities. You never know who might have the inside scoop on job openings or can put in a good word for you.
✨Tip Number 2
Prepare for interviews by practising common SRE scenarios. Brush up on your technical skills and be ready to discuss how you've tackled reliability challenges in the past. We want to see your problem-solving skills in action!
✨Tip Number 3
Showcase your passion for reliability engineering! Share your projects, contributions to open-source, or any relevant blogs you've written. This not only highlights your expertise but also demonstrates your commitment to continuous improvement.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team at LSEG.
We think you need these skills to ace Lead Site Reliability Engineer in Nottingham
Some tips for your application 🫡
Tailor Your CV:Make sure your CV is tailored to the Lead Site Reliability Engineer role. Highlight your experience with AWS, Kubernetes, and observability platforms. We want to see how your skills align with our needs!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share your passion for reliability engineering and how you’ve driven continuous improvement in past roles. Let us know why you’re excited about joining our team at StudySmarter.
Showcase Your Technical Skills:Don’t hold back on showcasing your technical expertise! Include specific examples of projects where you’ve implemented SRE principles or improved system reliability. We love seeing hands-on experience!
Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to receive your application and keep track of it. Plus, it shows you’re keen on joining StudySmarter!
How to prepare for a job interview at London Stock Exchange Group
✨Know Your SRE Principles
Make sure you brush up on your understanding of SRE principles, especially SLOs, error budgets, and incident management. Be ready to discuss how you've applied these concepts in your previous roles, as this will show your depth of knowledge and experience.
✨Demonstrate Technical Expertise
Prepare to showcase your hands-on experience with AWS, Kubernetes, and observability platforms like Datadog. Bring specific examples of how you've designed and operated monitoring solutions or improved system reliability in past projects.
✨Collaborative Mindset
Since the role involves working closely with various teams, be prepared to discuss how you've successfully collaborated with architecture, engineering, and security teams in the past. Highlight any instances where your teamwork led to significant improvements in system design or operational practices.
✨Problem-Solving Under Pressure
Expect questions that assess your ability to remain calm and methodical during incidents. Share examples of how you've handled critical issues in the past, focusing on your approach to problem-solving and decision-making under pressure.