Site reliability engineer (SRE) in London

London | Full-Time | No home office possible

At a Glance

  • Tasks: Automate workflows and enhance platform reliability in a fast-paced trading environment.
  • Company: Leading financial services firm with a focus on innovation and technology.
  • Benefits: Competitive daily rate, hands-on experience, and opportunities for professional growth.
  • Other info: Work onsite in London, contributing to high-impact projects in finance.
  • Why this job: Join a dynamic team to drive automation and AI in market risk operations.
  • Qualifications: 8+ years in SRE, strong Python skills, and experience with distributed systems.

We are hiring experienced Site Reliability Engineers (SREs) to support a Market Risk platform within a leading financial services environment. This is an engineering-led transformation role, focused on automation, reliability, and AI-driven operational improvement rather than BAU support.

Success is measured by:

  • Reduced operational toil
  • Faster recovery (MTTR reduction)
  • Safer, faster change delivery
  • Increased automation and self-service
  • Improved platform reliability

Key Responsibilities

  • Automation Engineering (Core)
    • Build production-grade Python automation for operational workflows
    • Automate environment checks, dependency validation, reruns, restarts, and drift remediation
    • Deliver self-service tools with proper audit, rollback, and safety controls (idempotency, dry-run, approvals)
  • Process Re-engineering (Core)
    • Redesign incident, change, release, and recovery processes
    • Convert runbooks into automated workflows
    • Remove manual handoffs and operational friction
    • Define KPIs: toil, MTTR, alert volume, change failure rate
  • Agentic AI (Core)
    • Build agentic workflows for diagnostics, remediation, and orchestration
    • Implement guardrails, human-in-the-loop controls, and evaluation frameworks
    • Productionise AI automation with monitoring and feedback loops
  • Observability
    • Improve monitoring, logging, and system visibility to enable automation at scale

Required Skills

  • 8+ years SRE/production engineering experience
  • Strong Python (automation/tooling focus)
  • Experience with distributed systems in production environments
  • Strong Linux troubleshooting (app/system/network layers)
  • Hybrid infrastructure exposure (on-prem + cloud)
  • Kubernetes experience (ops/monitoring/reruns)
  • Strong background in automation and process optimisation
  • Proven experience with agentic AI or intelligent automation systems, including tool integration, guardrails, evaluation, and measurable production impact (toil/MTTR reduction)

Desirable

  • Banking/Finance/Market Risk experience
  • Familiarity with Athena ecosystem or similar (SecDB, Quartz)
  • Exposure to trading, risk, or regulatory platforms

This is a high-impact SRE role in a Market Risk trading environment, focused on eliminating operational toil through automation, AI, and reliability engineering at scale.

Employer: Uniting People

As a Site Reliability Engineer (SRE) within our leading financial services firm in London, you will thrive in an engineering-led culture that prioritises innovation and automation. We offer competitive daily rates, a collaborative work environment, and ample opportunities for professional growth, all while being at the forefront of transforming market risk platforms. Join us to make a meaningful impact in a dynamic sector where your expertise in AI-driven operational improvements will be highly valued.

Contact Details:

Uniting People Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land the Site Reliability Engineer (SRE) role in London

Tip Number 1

Network like a pro! Reach out to your connections in the banking and finance sectors, especially those who work with SREs. A friendly chat can lead to insider info about job openings that aren't even advertised yet.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your Python automation projects and any cool SRE tools you've built. This gives potential employers a taste of what you can bring to their Market Risk platform.

Tip Number 3

Prepare for the interview like it’s a coding challenge! Brush up on your Linux troubleshooting and Kubernetes knowledge. Be ready to discuss how you've tackled operational challenges and improved reliability in past roles.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace the Site Reliability Engineer (SRE) role in London

Site Reliability Engineering (SRE)
Python
Automation Engineering
Incident Management
Change Management
Release Management
AI-driven Automation

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with Python automation, distributed systems, and any relevant banking or finance background. We want to see how your skills align with our needs!

Showcase Your Projects: Include specific projects where you've implemented automation or improved reliability. Use metrics to demonstrate your impact, like reduced MTTR or increased self-service capabilities. This helps us see your hands-on experience in action!

Craft a Compelling Cover Letter: Your cover letter should tell us why you're passionate about SRE and how you can contribute to our Market Risk platform. Share your thoughts on automation and AI-driven improvements, and let your personality shine through!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!

How to prepare for a job interview at Uniting People

Know Your Automation Inside Out

Make sure you can discuss your experience with Python automation in detail. Be ready to share specific examples of how you've built production-grade automation for operational workflows, and how it has improved efficiency in previous roles.
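
If it helps to have something concrete to talk through, here is a minimal sketch of the safety controls the job description names (idempotency, dry-run, audit). It is illustrative only: `FakeServiceManager`, `ensure_running`, and the service name are hypothetical stand-ins, not anything taken from the employer's platform.

```python
from dataclasses import dataclass, field


@dataclass
class RestartResult:
    service: str
    action: str  # "skipped", "would-restart", or "restarted"


@dataclass
class FakeServiceManager:
    """Hypothetical in-memory stand-in for a real service manager."""
    healthy: dict = field(default_factory=dict)
    audit_log: list = field(default_factory=list)

    def is_healthy(self, service: str) -> bool:
        return self.healthy.get(service, False)

    def restart(self, service: str) -> None:
        # A real implementation would call the platform's restart mechanism.
        self.healthy[service] = True


def ensure_running(manager: FakeServiceManager, service: str,
                   dry_run: bool = True) -> RestartResult:
    """Restart a service only if it is unhealthy.

    Idempotent: calling it again on a healthy service does nothing.
    Dry-run (the default): reports the intended action without acting.
    Every decision is appended to the audit log.
    """
    if manager.is_healthy(service):
        result = RestartResult(service, "skipped")
    elif dry_run:
        result = RestartResult(service, "would-restart")
    else:
        manager.restart(service)
        result = RestartResult(service, "restarted")
    manager.audit_log.append(result)
    return result


if __name__ == "__main__":
    manager = FakeServiceManager(healthy={"pricing": False})
    print(ensure_running(manager, "pricing"))                 # dry run, no change
    print(ensure_running(manager, "pricing", dry_run=False))  # performs the restart
    print(ensure_running(manager, "pricing", dry_run=False))  # already healthy
```

Defaulting `dry_run` to `True` is a deliberate choice in this sketch: the caller has to opt in to a real change, which is the kind of trade-off worth being able to explain in an interview.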

Demonstrate Your Problem-Solving Skills

Prepare to talk about your approach to troubleshooting in distributed systems. Think of a few challenging incidents you've resolved, focusing on your thought process and the tools you used to diagnose and fix issues.

Familiarise Yourself with Agentic AI

Since agentic AI is essential for this role, brush up on your knowledge of intelligent automation systems. Be prepared to discuss how you've implemented guardrails and evaluation frameworks in past projects, and the measurable impacts they had.

Showcase Your Understanding of Market Risk

Even if you don't have direct experience in banking or finance, do some research on market risk platforms. Being able to speak intelligently about the industry and its challenges will show your enthusiasm and readiness to contribute.