Site Reliability Engineer I in London

London | Full-Time | £60,000 - £80,000 / year (est.) | No home office possible
Purview Consultancy Services Ltd

At a Glance

  • Tasks: Automate and optimise processes for the Market Risk Platform using AI and Python.
  • Company: Join a leading banking and finance firm in London.
  • Benefits: Competitive contract salary with opportunities for professional growth.
  • Other info: Dynamic role focused on innovation and process re-engineering.
  • Why this job: Make a real impact by eliminating operational toil and enhancing reliability.
  • Qualifications: 8+ years SRE experience, strong Python skills, and cloud technology exposure.

The predicted salary is between £60,000 and £80,000 per year.

Location: London, UK - 5 Days Onsite

Job Type: Contract & Fixed-Term Employment

Domain: Banking / Finance / Trading Market Risk

Skills:

  • SRE experience with Python-based applications (not Java)
  • Exposure to cloud technologies
  • Familiarity with the Athena ecosystem or similar (SecDB, Quartz)
  • Trade Lifecycle / Market Risk / Risk platform experience

Experience: Minimum 8 years

Role description:

We need an experienced SRE to focus predominantly on automation, optimization, and process re-engineering using AI for the Market Risk Platform. Success is measured by capacity created (toil eliminated, fewer manual steps, faster recovery, safer/faster changes), not by acting as the primary BAU support resource. Strong Python skills and provable agentic AI delivery experience are required.

Primary Objectives:

  • Eliminate operational toil and recurring manual work through durable automation
  • Re-engineer support/change processes to reduce handoffs, approval friction and rerun complexity
  • Industrialize reliability operations so existing SREs spend less time firefighting and more time engineering

Key Responsibilities (Automation & Process first):

Automation Engineering (Core):

  • Build production-grade automation in Python (tools, services, workflows) to remove repetitive work: environment checks, dependency validation, automated reruns/reprocessing, safe restarts, drift detection, remediation actions, and standardized operational tasks
  • Create self-service capabilities for common requests (guardrailed, auditable, repeatable)
  • Implement automation with safety built in: idempotency, dry-run modes, approval gates where needed, rollback/undo strategies, and clear audit trails

Process Re-engineering (Core):

  • Map current operation processes (incident/problem/change, release readiness, rerun/recovery, access/entitlements, environment onboarding) and redesign them to remove waste and reduce cycle time.
  • Standardize runbooks/playbooks into executable workflows, and reduce tribal knowledge via templates, checklists, and automated pre-flight controls
  • Define and track operation KPIs (toil hours removed, alert volume reduction, MTTR improvements, change failure rate reduction, rerun time reduction).

Agentic AI:

  • Design and implement agentic workflows that take action using tools/runbooks (e.g., diagnostics, evidence gathering, correlation, guided remediation, change-risk checks, automated rerun orchestration)
  • Put strong controls in place: scoped permissions, deterministic fallbacks, human-in-the-loop approvals for risky actions, evaluation harnesses and measurable outcomes.
  • Productionize with monitoring, logging and post-incident learnings feeding back into the agent/tooling

Observability (enablement for automation)

Required skills & Experience:

  • Senior SRE experience on distributed systems and batch/intraday workloads in a production environment.
  • Strong Python
  • Provable agentic AI experience showing tool integration, guardrails, an evaluation approach, and measurable impact (toil reduction, MTTR reduction, alert reduction, etc.)
  • Demonstrated process optimization ability (removing steps/handoffs, standardizing workflows, implementing lightweight controls with metrics)
  • Strong Linux and troubleshooting fundamentals across application/system/network layers
  • Experience working across mixed estates (on-prem VMs + cloud, with some Kubernetes exposure for operational monitoring/reruns)

Differentiators:

  • Exposure to Banking/Finance Market Risk Domains
  • Experience with and knowledge of the Athena ecosystem or similar (SecDB, Quartz)

Site Reliability Engineer I in London employer: Purview Consultancy Services Ltd

As a Site Reliability Engineer I at our London office, you will join a dynamic team dedicated to innovation in the Banking and Finance sector. We pride ourselves on fostering a collaborative work culture that prioritises employee growth through continuous learning and development opportunities, while also offering competitive benefits and a focus on work-life balance. Our commitment to automation and process optimisation not only enhances operational efficiency but also empowers our engineers to engage in meaningful projects that drive real impact.

Contact Detail:

Purview Consultancy Services Ltd Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer I role in London

✨Tip Number 1

Network like a pro! Get out there and connect with folks in the industry. Attend meetups, webinars, or even just grab a coffee with someone who’s already in the SRE space. You never know when a casual chat could lead to your next big opportunity!

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your Python projects, automation scripts, or any AI-driven solutions you've implemented. This is your chance to demonstrate your expertise and make a lasting impression on potential employers.

✨Tip Number 3

Prepare for those interviews! Research common SRE interview questions and practice your answers. Focus on your experience with automation, process re-engineering, and how you’ve tackled operational challenges in the past. Confidence is key!

✨Tip Number 4

Don’t forget to apply through our website! We’re always on the lookout for talented individuals like you. Make sure your application stands out by tailoring it to highlight your relevant experience in the banking and finance sectors.

We think you need these skills to ace the Site Reliability Engineer I application in London

Site Reliability Engineering (SRE)
Python
Cloud Technologies
Athena Ecosystem
Market Risk Experience
Automation Engineering
Process Re-engineering
Agentic AI
Observability
Linux
Troubleshooting
Kubernetes
Production Environment Experience
Distributed Systems

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your SRE experience, especially with Python-based applications and any exposure to cloud technologies. We want to see how your skills align with our needs!

Showcase Your Projects: Include specific projects where you've implemented automation or process re-engineering. We love seeing real examples of how you've eliminated operational toil and improved efficiency. This will help us understand your hands-on experience!

Be Clear and Concise: When writing your application, keep it clear and concise. Use bullet points for key achievements and avoid jargon unless it's relevant. We appreciate straightforward communication that gets to the point!

Apply Through Our Website: Don't forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it helps us keep everything organised on our end!

How to prepare for a job interview at Purview Consultancy Services Ltd

✨Know Your Python Inside Out

Since the role focuses heavily on Python-based applications, make sure you brush up on your Python skills. Be ready to discuss specific projects where you've used Python for automation and optimisation, and think about how you can demonstrate your coding abilities during the interview.

✨Familiarise Yourself with Market Risk Concepts

Understanding the banking and finance domain, especially market risk, is crucial. Do some research on trade lifecycles and risk platforms. Being able to speak knowledgeably about these topics will show that you're not just technically skilled but also understand the business context.

✨Prepare for Automation Scenarios

The job emphasises eliminating operational toil through automation. Prepare examples of how you've successfully implemented automation in previous roles. Think about the challenges you faced, the solutions you implemented, and the measurable outcomes that resulted from your efforts.
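
If it helps to have something concrete to talk through, the safety properties named in the role description (idempotency, dry-run modes, audit trails) can be illustrated with a very small Python sketch. Everything here is an assumption made for illustration: the `RestartAction` class, the `risk-batch` service name and the in-memory `states` dictionary are hypothetical stand-ins, not anything taken from the employer's platform.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class RestartAction:
    """Hypothetical 'safe restart' step; names and states are illustrative only."""
    dry_run: bool = True
    audit_log: List[str] = field(default_factory=list)

    def ensure_running(self, states: Dict[str, str], service: str) -> bool:
        """Bring `service` to 'running'. Returns True only if a change was made."""
        current = states.get(service, "stopped")
        if current == "running":
            # Idempotency: the desired state already holds, so do nothing.
            self.audit_log.append(f"noop service={service} state=running")
            return False
        if self.dry_run:
            # Dry-run: record the intended change without applying it.
            self.audit_log.append(f"dry-run would-start service={service}")
            return False
        states[service] = "running"
        # Audit trail: record what changed and what the previous state was.
        self.audit_log.append(f"started service={service} previous={current}")
        return True


# In-memory stand-in for real service state (an assumption for this sketch).
states = {"risk-batch": "stopped"}

preview = RestartAction(dry_run=True)
preview.ensure_running(states, "risk-batch")        # logs intent, changes nothing

live = RestartAction(dry_run=False)
first = live.ensure_running(states, "risk-batch")   # applies the change
second = live.ensure_running(states, "risk-batch")  # repeat call is a no-op
```

Being able to explain why the second call does nothing, and why the dry run leaves the state untouched, is the kind of reasoning an interviewer is likely to probe.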

✨Showcase Your Process Re-engineering Skills

This role requires a strong focus on process optimisation. Be ready to discuss how you've mapped and redesigned operational processes in the past. Highlight any metrics you've tracked to demonstrate improvements, such as reduced cycle times or fewer manual steps.
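
Two of the KPIs the role description mentions, MTTR and change failure rate, are simple to compute, and a short sketch can make the conversation about metrics more concrete. The function names and the sample figures below are made up for illustration; real numbers would come from your own incident and change records.

```python
from datetime import datetime, timedelta
from typing import List, Tuple


def mean_time_to_recovery(incidents: List[Tuple[datetime, datetime]]) -> timedelta:
    """Average of (resolved - detected) across incidents."""
    if not incidents:
        return timedelta(0)
    total = sum(
        (resolved - detected for detected, resolved in incidents),
        timedelta(0),
    )
    return total / len(incidents)


def change_failure_rate(total_changes: int, failed_changes: int) -> float:
    """Fraction of changes that caused a failure (0.0 when there were no changes)."""
    if total_changes == 0:
        return 0.0
    return failed_changes / total_changes


# Illustrative data only: (detected, resolved) pairs.
incidents = [
    (datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 1, 9, 30)),    # 30 minutes
    (datetime(2024, 1, 2, 14, 0), datetime(2024, 1, 2, 15, 30)),  # 90 minutes
]
mttr = mean_time_to_recovery(incidents)                         # 1 hour on average
cfr = change_failure_rate(total_changes=40, failed_changes=2)   # 5% of changes
```

Comparing these values before and after an automation project is one straightforward way to show measurable impact.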
