Lead Site Reliability Engineer in London

London, Full-Time, £72,000 - £108,000 per year (est.), Hybrid (4 days in office, 1 set work-from-home day)

At a Glance

Tasks: Lead the design and maintenance of scalable cloud infrastructure while mentoring fellow engineers.
Company: Join a fintech disruptor transforming property investment for everyone.
Benefits: Hybrid work model, competitive salary, and opportunities for professional growth.
Other info: Dynamic team culture focused on innovation and reliability.
Why this job: Make a real impact in a fast-paced environment while shaping the future of asset ownership.
Qualifications: 5+ years in SRE or DevOps with strong GCP and Kubernetes experience.

The predicted salary is between £72,000 and £108,000 per year.

Location: London, Waterloo (hybrid, 4 days in office; Wednesday is our set work-from-home day, though you can come in on Wednesday too if you wish).

We are disrupting one of the world's largest asset classes: property. With £2Bn+ in assets on our platform and 30,000+ users across 70 countries, we're building the future of asset ownership and, in doing so, helping to address wealth inequality. Our product simplifies property investing from start to finish, making real estate investment accessible to everyone.

What you'll love doing:

Working in cross-functional product teams, taking infrastructure and reliability initiatives from concept to production.
Navigating ambiguity in a fast-moving environment where ownership and freedom are core to how we operate.
Building and maintaining robust, scalable infrastructure across our GCP cloud environment.
Working with Kubernetes, Terraform, Cloudflare, and modern observability tooling to ensure our platform runs smoothly.
Collaborating closely with engineering teams to design CI/CD pipelines, improve deployment practices, and champion reliability as a core engineering principle.
Helping to define SRE practices for a high-growth fintech platform.
Mentoring other engineers as we scale our teams and impact.

What you'll be doing:

Designing, implementing, and maintaining our cloud infrastructure on Google Cloud Platform (GCP), ensuring scalability, reliability, and security.
Owning our Kubernetes clusters and containerization strategy - from Docker image optimization to cluster management and deployment orchestration.
Building and evolving our Infrastructure as Code using Terraform, creating modular, testable, well-documented configurations that scale with our rapid growth.
Managing and optimizing our Cloudflare infrastructure, including Workers for edge computing, DNS, CDN, security policies, and performance optimization.
Deploying AI-powered product features in isolated, secure serverless environments.
Implementing comprehensive monitoring and observability using Prometheus and Grafana, defining SLIs/SLOs, and proactively identifying issues before they impact users.
Designing and maintaining CI/CD pipelines with appropriate quality gates, testing strategies, and deployment techniques (blue-green, canary) to enable fast, safe releases.
Ensuring security best practices across our infrastructure - from network design and access controls to secrets management and vulnerability scanning.
Working with engineering teams to improve application reliability, performance, and observability through instrumentation and architectural guidance.
Enabling developer productivity through self-service tooling, clear documentation, and automation of operational tasks.

What we're looking for:

5+ years in SRE, DevOps, or platform engineering roles with production-grade infrastructure experience.
Strong hands-on experience with Google Cloud Platform (GCP).
Expert-level knowledge of Kubernetes and Docker - you've deployed, managed, and troubleshot production clusters.
Proficiency in Terraform for infrastructure as code.
Experience with Cloudflare services, including Workers, DNS, CDN, and security features.
Experience implementing and managing observability stacks with Prometheus and Grafana.
Strong understanding of CI/CD principles, pipeline design, and deployment strategies.
Experience with cloud networking, security groups, VPCs, and network peering.
Solid scripting skills (Shell, Python, or similar).
Experience with blue-green or canary deployment techniques.
Familiarity with programming languages like Go or TypeScript.
Background in implementing security automation and quality gates.
Experience with configuration management tools.
Understanding of SRE principles: SLIs, SLOs, error budgets, and blameless postmortems.
Experience with edge computing and serverless architectures.
Track record of mentoring engineers and fostering a culture of reliability.

Lead Site Reliability Engineer in London employer: GetGround

Join a pioneering fintech company in London, Waterloo, where we are transforming property investment and tackling wealth inequality. Our hybrid work culture promotes flexibility with a dedicated work-from-home day, while our collaborative environment encourages innovation and personal growth. As a Lead Site Reliability Engineer, you'll have the opportunity to shape our infrastructure, mentor fellow engineers, and contribute to a mission that makes real estate accessible to all.

Contact Details:

GetGround Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land the Lead Site Reliability Engineer role in London

✨Tip Number 1

Network like a pro! Attend meetups, webinars, or tech conferences related to SRE and DevOps. Chatting with industry folks can lead to job opportunities that aren’t even advertised yet.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those involving GCP, Kubernetes, and Terraform. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and soft skills. Practice explaining complex concepts simply, as you’ll need to collaborate with cross-functional teams. We want to see how you think!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are genuinely interested in joining our mission to disrupt property investment.

We think you need these skills to ace the Lead Site Reliability Engineer role in London

Google Cloud Platform (GCP)

Kubernetes

Docker

Terraform

Cloudflare

Prometheus

Grafana

CI/CD Pipeline Design

Scripting (Shell, Python)

Security Best Practices

Infrastructure as Code

Observability

Edge Computing

Serverless Architectures

Mentoring Engineers

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that match our Lead Site Reliability Engineer role. Highlight your hands-on experience with GCP, Kubernetes, and Terraform, as these are key to what we’re looking for.

Craft a Compelling Cover Letter: Use your cover letter to tell us why you’re passionate about SRE and how you can contribute to our mission of making property investment accessible. Share specific examples of your past work that align with our goals.

Showcase Your Problem-Solving Skills: In your application, don’t shy away from discussing challenges you've faced in previous roles. We love candidates who can navigate ambiguity and come up with innovative solutions, so share those stories!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!

How to prepare for a job interview at GetGround

✨Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, especially Google Cloud Platform, Kubernetes, and Terraform. Brush up on your hands-on experience with these tools, as you might be asked to solve real-world problems or discuss your past projects involving them.

✨Showcase Your Problem-Solving Skills

Prepare to discuss how you've navigated ambiguity in previous roles. Think of specific examples where you took ownership of a project or initiative, particularly in fast-paced environments. This will demonstrate your ability to thrive under pressure and contribute to their dynamic team.

✨Understand SRE Principles

Familiarise yourself with key SRE concepts like SLIs, SLOs, and error budgets. Be ready to explain how you’ve implemented these principles in your past work, as this will show that you not only understand the theory but can apply it effectively in practice.
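
If it helps to make these concepts concrete before your interview, here is a minimal sketch of the error budget arithmetic behind an availability SLO. The 99.9% target, the 30-day window, and both function names are illustrative assumptions, not figures or tooling from GetGround.

```python
def error_budget_minutes(slo_target: float, window_days: int) -> float:
    """Allowed downtime, in minutes, for an availability SLO over a window.

    slo_target is a fraction (e.g. 0.999 for "three nines"); the values used
    with this function below are hypothetical examples.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be between 0 and 1 (exclusive)")
    total_minutes = window_days * 24 * 60
    return (1.0 - slo_target) * total_minutes


def budget_remaining(slo_target: float, window_days: int,
                     downtime_minutes: float) -> float:
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - downtime_minutes) / budget


if __name__ == "__main__":
    budget = error_budget_minutes(0.999, 30)
    left = budget_remaining(0.999, 30, downtime_minutes=10.8)
    print(f"Budget: {budget:.1f} min, remaining: {left:.0%}")
```

Under these assumptions, a 99.9% target over 30 days allows about 43.2 minutes of downtime, so 10.8 minutes of incidents would leave roughly three quarters of the budget. Being able to walk through a calculation like this, and then say what your team did when the budget ran low, is a simple way to show you can apply the theory.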

✨Prepare Questions That Matter

Think of insightful questions to ask about their infrastructure, team dynamics, and future projects. This shows your genuine interest in the role and helps you assess if the company aligns with your career goals. Plus, it’s a great way to engage with your interviewers!
