Site Reliability Engineer in London

Job Board

Companies

Reapit

Site Reliability Engineer

Site Reliability Engineer in London

London Full-Time 60000 - 80000 £ / year (est.) Home office (partial)

Apply Now

At a Glance

Tasks: Deploy and maintain AWS infrastructure, ensuring system health and performance.
Company: Reapit, a leading tech provider for estate agencies with over 25 years of experience.
Benefits: Flexible working, competitive salary, generous leave, and in-house training opportunities.
Other info: Collaborative environment focused on continuous learning and professional growth.
Why this job: Join a dynamic team and make a real impact on innovative technology solutions.
Qualifications: 5+ years in DevOps with strong skills in AWS, Terraform, and scripting languages.

The predicted salary is between 60000 - 80000 £ per year.

Reapit is the original, end-to-end business technology provider for estate agencies of all sizes. We’ve been helping sales and lettings agents to build relationships and grow their businesses for more than 25 years. Our technology connects property professionals in Europe, the Middle East, Australia, and New Zealand with buyers, sellers, tenants and landlords to power the relationships that change lives. In the United Kingdom and Ireland, Reapit’s market-leading technology product suite provides estate and lettings agents with powerful tools covering sales, lettings, property management, block management, client accounts and analytics, underpinned by a robust, security infrastructure.

What you’ll be doing:

Deploy, and maintain robust, scalable AWS infrastructure utilizing Infrastructure as Code (IaC) principles (e.g., CloudFormation, Terraform).
Implement, and maintain comprehensive monitoring, logging, and alerting solutions to ensure system health and performance.
Respond promptly and effectively to critical system alerts and incidents, performing root cause analysis (RCA) and implementing preventative measures.
Manage and execute scheduled maintenance windows, coordinating necessary system updates, patching, and upgrades with minimal downtime.
Provide out-of-hours on‑call support for major incidents and escalations to restore critical service functionality quickly and efficiently.
Automate repetitive operational tasks (“toil”) to increase system efficiency, reduce manual effort, and free up engineering time.
Drive continuous improvement in system reliability, performance, and recoverability (Disaster Recovery/Business Continuity Planning).
Collaborate closely with development teams (DevOps) to improve the entire software lifecycle, focusing on service stability and release engineering.
Establish and refine Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) for critical services.
Conduct capacity planning and performance testing to ensure the AWS environment can handle current and future load requirements.

Who we’re looking for:

5 years + experience in DevOps
Core technical skills in:

Scripting: bash, PowerShell, CDK, Python
Infrastructure: AWS, Azure (nice to have)
IaC: Terraform, CDK

We value the following attributes:

Technically excellent engineer – deep hands‑on expertise with cloud infrastructure, automation, and the entire DevOps toolchain.
Genuine team player – belief that the team’s success is yours, collaborating across disciplines.
Exceptional communicator – articulates technical concepts clearly to engineers and non‑technical stakeholders, documents thoroughly.
Passionate about the craft – loves building reliable systems, takes pride in clean infrastructure code.
Ownership‑driven professional – takes full accountability, follows through on commitments.
Collaborative problem‑solver – seeks input, builds consensus, approaches disagreements with curiosity.
Continuous learner who shares knowledge – stays current, mentors, pairs, and documents.
Calm and decisive under pressure – remains composed during incidents, makes sound decisions, keeps focus.

What your impact and success looks like:

Within 1 month: Complete onboarding with demonstrated understanding of key systems, infrastructure architecture, deployment processes, and on‑call procedures. Respond to and resolve production incidents independently for at least three core services, creating or updating runbooks. Become the go‑to person for our main UK system.
Within 3 months: Own incident response end‑to‑end, lead post‑mortems, drive remediation to completion. Deliver at least two substantial automation or tooling improvements that reduce operational overhead, improve deployment speed, or enhance reliability. Implement monitoring and alerting enhancements for critical services that improve observability, reduce alert fatigue, or decrease mean time to detection.
Within 6 months: Deliver a major operations project to production (e.g., new product/system, infrastructure migration, disaster recovery implementation). Demonstrate quantifiable improvements in reliability metrics under your ownership. Establish yourself as an expert, help teammates, influence operational decisions, and contribute to technical improvements.

What’s in it for you?

We operate a Flexible Working Policy; you’ll work from London or Solihull office two days a month.
The role offers a highly competitive salary and benefits:

5.5% employer pension contribution
20 days annual leave (plus a day for your birthday), increasing by a day for every year worked
Closed over Christmas, giving you time back with friends and family
In‑house training and access to Go1 (world’s largest online learning library)
Regular local and company‑wide social events, including Tucker Thursday – mouth‑watering cuisine delivered straight to the office once a month
Retail benefits and savings via our Benefits partner, Zest

Equal Employment Opportunity: We care about our industry and want it to become a more inclusive and diverse place to work. We are committed to Equal Employment Opportunity through attracting and retaining a complementary team of employees and building an inclusive environment for all. We welcome new ideas, thinking, and approaches, while listening to all employees.

Site Reliability Engineer in London employer: Reapit

Reapit is an exceptional employer that fosters a collaborative and innovative work culture, making it an ideal place for Site Reliability Engineers to thrive. With a strong commitment to employee growth, we offer extensive training opportunities, a flexible working policy, and competitive benefits, including generous annual leave and a robust pension contribution. Our London and Solihull locations provide a vibrant environment where you can connect with like-minded professionals while enjoying regular social events and a focus on inclusivity.

Contact Details:

Reapit Recruitment Team

View Reapit profile

StudySmarter Expert Advice🤫

We think this is how you could land Site Reliability Engineer in London

✨Join the IT Consultancy Buzz

Get involved in local or virtual IT consultancy meetups and forums. This is where we can rub shoulders with industry professionals, get insights into what Reapit values, and even spot unadvertised opportunities. Don't miss out on these chances to make a name for ourselves in the IT world!

✨Show Off Your Skills

Create a personal project or case study relevant to the challenges Reapit might face. Use platforms like GitHub or Medium to share your findings. This not only demonstrates our consulting skills but shows a proactive attitude, making us stand out from the crowd when applying for that full-time gig.

✨Leverage LinkedIn for Connections

Follow and engage with the relevant thought leaders and influencers in IT consultancy on LinkedIn. Share insightful content and join discussions to gain visibility. A well-placed comment or shared article could catch the attention of someone at Reapit!

✨Direct Apply to Reapit

Let's not forget to apply directly through the Reapit website! Tailor your application to showcase our understanding of their consulting style and how we can contribute to their projects. A personalised approach can make a huge difference in landing that full-time position!

We think you need these skills to ace Site Reliability Engineer in London

AWS

Infrastructure as Code (IaC)

CloudFormation

Terraform

Monitoring Solutions

Logging Solutions

Alerting Solutions

Root Cause Analysis (RCA)

Disaster Recovery

Business Continuity Planning

Scripting (bash, PowerShell, CDK, Python)

Collaboration with Development Teams (DevOps)

Service Level Indicators (SLIs)

Service Level Objectives (SLOs)

Service Level Agreements (SLAs)

Some tips for your application 🫡

Showcase Your Problem-Solving Skills:In IT consulting, it's all about problem-solving, so make sure your CV highlights your analytical skills and any relevant projects you've tackled. Mention specific technologies or methodologies you've used to resolve issues or improve processes; this shows you can think critically and deliver results, which is vital for us at Reapit.

Highlight Relevant Certifications:Certifications like ITIL, PMP, or even specific tech stack qualifications can really make you stand out. Make sure to include these in your CV, as they not only demonstrate your expertise but also your commitment to staying current in the field. We love seeing candidates who are proactive about their professional development!

Tailor Your Cover Letter:Your cover letter is your chance to connect personally with us at Reapit. Share stories about your experiences in IT consulting, and how they shaped your desire to join our team. Mention why you’re excited about this particular role, and how you see yourself contributing to our projects.

Keep It Clear and Concise:We're all busy, so make sure your application is easy to read. Use bullet points for key achievements, and don’t overload us with jargon. A clean, professional layout goes a long way. Remember, the clearer your application, the more likely we are to invite you in for an interview!

How to prepare for a job interview at Reapit

✨Brush Up on Your Technical Skills

For an IT consulting role, be ready to demonstrate your technical prowess. You might face questions on systems integration, cloud technologies, or even troubleshooting specific software. If you have experience with tools like AWS, Azure, or even specific programming languages, make sure you can talk about them fluently.

✨Showcase Your Problem-Solving Approach

IT consulting is all about solving problems for clients. Think about how you can illustrate your approach to a past challenge using the STAR method (Situation, Task, Action, Result). It's a great way to show how you tackle complex issues and come up with effective solutions.

✨Know the Business Impact of IT Solutions

When discussing your experiences, focus not just on the tech solutions you implemented, but also on their business impact. Employers want to see that you can connect IT with organisational goals. Prep examples that highlight how your tech contributions improved efficiency or reduced costs for past clients or projects.

✨Prepare for Behavioural Questions

Since IT consulting often involves teamwork and client interactions, expect behavioural questions that assess your interpersonal skills. Be prepared with examples that demonstrate your adaptability, communication skills, and how you handle client feedback. Before the interview, think of situations where you worked closely with clients to create effective IT strategies or changes.

Site Reliability Engineer in London

Reapit

Location: London

Apply Now

Site Reliability Engineer in London

At a Glance

Site Reliability Engineer in London employer: Reapit

StudySmarter Expert Advice🤫

We think you need these skills to ace Site Reliability Engineer in London

Some tips for your application 🫡

How to prepare for a job interview at Reapit

Company

Product

Help