Site reliability engineer (UK)

Full-time | £36,000 - £60,000 / year (est.) | Home office (partial)

At a Glance

  • Tasks: Automate and enhance system reliability for cutting-edge AI platforms.
  • Company: Join WRITER, a leader in enterprise generative AI with a dynamic culture.
  • Benefits: Generous PTO, medical insurance, parental leave, and wellness stipends.
  • Why this job: Make a real impact on AI-powered workflows and drive innovation.
  • Qualifications: 7+ years in site reliability engineering with cloud expertise.
  • Other info: Hybrid role with excellent career growth and team collaboration.

The predicted salary is between £36,000 and £60,000 per year.

About WRITER

WRITER is where the world's leading enterprises orchestrate AI-powered work. Our vision is to expand human capacity through superintelligence. We are proving it’s possible – through powerful, trustworthy AI that unites IT and business teams to unlock enterprise-wide transformation. With WRITER's end-to-end platform, hundreds of companies like Mars, Marriott, Uber, and Vanguard are building and deploying AI agents that are grounded in their company's data and fueled by WRITER's enterprise-grade LLMs. Valued at $1.9B and backed by industry-leading investors, WRITER is rapidly cementing its position as the leader in enterprise generative AI. Founded in 2020 with office hubs in San Francisco, New York City, Austin, Chicago, and London, our team thinks big and moves fast, and we’re looking for smart, hardworking builders and scalers to join us on our journey to create a better future of work with AI.

About the role

At WRITER, our mission to expand human capacity with superintelligence relies on a foundational truth: our platform must be available, performant, and reliable, 24/7. As a site reliability engineer, you’ll be at the heart of making this a reality, impacting every enterprise customer who trusts us with their AI-powered workflows. This isn’t just about keeping the lights on; it’s about pushing the boundaries of what’s possible, proactively identifying and solving complex systemic challenges, and laying the groundwork for our rapid growth and the evolving demands of enterprise generative AI. You’ll build resilient systems, automate across the stack, and champion reliability best practices, directly enabling our ambitious product roadmap and ensuring our customers always have access to the powerful tools they need. This is a hybrid position, based out of our New York City or London hubs. You’ll report to our director of engineering.

What you’ll do

  • Automate operational tasks and infrastructure management by developing robust tools and platforms using Python, Go, or similar languages, significantly reducing manual toil across our production environment.
  • Design and implement scalable, fault-tolerant infrastructure solutions on public cloud providers (AWS, GCP, Azure) to support WRITER's rapidly expanding, high-traffic AI platform.
  • Own the reliability, performance, and efficiency of WRITER’s core services, defining and upholding stringent Service Level Objectives (SLOs) and Error Budgets.
  • Own the observability stack (monitoring, logging, and alerting) to ensure rapid detection of issues across our complex distributed systems.
  • Lead incident response, post-mortems, and root cause analyses, applying learnings to proactively prevent future outages and build a more resilient system architecture.
  • Collaborate closely with product and engineering teams, providing expert guidance on system design for reliability, performance, and scalability from conception through launch.

What you need

  • A solid 7+ years of experience in site reliability engineering, DevOps, or a similar role focused on building and operating large-scale, high-availability production systems.
  • Deep expertise with cloud platforms (AWS strongly preferred), containerization technologies like Docker and Kubernetes, and Infrastructure-as-Code tools such as Terraform.
  • Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring.
  • Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) to maintain system health and performance.
  • Demonstrated ability to challenge the status quo, proactively identify systemic weaknesses, and propose innovative solutions to complex reliability problems.
  • Excellent communication, collaboration, and problem-solving skills, with a talent for building strong relationships and connecting with cross-functional teams.
  • A strong sense of ownership and accountability, eager to own mission-critical systems and drive them toward peak performance and unparalleled reliability.

Benefits & perks (UK full-time employees)

  • Generous PTO, plus company holidays.
  • Comprehensive medical and dental insurance.
  • Paid parental leave for all parents (12 weeks).
  • Fertility and family planning support.
  • Early-detection cancer testing through Galleri.
  • Competitive pension scheme and company contribution.
  • Annual work-life stipends for wellness, learning and development.
  • Company-wide off-sites and team off-sites.
  • Competitive compensation and company stock options.

Site reliability engineer (UK) employer: The Rundown AI, Inc.

WRITER is an exceptional employer that champions innovation and collaboration, offering a dynamic work culture where employees are empowered to push the boundaries of AI technology. With generous benefits including comprehensive medical coverage, competitive compensation, and ample opportunities for professional growth, WRITER fosters an environment that prioritises employee well-being and development. Located in vibrant hubs like London, team members enjoy a hybrid work model that promotes work-life balance while contributing to groundbreaking advancements in enterprise generative AI.

Contact details:

The Rundown AI, Inc. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site reliability engineer (UK) role

✨Tip Number 1

Network like a pro! Reach out to current or former employees at WRITER on LinkedIn. A friendly chat can give you insider info and maybe even a referral, which can really boost your chances.

✨Tip Number 2

Prepare for the interview by brushing up on your technical skills and understanding WRITER's mission. You want to show that you’re not just a fit for the role, but also passionate about their vision of AI-powered work.

✨Tip Number 3

Practice common SRE scenarios and problem-solving questions. You need to demonstrate your ability to tackle real-world challenges, so get comfortable discussing your past experiences and how they relate to the job.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re serious about joining the WRITER team and contributing to their exciting journey.

We think you need these skills to ace the Site reliability engineer (UK) role

Site Reliability Engineering
DevOps
Cloud Platforms (AWS, GCP, Azure)
Containerization Technologies (Docker, Kubernetes)
Infrastructure-as-Code (Terraform)
Programming Languages (Python, Java, Go)
Monitoring and Logging Tools (Prometheus, Grafana, ELK Stack)
Incident Response
Root Cause Analysis
System Design for Reliability
Collaboration Skills
Problem-Solving Skills
Communication Skills
Ownership and Accountability

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud platforms, automation, and any relevant programming languages like Python or Go. We want to see how your skills align with our mission!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for reliability engineering and how you can contribute to WRITER's vision. Be sure to mention specific projects or experiences that showcase your problem-solving skills.

Showcase Your Technical Skills: Don’t hold back on showcasing your technical expertise! Include details about your experience with monitoring tools, containerization technologies, and Infrastructure-as-Code. We love seeing candidates who can challenge the status quo and propose innovative solutions.

Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and ensure it reaches the right team. Let’s get started on this journey together!

How to prepare for a job interview at The Rundown AI, Inc.

✨Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS, Docker, and Kubernetes. Brush up on your Python or Go skills, as you'll likely be asked to demonstrate your proficiency in these languages during the interview.

✨Understand Reliability Principles

Familiarise yourself with concepts like Service Level Objectives (SLOs) and Error Budgets. Be prepared to discuss how you’ve implemented these in past roles and how they can be applied to WRITER’s platform to ensure high availability and performance.

✨Showcase Problem-Solving Skills

Prepare examples of complex reliability issues you've faced and how you resolved them. Highlight your ability to challenge the status quo and propose innovative solutions, as this aligns with WRITER's mission to push boundaries.

✨Communicate Effectively

Since collaboration is key in this role, practice articulating your thoughts clearly. Be ready to explain technical concepts to non-technical team members, showcasing your excellent communication skills and ability to build strong relationships across teams.
