At a Glance
- Tasks: Automate and enhance system reliability for cutting-edge AI platforms.
- Company: Join WRITER, a leader in enterprise generative AI with a dynamic culture.
- Benefits: Generous PTO, medical insurance, parental leave, and wellness stipends.
- Why this job: Make a real impact on AI-powered workflows and drive innovation.
- Qualifications: 7+ years in site reliability engineering with cloud expertise.
- Other info: Hybrid role with excellent career growth and team collaboration.
The predicted salary is between £36,000 and £60,000 per year.
About WRITER
WRITER is where the world's leading enterprises orchestrate AI-powered work. Our vision is to expand human capacity through superintelligence. We are proving it’s possible – through powerful, trustworthy AI that unites IT and business teams to unlock enterprise-wide transformation. With WRITER's end-to-end platform, hundreds of companies like Mars, Marriott, Uber, and Vanguard are building and deploying AI agents that are grounded in their company's data and fueled by WRITER's enterprise-grade LLMs. Valued at $1.9B and backed by industry-leading investors, WRITER is rapidly cementing its position as the leader in enterprise generative AI. Founded in 2020 with office hubs in San Francisco, New York City, Austin, Chicago, and London, our team thinks big and moves fast, and we’re looking for smart, hardworking builders and scalers to join us on our journey to create a better future of work with AI.
About the role
At WRITER, our mission to expand human capacity with superintelligence relies on a foundational truth: our platform must be available, performant, and reliable, 24/7. As a site reliability engineer, you’ll be at the heart of making this a reality, impacting every enterprise customer who trusts us with their AI-powered workflows. This isn’t just about keeping the lights on; it’s about pushing the boundaries of what’s possible, proactively identifying and solving complex systemic challenges, and laying the groundwork for our rapid growth and the evolving demands of enterprise generative AI. You’ll build resilient systems, automate across the stack, and champion reliability best practices, directly enabling our ambitious product roadmap and ensuring our customers always have access to the powerful tools they need. This is a hybrid position, based out of our New York City or London hubs. You’ll report to our director of engineering.
What you’ll do
- Automate operational tasks and infrastructure management by developing robust tools and platforms using Python, Go, or similar languages, significantly reducing manual toil across our production environment.
- Design and implement scalable, fault-tolerant infrastructure solutions on public cloud providers (AWS, GCP, Azure) to support WRITER's rapidly expanding, high-traffic AI platform.
- Own the reliability, performance, and efficiency of WRITER’s core services, defining and upholding stringent Service Level Objectives (SLOs) and Error Budgets.
- Own the observability stack for monitoring, logging, and alerting systems to ensure rapid detection of issues across our complex distributed systems.
- Lead incident response, post-mortems, and root cause analyses, applying learnings to proactively prevent future outages and build a more resilient system architecture.
- Collaborate closely with product and engineering teams, providing expert guidance on system design for reliability, performance, and scalability from conception through launch.
What you need
- A solid 7+ years of experience in site reliability engineering, DevOps, or a similar role focused on building and operating large-scale, high-availability production systems.
- Deep expertise with cloud platforms (AWS strongly preferred), containerization technologies like Docker and Kubernetes, and Infrastructure-as-Code tools such as Terraform.
- Strong proficiency in programming languages such as Python, Java, or Go for automation and monitoring.
- Knowledge of monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) to maintain system health and performance.
- Demonstrated ability to challenge the status quo, proactively identify systemic weaknesses, and propose innovative solutions to complex reliability problems.
- Excellent communication, collaboration, and problem-solving skills, with a talent for building strong relationships and connecting with cross-functional teams.
- A strong sense of ownership and accountability, eager to own mission-critical systems and drive them toward peak performance and unparalleled reliability.
Benefits & perks (UK full-time employees)
- Generous PTO, plus company holidays.
- Comprehensive medical and dental insurance.
- Paid parental leave for all parents (12 weeks).
- Fertility and family planning support.
- Early-detection cancer testing through Galleri.
- Competitive pension scheme and company contribution.
- Annual work-life stipends for wellness, learning and development.
- Company-wide off-sites and team off-sites.
- Competitive compensation and company stock options.
Site reliability engineer (UK) employer: writer.com
Contact Details:
writer.com Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site reliability engineer (UK) role
✨Tip Number 1
Network like a pro! Reach out to current employees at WRITER on LinkedIn or other platforms. A friendly chat can give you insider info and might just get your foot in the door.
✨Tip Number 2
Show off your skills! Prepare a mini-project or a portfolio that highlights your experience with cloud platforms and automation tools. This hands-on approach can really impress during interviews.
✨Tip Number 3
Be ready to discuss real-world problems. Think of examples where you've tackled reliability issues or improved system performance. We want to see how you think on your feet!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team at WRITER.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud platforms, automation, and any relevant programming languages like Python or Go. We want to see how your skills align with our mission!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for reliability engineering and how you can contribute to WRITER's vision. Be sure to mention specific projects or experiences that demonstrate your problem-solving skills.
Showcase Your Technical Skills: Don’t hold back on showcasing your technical expertise! Include details about your experience with monitoring tools, containerization technologies, and Infrastructure-as-Code. We love seeing how you’ve tackled complex challenges in the past.
Apply Through Our Website: We encourage you to apply through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and ensure it reaches the right people. Let’s get started on this journey together!
How to prepare for a job interview at writer.com
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS, Docker, and Kubernetes. Brush up on your Python or Go skills, as you'll likely be asked to demonstrate your proficiency in these languages during the interview.
✨Showcase Problem-Solving Skills
Prepare to discuss specific instances where you've identified and solved complex reliability issues. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your proactive approach to challenges.
✨Understand the Company’s Vision
Familiarise yourself with WRITER's mission and how they leverage AI to enhance business operations. Being able to articulate how your role as a site reliability engineer contributes to this vision will show your genuine interest in the company.
✨Prepare for Collaboration Questions
Since the role involves working closely with product and engineering teams, be ready to discuss your experience in cross-functional collaboration. Think of examples where you’ve successfully communicated technical concepts to non-technical stakeholders.