At a Glance
- Tasks: Build resilient systems and automate operations to enhance reliability and performance.
- Company: Join a forward-thinking tech company focused on innovation and collaboration.
- Benefits: Competitive pay, great benefits, and opportunities for career growth.
- Why this job: Make a real impact on critical systems and shape the future of digital infrastructure.
- Qualifications: Experience in SRE or DevOps, strong scripting skills, and a passion for automation.
- Other info: Work with cutting-edge tools in a dynamic environment that values learning.
The predicted salary is between £36,000 and £60,000 per year.
About the Role
Are you passionate about building resilient systems and eliminating operational toil through automation? We’re looking for a Site Reliability Engineer (SRE) to join our high-impact team and help shape the future of our digital infrastructure. As an SRE, you’ll blend software engineering with systems engineering to ensure the reliability, availability, and performance of our platforms. You’ll work on mission-critical systems, drive automation at scale, and collaborate across teams to embed reliability into every layer of our technology stack.
What You’ll Do
- Ensure the availability, scalability, and performance of systems through proactive monitoring and capacity planning.
- Lead incident response and root cause analysis, and implement preventive measures to avoid recurrence.
- Develop automation tools and scripts to reduce manual operations and improve system resilience.
- Optimize system performance and resource usage, identifying and resolving bottlenecks.
- Collaborate with development and product teams to integrate SRE best practices into the software lifecycle.
- Contribute to the evolution of our SLIs, SLOs, and error budgets to drive reliability metrics.
- Stay current with industry trends and contribute to our internal engineering communities.
What You Bring
- Proven experience as an SRE, DevOps Engineer, or Systems Engineer in a complex, high-availability environment.
- Deep expertise in Microsoft SQL Server (2016–2022), including performance tuning, high availability, and architecture.
- Strong scripting skills (e.g., PowerShell) and experience with automation/configuration tools like Ansible or Chef.
- Familiarity with observability tools, monitoring frameworks, and incident management practices.
- A mindset focused on eliminating toil, improving developer experience, and scaling operations through code.
- Excellent communication and collaboration skills.
Bonus Points
- Experience with cloud platforms (Azure, AWS, or GCP).
- Background in database automation and estate standardization.
- Knowledge of security and compliance in regulated environments.
Why Join Us?
- Work on high-impact systems that power critical business operations.
- Be part of a forward-thinking engineering culture that values innovation, learning, and collaboration.
- Access to cutting-edge tools and technologies.
- Competitive compensation, benefits, and career growth opportunities.
Site Reliability Engineer employer: Anson McCade
Contact Details:
Anson McCade Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site Reliability Engineer role
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with SREs on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your automation tools, scripts, and any projects that highlight your SRE expertise. This gives potential employers a tangible look at what you can bring to the table.
✨Tip Number 3
Prepare for those interviews! Brush up on your incident response strategies and be ready to discuss how you've tackled performance issues in the past. We want to see your problem-solving skills in action!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about joining our team.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV reflects the skills and experiences that match the SRE role. Highlight your expertise in Microsoft SQL Server, automation tools, and any relevant projects you've worked on. We want to see how you can bring value to our team!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for building resilient systems and how you've tackled operational challenges in the past. Let us know why you're excited about joining the team and how you can contribute to its mission.
Showcase Your Problem-Solving Skills: In your application, don’t shy away from sharing specific examples of how you've resolved incidents or improved system performance. We love seeing candidates who can think critically and act decisively under pressure!
Apply Through Our Website: We encourage you to apply directly through our website for a smoother process. It helps us keep track of your application and ensures you get the attention you deserve. Plus, it’s super easy!
How to prepare for a job interview at Anson McCade
✨Know Your Systems Inside Out
Make sure you’re well-versed in the systems and technologies mentioned in the job description, especially Microsoft SQL Server and automation tools like Ansible or Chef. Brush up on your performance tuning skills and be ready to discuss how you've optimised system performance in past roles.
✨Showcase Your Automation Skills
Prepare examples of how you've developed automation tools or scripts to reduce manual operations. Be specific about the challenges you faced and how your solutions improved system resilience. This will demonstrate your ability to eliminate toil and enhance developer experience.
✨Be Ready for Incident Response Scenarios
Expect questions around incident response and root cause analysis. Think of a few incidents you've managed, what steps you took to resolve them, and how you implemented preventive measures. This shows your proactive approach to maintaining system reliability.
✨Communicate and Collaborate
Since collaboration is key in this role, practice articulating how you’ve worked with development and product teams in the past. Highlight any experiences where you integrated SRE best practices into the software lifecycle, as this will show your ability to work cross-functionally.