Job Board

Companies

Barracuda Networks Inc.

Director, Site Reliability Engineering

Director, Site Reliability Engineering in Reading

Reading Full-Time 72000 - 108000 £ / year (est.) No home office possible

Apply now

At a Glance

Tasks: Lead global reliability initiatives and oversee a distributed team of Site Reliability Engineers.
Company: Barracuda Networks, a leading cybersecurity company trusted by IT professionals worldwide.
Benefits: Leadership role with career growth, health benefits, retirement plan, and flexible time off.
Why this job: Shape the reliability of mission-critical systems and tackle cutting-edge cloud challenges.
Qualifications: 12+ years in infrastructure or SRE roles, with strong leadership and cloud expertise.
Other info: Join a culture of innovation and collaboration while driving AI-powered automation.

The predicted salary is between 72000 - 108000 £ per year.

Barracuda Networks is a leading cybersecurity company providing complete protection against complex threats. Our platform protects email, data, applications, and networks with innovative solutions and a managed XDR service, to strengthen cyber resilience. Hundreds of thousands of IT professionals and managed service providers worldwide trust us to protect and support them with solutions that are easy to buy, deploy, and use.

We are seeking a strategic and visionary Director of Site Reliability Engineering (SRE) in the Cloud Operations group to lead global reliability initiatives across Barracuda's SaaS portfolio. You will oversee a distributed team of Site Reliability Engineers and partner closely with Product Engineering, Security & Compliance, and other Cloud Operations teams to ensure our platforms are highly available, scalable, secure, and cost‑efficient. This role will also drive AI‑powered automation and agentic systems adoption to transform reliability operations.

What will you be working on

Strategic Leadership: Define and execute Barracuda's global SRE strategy, aligning reliability goals with business objectives and customer SLAs.
Operational Excellence: Drive continuous improvement in availability, latency, performance, and cost optimization across all cloud services.
AI & Agentic Systems Integration: Implement AI‑driven observability and anomaly detection for proactive incident prevention; deploy agentic automation systems to manage routine operational tasks, optimize cloud resources, and accelerate remediation workflows; explore LLM‑based runbooks and autonomous agents for incident triage and root cause analysis.
Cross‑Functional Collaboration: Partner with Engineering, Security, and FinOps teams to embed reliability into product design and delivery pipelines.
Architecture & Governance: Influence architectural decisions for reliability, disaster recovery, and observability systems; ensure compliance with security and regulatory standards.
Automation & Tooling: Champion Infrastructure‑as‑Code and CI/CD automation at scale using Terraform, Cloud Formation, GitHub Actions, and Jenkins.
Incident & Risk Management: Facilitate incident response protocols, conduct executive‑level postmortems, and implement proactive risk mitigation strategies.
Service Level Management: Define and enforce SLIs and SLOs across global services; report reliability metrics to executive leadership.
Team Development: Build and mentor a high‑performing SRE organization; foster a culture of ownership, innovation, and collaboration across regions.
Cloud Optimization: Lead initiatives for cost governance and performance tuning in AWS and Azure environments.
Executive Communication: Present reliability roadmaps, KPIs, and risk assessments to senior leadership and stakeholders.

What you bring to the role

Experience: 12+ years in infrastructure, cloud operations, or SRE roles, including 5+ years in leadership positions managing distributed teams.
Cloud Expertise: Deep knowledge of AWS and Azure architectures, security, and operations in large‑scale SaaS environments.
AI & Automation: Experience implementing AI‑driven observability, predictive analytics, and autonomous remediation systems.
Infrastructure as Code: Proven success implementing such as Terraform or CloudFormation at enterprise scale.
CI/CD & Automation: Advanced experience with GitHub Actions, Jenkins, and deployment strategies (blue/green, canary, rolling).
Container Orchestration: Expertise in Kubernetes (EKS, AKS) and containerized workloads.
Observability & Resilience: Strong background in Prometheus, Grafana, ELK, and APM tools; experience designing self‑healing systems.
Programming: Proficiency in Python, Go, or similar languages for automation and tooling.
Leadership Skills: Exceptional ability to lead globally distributed teams, influence cross‑functional stakeholders, and drive cultural change.
Certifications: AWS Solutions Architect/DevOps Professional and Kubernetes certifications (CKA, CKAD) preferred.

What You Will Get from Us

A leadership role where your vision shapes the reliability of mission‑critical systems.
Opportunities for career growth and executive visibility.
High‑quality health benefits, retirement plan with employer match, and flexible time off.
The chance to work on cutting‑edge cloud reliability challenges at scale.

Director, Site Reliability Engineering in Reading employer: Barracuda Networks Inc.

Barracuda Networks is an exceptional employer, offering a dynamic work environment where innovation and collaboration thrive. As a leader in cybersecurity, we provide our employees with opportunities for career growth, competitive health benefits, and a flexible work culture that values diversity and inclusion. Join us to tackle cutting-edge challenges in cloud reliability while being part of a supportive team that encourages professional development and strategic leadership.

Contact Detail:

Barracuda Networks Inc. Recruiting Team

View Barracuda Networks Inc. Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Director, Site Reliability Engineering in Reading

✨Tip Number 1

Network like a pro! Reach out to folks in your industry on LinkedIn or at events. A friendly chat can lead to opportunities that aren’t even advertised yet.

✨Tip Number 2

Show off your skills! Create a portfolio or a personal project that highlights your expertise in SRE and cloud operations. This gives you something tangible to discuss during interviews.

✨Tip Number 3

Prepare for the interview by researching Barracuda Networks and their products. Understand their challenges and think about how you can contribute to their reliability goals.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the team.

We think you need these skills to ace Director, Site Reliability Engineering in Reading

Strategic Leadership

Operational Excellence

AI-driven Observability

Anomaly Detection

Infrastructure as Code

CI/CD Automation

Cloud Expertise

Incident Management

Risk Mitigation

Service Level Management

Team Development

Cost Governance

Executive Communication

Container Orchestration

Programming in Python or Go

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience in cloud operations and SRE roles. We want to see how your skills align with our needs at Barracuda, so don’t hold back on showcasing your relevant achievements!

Showcase Your Leadership Skills: As a Director, we’re looking for someone who can lead and inspire a distributed team. Share examples of how you've successfully managed teams and driven strategic initiatives in your previous roles. We love to see that leadership flair!

Highlight Your Technical Expertise: Don’t forget to mention your deep knowledge of AWS, Azure, and automation tools like Terraform or CloudFormation. We’re keen to know how you’ve implemented AI-driven observability and other tech solutions in your past work.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Plus, it’s super easy!

How to prepare for a job interview at Barracuda Networks Inc.

✨Know Your Stuff

Make sure you brush up on your knowledge of AWS and Azure architectures, as well as the latest trends in AI-driven observability and automation. Be ready to discuss how you've implemented these technologies in past roles, especially in large-scale SaaS environments.

✨Showcase Your Leadership Skills

Prepare examples that highlight your experience in leading distributed teams and driving cultural change. Think about specific challenges you've faced and how you overcame them, particularly in cross-functional collaboration with engineering and security teams.

✨Be Ready for Technical Questions

Expect to dive deep into technical discussions around Infrastructure as Code, CI/CD processes, and container orchestration. Brush up on tools like Terraform, GitHub Actions, and Kubernetes, and be prepared to explain your approach to incident management and risk mitigation.

✨Communicate Your Vision

Articulate your strategic vision for Site Reliability Engineering clearly. Be prepared to discuss how you would align reliability goals with business objectives and customer SLAs, and how you plan to drive continuous improvement in availability and performance across cloud services.

Director, Site Reliability Engineering in Reading

Barracuda Networks Inc.

Location: Reading

Apply now

Director, Site Reliability Engineering in Reading

Reading

Full-Time

72000 - 108000 £ / year (est.)

Apply now
Barracuda Networks Inc.

1001-5000

View Barracuda Networks Inc. Profile

Similar positions in other companies

UK’s top job board for Gen Z

Discover now

Director, Site Reliability Engineering in Reading

At a Glance

Director, Site Reliability Engineering in Reading employer: Barracuda Networks Inc.

StudySmarter Expert Advice 🤫

✨Tip Number 1

✨Tip Number 2

✨Tip Number 3

✨Tip Number 4

We think you need these skills to ace Director, Site Reliability Engineering in Reading

Some tips for your application 🫡

How to prepare for a job interview at Barracuda Networks Inc.

Director, Site Reliability Engineering in Reading

Land your dream job quicker with Premium

Similar positions in other companies

UK’s top job board for Gen Z