Manager, Cloud Site Reliability Engineering

London | Full-Time | £43,200 - £72,000 / year (est.) | Home office (partial)

At a Glance

  • Tasks: Lead a team ensuring high availability of critical SaaS applications and implement scalable infrastructure.
  • Company: Barracuda is a top cybersecurity firm protecting data, applications, and networks globally.
  • Benefits: Enjoy internal mobility, equity options, and a culture that values your voice and impact.
  • Why this job: Join a passionate team focused on innovation, operational excellence, and making a real difference in cybersecurity.
  • Qualifications: 5+ years in SRE/DevOps leadership, strong technical skills, and experience in team development and strategic planning.
  • Other info: This role offers hybrid work options and opportunities for cross-training.

The predicted salary is between £43,200 and £72,000 per year.

Come join our passionate team! Barracuda is a leading cybersecurity company providing complete protection against complex threats. Our platform protects email, data, applications, and networks with innovative solutions, and a managed XDR service, to strengthen cyber resilience. Hundreds of thousands of IT professionals and managed service providers worldwide trust us to protect and support them with solutions that are easy to buy, deploy, and use.

We know a diverse workforce adds to our collective value and strength as an organization. Barracuda Networks is proud to be an employer that complies with all applicable national, state and local laws pertaining to nondiscrimination and equal opportunity regardless of race, gender, religion, sex, sexual orientation, national origin, or disability.

We seek a passionate, experienced Manager, Cloud Site Reliability Engineering for the Data Protection and Network Security business units, with great technical acumen and a strong background in operations, automation, implementation, and development.

As a Manager, Cloud Site Reliability Engineering, you will lead a team responsible for ensuring the availability and seamless scaling of high-volume, critical SaaS applications. The application portfolio spans a broad spectrum of Data Protection and Network Security products.

What you will be working on:

  • Platform Architecture: Design and implement scalable infrastructure architectures that support high availability and reliability across multiple cloud environments
  • Reliability Engineering: Lead initiatives to improve system reliability, establish SLOs, and implement monitoring and alerting strategies
  • Team Leadership: Build, mentor, and grow a high-performing SRE team while fostering a culture of innovation and continuous improvement
  • Incident Management: Establish and optimize incident response processes, lead major incident reviews, and drive systematic improvements
  • Automation Development: Spearhead automation initiatives to reduce manual operations and improve system reliability
  • Performance Optimization: Lead projects to optimize system performance, capacity planning, and cost efficiency
  • Cross-team Collaboration: Work closely with development teams to implement SRE best practices and drive operational excellence
  • Technical Strategy: Develop and execute technical roadmaps aligned with business goals and scaling requirements
  • Security Integration: Ensure security best practices are embedded in infrastructure and operational processes
  • Knowledge Management: Establish documentation standards and knowledge sharing practices across the organization
  • Vendor Management: Evaluate and manage relationships with technical vendors and service providers
  • Operational Excellence: Drive continuous improvement in operational processes, tooling, and methodologies

What you bring to the role:

  • Technical Leadership Experience: 5+ years of experience leading and managing SRE/DevOps teams, with a proven track record of improving system reliability and performance
  • Architectural Vision: Deep understanding of distributed systems, cloud platforms (AWS/GCP/Azure), and modern infrastructure technologies
  • Operational Excellence: Strong background in implementing SLOs, SLIs, and SLAs, with expertise in incident management and post-mortem processes
  • Team Development: Experience in hiring, mentoring, and growing high-performing technical teams while fostering a culture of continuous learning
  • Strategic Planning: Ability to develop and execute technical roadmaps aligned with business objectives and scalability requirements
  • Problem-Solving Skills: Track record of solving complex technical challenges and implementing sustainable solutions
  • Communication: Excellence in communicating technical concepts to both technical and non-technical stakeholders
  • Automation Expertise: Strong background in infrastructure automation, CI/CD pipelines, and DevOps practices
  • Risk Management: Experience in capacity planning, disaster recovery, and building resilient systems
  • Cross-functional Collaboration: Proven ability to work effectively with product, development, and business teams
  • Change Management: Experience in managing organizational change and driving adoption of new technologies and practices
  • Budget Management: Skills in resource allocation, cost optimization, and managing operational budgets

What you’ll get from us:

A team where you can voice your opinion, make an impact, and know that you and your experiences are valued. Internal mobility: there are opportunities for cross-training and the ability to take your next career step within Barracuda. In addition, you will receive equity in the form of non-qualifying options.

Manager, Cloud Site Reliability Engineering employer: Barracuda Networks

At Barracuda, we pride ourselves on fostering a dynamic and inclusive work environment where innovation thrives. As a Manager in Cloud Site Reliability Engineering, you will not only lead a talented team but also have access to extensive growth opportunities, including internal mobility and cross-training. Our commitment to employee well-being is reflected in our supportive culture and the chance to make a meaningful impact in the cybersecurity landscape.

Contact Detail:

Barracuda Networks Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Manager, Cloud Site Reliability Engineering

✨Tip Number 1

Familiarise yourself with Barracuda's products and services, especially in the areas of Data Protection and Network Security. Understanding their offerings will help you articulate how your experience aligns with their needs during discussions.

✨Tip Number 2

Showcase your leadership skills by preparing examples of how you've successfully built and mentored high-performing teams. Be ready to discuss specific strategies you've used to foster a culture of innovation and continuous improvement.

✨Tip Number 3

Highlight your experience with cloud platforms like AWS, GCP, or Azure. Be prepared to discuss how you've designed scalable infrastructure architectures and improved system reliability in previous roles.

✨Tip Number 4

Demonstrate your problem-solving skills by preparing to discuss complex technical challenges you've faced and the sustainable solutions you've implemented. This will show your ability to handle the demands of the role effectively.

We think you need these skills to ace Manager, Cloud Site Reliability Engineering

Technical Leadership Experience
Architectural Vision
Operational Excellence
Team Development
Strategic Planning
Problem-Solving Skills
Communication Skills
Automation Expertise
Risk Management
Cross-functional Collaboration
Change Management
Budget Management
Incident Management
Performance Optimization
Vendor Management

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in cloud site reliability engineering, particularly focusing on your leadership roles and technical skills. Use keywords from the job description to align your experience with what Barracuda is looking for.

Craft a Compelling Cover Letter: Write a cover letter that showcases your passion for cybersecurity and your understanding of the role. Mention specific projects or achievements that demonstrate your ability to lead SRE teams and improve system reliability.

Showcase Technical Acumen: In your application, emphasise your experience with cloud platforms (AWS, GCP, Azure) and your knowledge of distributed systems. Provide examples of how you've implemented SLOs, SLIs, and SLAs in previous roles.

Highlight Team Leadership Skills: Discuss your experience in building and mentoring high-performing teams. Include examples of how you've fostered a culture of innovation and continuous improvement within your teams.

How to prepare for a job interview at Barracuda Networks

✨Showcase Your Technical Acumen

Be prepared to discuss your experience with cloud platforms like AWS, GCP, or Azure. Highlight specific projects where you designed scalable architectures or improved system reliability, as this aligns closely with the role's requirements.

✨Demonstrate Leadership Skills

Share examples of how you've built and mentored high-performing teams. Discuss your approach to fostering a culture of innovation and continuous improvement, which is crucial for the Manager position.

✨Prepare for Incident Management Scenarios

Expect questions about your experience with incident response processes. Be ready to explain how you've led major incident reviews and implemented systematic improvements in past roles.

✨Communicate Effectively

Practice explaining complex technical concepts in simple terms. This will help you connect with both technical and non-technical stakeholders during the interview, showcasing your communication skills.
