Director of Site Reliability Engineering
Director of Site Reliability Engineering

Director of Site Reliability Engineering

City of Westminster Full-Time 72000 - 108000 £ / year (est.) Home office (partial)
C

At a Glance

  • Tasks: Lead a dynamic team in managing cloud operations across AWS, Azure, and GCP.
  • Company: Join Coalfire, a leader in cybersecurity solutions dedicated to making the world safer.
  • Benefits: Enjoy flexible work options, competitive perks, and comprehensive insurance for you and your family.
  • Why this job: Be part of a passionate team that values growth, innovation, and community impact.
  • Qualifications: 8+ years in technical leadership with cloud infrastructure expertise and strong communication skills.
  • Other info: Diversity and inclusion are at our core; we celebrate individual differences.

The predicted salary is between 72000 - 108000 £ per year.

Coalfire is on a mission to make the world a safer place by solving our clients' hardest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever-changing cybersecurity landscape. We are headquartered in Denver, Colorado with offices across the U.S. and U.K., and we support clients around the world.

We are seeking a technically adept and operationally focused leader to oversee our Cloud Operations group, a growing team responsible for managing client infrastructure across AWS, Azure, and GCP. Our clients rely on us to operate secure, high-performing cloud environments that support regulated workloads and long-term service stability.

As Director of Cloud Operations, you will provide technical and operational leadership to a U.S.-based team of cloud and systems engineers and administrators, guiding them through the implementation of scalable processes, standards, and tooling that improve quality, reliability, and customer satisfaction. You'll also mentor frontline leaders and help shape a team culture built on ownership, communication, and operational discipline.

This is a technical management role; while you won’t be hands-on, you will be expected to engage deeply with cloud architecture, infrastructure operations, automation frameworks, and service delivery workflows. Your success will be measured by your ability to improve execution, reduce operational risk, and build a high-performing team culture centred on accountability, transparency, and excellence.

What You’ll Do

  • Lead and mentor 5+ direct managers and 20-30 indirect reports across cloud operations and systems engineering functions.
  • Build a team culture of accountability, urgency, and client ownership.
  • Support overall performance management and long-term career development practices.
  • Act as an escalation point for technical and operational blockers impacting delivery or customer satisfaction.

Operational Excellence & Service Delivery

  • Drive improvements in incident response, ticket handling, change management, and patch compliance.
  • Standardize runbooks, monitoring, escalation paths, and documentation across client environments.
  • Identify and track key operational metrics such as MTTR, SLA adherence, and customer satisfaction.
  • Partner with internal teams to create more proactive service models that anticipate client issues before escalation.

Strategic and Organizational Growth

  • Collaborate with leadership to expand technical capabilities and develop new professional service offerings.
  • Evaluate emerging technologies and trends to guide innovation within the team’s technical practices.
  • Support organizational growth by creating scalable frameworks for service delivery and team expansion.
  • Participate in strategic planning sessions to align technical direction with business objectives.

Cross-Functional Collaboration

  • Collaborate with other departments to ensure alignment between professional services and broader business goals.
  • Partner with the Security Director on shared concerns such as incident containment, vulnerability remediation, and tooling integration.

What You’ll Bring

  • Proven leadership experience with technical operations teams in a managed services or MSP context.
  • Deep knowledge of cloud infrastructure in AWS, Azure, and GCP environments.
  • Familiarity with infrastructure-as-code tools like Terraform, Ansible, GitHub/GitLab pipelines.
  • Strong communication skills with the ability to manage both internal teams and client expectations.
  • High emotional intelligence and situational awareness during client escalations and internal performance issues.
  • Experience leading operational maturity or ITSM process rollouts (e.g., incident/change/problem management).
  • Familiarity with SRE principles, but adaptable to operationally heavy environments.
  • Metric and KPI management.
  • 8+ years of technical leadership experience, ideally within a managed services or multi-client environment.
  • Proven success in scaling technical organizations and driving operational excellence in a professional services environment.
  • Experience managing key operational metrics such as utilization, margins, and capacity.

Bonus Points

  • Direct experience leading cloud-focused teams or organizations.
  • Background in customer-facing roles, with experience in client escalations or high-level technical discussions.
  • Relevant certifications in cloud platforms (AWS, Azure, GCP) or IT frameworks (ITIL, TOGAF) are preferred.

At Coalfire, you’ll find the support you need to thrive personally and professionally. In many cases, we provide a flexible work model that empowers you to choose when and where you’ll work most effectively - whether you’re at home or an office. Regardless of location, you’ll experience a company that prioritises connection and wellbeing and be part of a team where people care about each other and our communities. You’ll have opportunities to join employee resource groups, participate in in-person and virtual events, and more. And you’ll enjoy competitive perks and benefits to support you and your family, like paid parental leave, flexible time off, certification and training reimbursement, digital mental health and wellbeing support membership, and comprehensive insurance options.

At Coalfire, equal opportunity and pay equity is integral to the way we do business. All qualified applicants will receive consideration for employment without regard to race, colour, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. Coalfire is committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, its services, programs, and activities.

Director of Site Reliability Engineering employer: Coalfire Systems

Coalfire Systems is an exceptional employer that fosters a culture of inclusivity and support, making it an ideal place for professionals seeking to grow in the cybersecurity field. With a flexible work model, competitive benefits, and a commitment to employee wellbeing, Coalfire empowers its team members to thrive both personally and professionally while tackling meaningful challenges in cloud operations. The company's dedication to mentorship and career development ensures that employees have ample opportunities to advance their skills and contribute to a safer digital world.
C

Contact Detail:

Coalfire Systems Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Director of Site Reliability Engineering

✨Tip Number 1

Familiarise yourself with the latest trends in cloud infrastructure, especially within AWS, Azure, and GCP. This knowledge will not only help you during interviews but also demonstrate your commitment to staying updated in a rapidly evolving field.

✨Tip Number 2

Network with current or former employees of Coalfire Systems on platforms like LinkedIn. Engaging with them can provide valuable insights into the company culture and expectations, which can be beneficial during your interview.

✨Tip Number 3

Prepare to discuss your experience with operational excellence and service delivery. Be ready to share specific examples of how you've improved processes or metrics in previous roles, as this aligns closely with what Coalfire is looking for.

✨Tip Number 4

Showcase your leadership style and how you foster team culture. Since the role involves mentoring and leading a diverse team, demonstrating your ability to build accountability and communication will set you apart from other candidates.

We think you need these skills to ace Director of Site Reliability Engineering

Leadership Skills
Cloud Infrastructure Knowledge (AWS, Azure, GCP)
Infrastructure-as-Code Tools (Terraform, Ansible)
Operational Excellence
Incident Management
Change Management
Problem Management
Strong Communication Skills
Emotional Intelligence
Situational Awareness
Metric and KPI Management
Technical Operations Experience
Client Relationship Management
Strategic Planning
Cross-Functional Collaboration
Service Delivery Frameworks

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in cloud operations and technical leadership. Use specific examples that demonstrate your ability to manage teams and improve operational excellence.

Craft a Compelling Cover Letter: In your cover letter, express your passion for cybersecurity and your understanding of the challenges faced by clients. Mention how your leadership style aligns with Coalfire's values and mission.

Highlight Relevant Skills: Emphasise your knowledge of AWS, Azure, and GCP, as well as your familiarity with infrastructure-as-code tools. Be sure to mention any relevant certifications that could set you apart from other candidates.

Showcase Leadership Experience: Detail your previous leadership roles, focusing on how you've built team culture and driven operational improvements. Use metrics to illustrate your success in managing key operational metrics and enhancing customer satisfaction.

How to prepare for a job interview at Coalfire Systems

✨Showcase Your Leadership Skills

As a Director of Site Reliability Engineering, you'll need to demonstrate your leadership experience. Prepare examples of how you've successfully led technical teams, mentored managers, and fostered a culture of accountability and client ownership.

✨Demonstrate Technical Expertise

Be ready to discuss your deep knowledge of cloud infrastructure, particularly in AWS, Azure, and GCP. Familiarise yourself with infrastructure-as-code tools like Terraform and Ansible, and be prepared to explain how you've used them in past roles.

✨Prepare for Operational Excellence Questions

Expect questions about your experience with operational maturity and ITSM processes. Think of specific instances where you've driven improvements in incident response, change management, or service delivery metrics.

✨Emphasise Communication and Collaboration

Strong communication skills are crucial for this role. Be prepared to discuss how you've managed internal teams and client expectations, especially during escalations. Highlight any cross-functional collaboration experiences that showcase your ability to align technical direction with business goals.

Director of Site Reliability Engineering
Coalfire Systems
C
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>