AWS Head of Site Reliability Engineering (Must hold current SC) (London)

London Full-Time 43200 - 72000 £ / year (est.) No home office possible

Apply now

Tasks: Lead the SRE team, manage AWS infrastructure, and implement best practices for reliability.
Company: Amber Labs is a dynamic tech consultancy focused on innovation and collaboration.
Benefits: Enjoy flexible work, private medical insurance, 25 days leave, and a vibrant company culture.
Why this job: Join a rapidly growing start-up that values personal growth and encourages experimentation.
Qualifications: 8+ years in SRE or DevOps, with strong AWS expertise and leadership experience required.
Other info: This is a 12-month FTC role; SC clearance is mandatory.

The predicted salary is between 43200 - 72000 £ per year.

AWS Head of Site Reliability Engineering (Must hold current SC)
2 days ago Be among the first 25 applicants

Direct message the job poster from Amber Labs

AWS Head of Site Reliability Engineering (Must hold current SC)

The Company:

At Amber Labs, we are a cutting-edge UK and European technology consultancy that prioritises empowering autonomy, promoting experimentation, and facilitating rapid learning to provide exceptional value to our clients. Our company culture is centred around collaboration, where all colleagues, regardless of their role, work together to minimise risk and shorten delivery times. Our team consists of highly-skilled cross-functional consultants, analysts, and support staff.

Overview:

We are looking for a highly skilled and visionary leader to join our team as the Head of Site Reliability Engineering (SRE) with a strong focus on AWS cloud infrastructure. The ideal candidate will have a deep understanding of cloud architectures, extensive experience in SRE practices, and the ability to lead and scale SRE teams to ensure the availability, performance, and security of our systems.

Key Responsibilities:

Leadership and Team Management: Lead and manage the SRE team to ensure high availability, scalability, and performance of our AWS-based infrastructure. Provide mentorship and guidance to junior and senior engineers, fostering a culture of operational excellence and continuous improvement.

Cloud Infrastructure Management: Oversee the design, implementation, and maintenance of cloud infrastructure in AWS, ensuring the systems are secure, reliable, and highly available. Use best practices for AWS services, automation, and monitoring.

SRE Practices Implementation: Establish and lead the implementation of SRE principles, such as Service Level Objectives (SLOs), Service Level Indicators (SLIs), and Error Budgets, to drive the team's focus on reliability.

Incident Management: Lead incident response efforts, root cause analysis (RCA), and post-incident reviews to improve system reliability. Ensure rapid response to production issues and minimize downtime.

Performance Optimization: Drive initiatives for performance tuning, cost optimization, and efficient use of AWS resources. Ensure the infrastructure can scale to meet the demands of the business.

Automation and Continuous Improvement: Champion the automation of manual tasks, such as deployments, monitoring, and scaling, using tools like Terraform, CloudFormation, Jenkins, and other CI/CD platforms.

Collaboration: Work closely with cross-functional teams (Engineering, DevOps, Security, etc.) to ensure seamless collaboration in achieving business and technical goals.

Monitoring and Alerts: Implement and maintain robust monitoring, alerting, and logging systems to detect issues before they impact the business, using AWS CloudWatch, Prometheus, Grafana, etc.

Cost Management: Help optimize AWS costs while maintaining operational efficiency and reliability.

Required Qualifications:

Experience: 8+ years of experience in Site Reliability Engineering, DevOps, or similar roles, with at least 2 years in a leadership position.

AWS Expertise: Extensive experience with AWS services, such as EC2, S3, Lambda, RDS, VPC, CloudFormation, CloudWatch, etc. Hands-on experience with cloud architecture and design.

SRE Best Practices: Deep understanding of SRE principles and frameworks, including SLOs, SLIs, and Error Budgets.

Incident Management: Proven experience in incident management, including response, recovery, root cause analysis, and post-mortem reporting.

Automation Tools: Proficient in automation tools like Terraform, CloudFormation, Jenkins, and other CI/CD tools.

Preferred Qualifications:

Certifications: AWS Certified Solutions Architect – Professional, AWS Certified DevOps Engineer, or other relevant certifications.

Agile Methodologies: Experience with Agile and Lean practices in a cloud-native environment.

Competitive salary and performance-based bonus structure.

Join a rapidly expanding start-up where personal growth is a part of our DNA.

Benefit from a flexible work environment focused on deliverable outcomes.

Receive private medical insurance through Aviva.

Enjoy the benefits of a company pension plan through Nest.

25 days of annual leave plus UK bank holidays.

Access Perkbox, a global employee rewards platform offering discounts, perks, and wellness resources.

Participate in a generous employee referral program.

A highly collaborative and collegial environment with opportunities for career advancement.

Be encouraged to take bold steps and embrace a mindset of experimentation.

Choose your preferred device, PC or Mac.

Diversity & Inclusion:

Here at Amber Labs, we are dedicated to fostering an inclusive and equitable workplace for all. Our commitment to diversity, equality, and inclusion includes:

Valuing the unique experiences, perspectives, and backgrounds of all employees and creating an environment where everyone feels welcomed, respected, and valued.

Prohibiting all forms of harassment, bullying, discrimination, and victimisation and promoting a culture of dignity and respect for all.

Educating all new hires on our Diversity and Inclusion policies and ensuring they are aware of their rights and responsibilities to create a safe and inclusive workplace.

By taking these steps, we are dedicated to building a workplace that reflects and celebrates the diversity of our employees and communities.

This role at Amber Labs is a 12 Month FTC position, and all employees are required to meet the Baseline Personnel Security Standard (BPSS) and hold current SC. Please be advised that, at this time, we are unable to consider candidates who require sponsorship or hold a visa of any type.

What Happens Next?

Our Talent Acquisition Team will be in touch to advise you on the next steps. We have a two-stage interview process for most of our consultants. In certain cases, we may include a third and final stage, which is a conversation with the company Partners. This will only be considered if deemed necessary.

Referrals increase your chances of interviewing at Amber Labs by 2x

Get notified about new Head of Engineering jobs in London Area, United Kingdom .

London, England, United Kingdom 2 weeks ago

London, England, United Kingdom 1 week ago