Site Reliability Engineer - SRE

London | Full-Time | £43,200 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Join our tech team as a Site Reliability Engineer, ensuring application reliability and supporting product launches.
  • Company: CFP Energy is an award-winning leader in innovative financial solutions for a low-carbon economy.
  • Benefits: Enjoy 25 days leave, hybrid work, bonuses, and a range of health and wellness perks.
  • Why this job: Be part of a mission-driven company making a real impact on sustainability and energy innovation.
  • Qualifications: Experience with monitoring tools, Kubernetes, IaC, and strong communication skills are essential.
  • Other info: We value diversity and are committed to equal opportunities for all applicants.

The predicted salary is between £43,200 and £72,000 per year.

About CFP Energy: Our mission is to facilitate the transition to a low-carbon economy by providing innovative financial solutions to our clients. We are not just any energy and sustainability group; we’re a dynamic, award-winning powerhouse! At the forefront of environmental innovation, we provide cutting-edge solutions for large-scale energy consumers. Our work ranges from guiding organisations of every size, from small businesses to corporate giants, on their journey to net zero emissions, to expertly managing risks and supplying vital power and gas resources. But wait, there’s more! We’re not content with excelling in our current ventures - we thrive on pioneering new businesses and seizing energy investment opportunities.

About the role: We are looking for a highly capable and experienced Site Reliability Engineer to join our growing tech team. As an SRE, you will be a hands-on coach for the development teams, helping them maintain and improve the reliability of our solutions. You will be part of our DevOps team but spend most of your time working closely with the engineering teams. Our ideal candidate will be passionate about best practices within technology teams, fully supportive of what the group is doing, and keen to make a difference.

Responsibilities:

  • Work with the development teams to build robust and reliable applications on our Kubernetes clusters in Azure.
  • Drive alerting and monitoring solutions that give teams better visibility into the live application ecosystem, using tools such as DataDog, Grafana and PagerDuty.
  • Support product launches by ensuring operational readiness.
  • Continuously improve our release practices and CI/CD pipelines.
  • Participate in on-call support and take the incident commander role when dealing with critical incidents.
  • Run postmortems and root cause analysis to unlock learnings from incidents.
  • Follow agile methodologies and Kanban processes, and bring a coaching mindset with the ability to understand and adapt to diverse cultures and hierarchies.
  • Drive innovation by discovering new technologies, reviewing tooling, and making suggestions on improving our current stack and architecture.
  • Drive the change you seek and be an autonomous, proactive, confident, credible, and persuasive team player.

Experience Required:

  • Expertise with monitoring and alerting platforms, such as ELK, DataDog, Grafana, Loki, etc.
  • Solid understanding of monitoring and alerting best practices.
  • Previous experience as a DevOps/Platform Engineer or SRE.
  • Expertise with IaC tooling (Terraform) and a good understanding of cloud technologies, ideally Azure.
  • Hands-on expertise with Kubernetes and Helm.
  • Fundamental understanding of Git/version control and SDLC pipelines.
  • Excellent communication and interpersonal skills, with the ability to effectively interact with clients, team members, and stakeholders at all levels.
  • Coding experience, whether object-oriented or not, is highly advantageous.

Benefits:

  • 25 days annual leave in addition to Bank holidays.
  • Hybrid working pattern.
  • Discretionary bonus scheme.
  • Company pension scheme.
  • Life and medical insurance, and eyecare scheme.
  • Employee Assistance Program.
  • Cycle to work scheme.
  • Family-friendly policies.
  • Recruit and Reward scheme.
  • Access to the Perkbox benefits package.

Join us in our mission to facilitate the transition to a low-carbon economy. If you are passionate about reliability engineering and want to make a difference, we want to hear from you! The CFP Group is committed to ensuring equal opportunities, fairness of treatment, dignity and respect, and the elimination of all forms of discrimination in the workplace for all employees/contractors and job applicants.

Site Reliability Engineer - SRE employer: CFP Energy (UK) Ltd

At CFP Energy, we pride ourselves on being an exceptional employer, offering a vibrant work culture that fosters innovation and collaboration. Our commitment to employee growth is evident through our comprehensive benefits package, including 25 days of annual leave, hybrid working options, and a discretionary bonus scheme, all designed to support a healthy work-life balance. Join us in our mission to drive the transition to a low-carbon economy while enjoying unique opportunities for professional development and making a meaningful impact in the energy sector.

Contact Details:

CFP Energy (UK) Ltd Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer - SRE role

✨Tip Number 1

Familiarise yourself with the specific tools mentioned in the job description, such as DataDog, Grafana, and Kubernetes. Having hands-on experience or even personal projects showcasing your skills with these technologies can set you apart from other candidates.

✨Tip Number 2

Engage with the SRE community online. Join forums, attend webinars, or participate in local meetups to network with professionals in the field. This not only enhances your knowledge but also helps you make connections that could lead to job opportunities.

✨Tip Number 3

Prepare to discuss your previous experiences with incident management and postmortems. Be ready to share specific examples of how you've handled critical incidents and what you learned from them, as this aligns closely with the responsibilities of the role.

✨Tip Number 4

Showcase your coaching mindset by thinking about how you can support development teams in improving their practices. Be prepared to discuss strategies for fostering collaboration and innovation within tech teams during your interview.

We think you need these skills to ace the Site Reliability Engineer - SRE role

  • Kubernetes
  • Azure Cloud Services
  • Monitoring and Alerting Tools (e.g., DataDog, Grafana, ELK)
  • Incident Management
  • Root Cause Analysis
  • Continuous Integration/Continuous Deployment (CI/CD)
  • Infrastructure as Code (IaC) using Terraform
  • Version Control (Git)
  • Agile Methodologies
  • Coaching and Mentoring Skills
  • Problem-Solving Skills
  • Interpersonal Communication
  • Technical Documentation
  • Proactive Mindset
  • Adaptability to Diverse Cultures

Some tips for your application 🫡

Understand the Role: Before applying, make sure you fully understand the responsibilities and requirements of the Site Reliability Engineer position. Familiarise yourself with the technologies mentioned, such as Kubernetes, Azure, and monitoring tools like DataDog and Grafana.

Tailor Your CV: Customise your CV to highlight relevant experience and skills that align with the job description. Emphasise your expertise in monitoring and alerting platforms, IaC tooling, and any previous roles as a DevOps/Platform Engineer or SRE.

Craft a Compelling Cover Letter: Write a cover letter that showcases your passion for technology and your understanding of best practices within tech teams. Mention specific examples of how you've driven innovation or improved reliability in past roles.

Highlight Soft Skills: In your application, don't forget to mention your excellent communication and interpersonal skills. The role requires effective interaction with clients and team members, so provide examples of how you've successfully collaborated in diverse environments.

How to prepare for a job interview at CFP Energy (UK) Ltd

✨Showcase Your Technical Skills

Be prepared to discuss your expertise with monitoring and alerting platforms like DataDog and Grafana. Highlight your hands-on experience with Kubernetes and IaC tooling such as Terraform, as these are crucial for the role.

✨Demonstrate Your Problem-Solving Abilities

Expect to be asked about past incidents you've managed. Prepare to discuss how you conducted postmortems and root cause analyses, showcasing your ability to learn from mistakes and improve processes.

✨Emphasise Your Communication Skills

As an SRE, you'll need to interact with various teams. Be ready to provide examples of how you've effectively communicated technical concepts to non-technical stakeholders and collaborated within diverse teams.

✨Align with Their Mission

CFP Energy is focused on transitioning to a low-carbon economy. Show your passion for sustainability and innovation in energy solutions, and explain how your values align with their mission during the interview.
