Site Reliability Engineer London DV cleared
Site Reliability Engineer London DV cleared

Site Reliability Engineer London DV cleared

London Full-Time 48000 - 72000 £ / year (est.) No home office possible
Go Premium
Evolution Recruitment Solutions

At a Glance

  • Tasks: Join our team to enhance system reliability and performance for national security.
  • Company: Be part of a cutting-edge company focused on improving essential systems.
  • Benefits: Enjoy remote work options, overtime benefits, and a culture of continual improvement.
  • Why this job: Make a real impact while learning and innovating in a supportive environment.
  • Qualifications: Enthusiasm for software development and a willingness to learn are key; experience is a plus.
  • Other info: Participate in a 24/7 on-call rota with additional allowances.

The predicted salary is between 48000 - 72000 £ per year.

Site Reliability Engineer – NS London – DV

Job Details

Site Reliability Engineering is a rapidly growing concept in industry, focused on improving the quality, reliability, and performance of essential systems. As a Site Reliability Engineer, you\’ll be part of a team delivering these benefits to a key national security customer. We are building our team and tools, aiming to create a culture of continual improvement to revolutionize how our customer’s systems are built and maintained. This role combines operational product support with software engineering to develop applications that monitor the health of our systems. The SRE team operates within a broader program central to the customer mission.

The role holder:

As an SRE, your work will replace traditional operations tasks with software and systems engineering solutions, automating processes to minimize manual work like incident tickets and on-call duties, aiming for less than half of the team\’s time spent on manual tasks. You should be enthusiastic about learning, experimenting, and developing tools to enhance application health and reliability to support the customer mission.

Role accountabilities include:

  • Supporting and maintaining essential services that support core mission applications, proactively enhancing their availability, performance, and stability.
  • Participating in a 24/7 on-call rota to support critical production systems outside business hours, with additional allowances and overtime benefits.
  • Finding innovative solutions and automating repetitive tasks. Collaborating with development teams to advise on best practices in system design and building.
  • Designing and deploying monitoring solutions, creating custom tools as needed to provide comprehensive insights and demonstrate team improvements daily.

Competencies

  • Experience in the following areas is desirable, but enthusiasm and willingness to learn are highly valued. Training and on-the-job development will help fill any knowledge gaps.

o Software development in web technologies and object-oriented programming
o Database technologies such as Oracle SQL, MongoDB, Postgres
o Linux and Windows command-line proficiency, e.g., Bash and PowerShell

o Monitoring large systems with tools like Grafana, Prometheus, ELK, Splunk
o Experience working in Agile teams and familiarity with tools like Atlassian
o Diagnosing and troubleshooting application issues causing outages
o Troubleshooting skills across the stack
o Understanding of ITIL principles
o Microservices architecture, Docker, and container platforms like OpenShift and Kubernetes

  • Awareness of current technology trends to adopt new tools.
  • Understanding the relationship between software and infrastructure, focusing on scalability and resilience, and optimizing infrastructure deployment.

#J-18808-Ljbffr

Site Reliability Engineer London DV cleared employer: Evolution Recruitment Solutions

As a Site Reliability Engineer in London, you will join a dynamic team dedicated to enhancing the reliability and performance of critical national security systems. Our company fosters a culture of continuous improvement, offering extensive training and development opportunities to help you grow your skills while working on innovative solutions. With competitive benefits, including additional allowances for on-call duties, we prioritise employee well-being and engagement, making us an exceptional employer in the tech industry.
Evolution Recruitment Solutions

Contact Detail:

Evolution Recruitment Solutions Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer London DV cleared

✨Tip Number 1

Familiarise yourself with the specific tools and technologies mentioned in the job description, such as Grafana, Prometheus, and Docker. Having hands-on experience or even personal projects showcasing your skills with these tools can set you apart from other candidates.

✨Tip Number 2

Engage with the Site Reliability Engineering community through forums, webinars, or local meetups. Networking with professionals in the field can provide insights into the role and may even lead to referrals within our company.

✨Tip Number 3

Demonstrate your problem-solving skills by preparing examples of how you've automated processes or improved system reliability in previous roles. Be ready to discuss these experiences during interviews to showcase your practical knowledge.

✨Tip Number 4

Stay updated on current technology trends relevant to Site Reliability Engineering. Being able to discuss recent advancements or tools in your interview will show your enthusiasm for the field and your commitment to continual learning.

We think you need these skills to ace Site Reliability Engineer London DV cleared

Software Development in Web Technologies
Object-Oriented Programming
Database Technologies (Oracle SQL, MongoDB, PostgreSQL)
Linux and Windows Command-Line Proficiency (Bash, PowerShell)
Monitoring Tools (Grafana, Prometheus, ELK, Splunk)
Agile Methodologies
Troubleshooting Skills Across the Stack
Understanding of ITIL Principles
Microservices Architecture
Docker and Container Platforms (OpenShift, Kubernetes)
Automation of Processes
Collaboration with Development Teams
Designing and Deploying Monitoring Solutions
Enthusiasm for Learning and Experimentation
Understanding of Scalability and Resilience in Infrastructure

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in software development, system reliability, and automation. Emphasise any specific tools or technologies mentioned in the job description, such as Docker, Kubernetes, or monitoring tools like Grafana.

Craft a Compelling Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Discuss your passion for improving system reliability and your willingness to learn new technologies. Mention any relevant projects or experiences that demonstrate your problem-solving skills.

Showcase Your Technical Skills: Include a section in your application that lists your technical skills, particularly those related to web technologies, database management, and command-line proficiency. Be specific about your experience with tools like Prometheus or ELK.

Highlight Team Collaboration: Since the role involves working closely with development teams, mention any past experiences where you collaborated effectively in an Agile environment. Highlight your ability to communicate best practices in system design and your contributions to team projects.

How to prepare for a job interview at Evolution Recruitment Solutions

✨Show Your Enthusiasm for Learning

As a Site Reliability Engineer, a willingness to learn is crucial. Be prepared to discuss how you've approached learning new technologies or skills in the past, and express your excitement about the opportunity to grow within the role.

✨Demonstrate Problem-Solving Skills

Expect to be asked about specific challenges you've faced in previous roles. Prepare examples that showcase your troubleshooting abilities and how you’ve automated processes to improve efficiency, as this aligns with the job's focus on minimising manual tasks.

✨Familiarise Yourself with Relevant Tools

Make sure you have a good understanding of the tools mentioned in the job description, such as Grafana, Prometheus, and Docker. Being able to discuss your experience with these tools will show that you're well-prepared and knowledgeable about the role.

✨Understand the Importance of Collaboration

Collaboration with development teams is key in this role. Be ready to talk about your experiences working in Agile environments and how you’ve contributed to team success, particularly in advising on best practices in system design.

Site Reliability Engineer London DV cleared
Evolution Recruitment Solutions
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>