Site Reliability Engineer - NS London

London | Full-Time | £43,200 - £72,000 / year (est.) | No home office possible

At a Glance

  • Tasks: Join our team to enhance system reliability and performance through innovative software solutions.
  • Company: BAE Systems Digital Intelligence leads in cyber defence, supporting governments and businesses globally.
  • Benefits: Enjoy hybrid working, flexible hours, and additional pay for on-call duties.
  • Why this job: Be part of a mission-driven culture that values learning and innovation in tech.
  • Qualifications: Excitement to learn new technologies is key; experience in software development is a plus.
  • Other info: Applicants must be eligible to work in the UK and hold an active eDV security clearance.

The predicted salary is between £43,200 and £72,000 per year.

BAE Systems Digital Intelligence is home to 4,500 digital, cyber and intelligence experts. We work collaboratively across 10 countries to collect, connect and understand complex data, so that governments, nation states, armed forces and commercial businesses can unlock digital advantage in the most demanding environments.

Site Reliability Engineering is a rapidly growing concept in industry, with a remit to drive the quality, reliability and performance of essential systems. As a Site Reliability Engineer you’ll be part of a team in BAE Systems at the forefront of this, delivering these benefits to a key national security customer. We are in the process of building our team and tools, and with your help will create a culture of continual improvement to revolutionise the way our customer’s systems are built and maintained. This role blends operational product support with software engineering to create applications to understand the overall health of our systems. The SRE team sits within a wider programme at the core of the customer mission.

Role holder

As an SRE, fundamentally you will be doing work that has historically been done by an operations team, but using software and systems engineering expertise to substitute automation for human labour, with the objective of limiting traditional manual operations work (incident tickets, on-call etc.) to no more than half of the SRE team’s time (and aiming for considerably less). You will have an enthusiasm to learn and experiment, to develop tools to understand application health and improve their reliability to support the customer mission.

Role accountabilities

  • Supporting and maintaining essential services that support core mission applications, proactively enhancing their availability, performance and stability.
  • Being part of the 24/7 on-call rota, supporting critical production systems out of business hours, for which additional on-call allowances and overtime benefits will be paid.
  • Finding innovative solutions to problems rather than undertaking repetitive work, automating everything you can. You will work alongside development teams, advising them of good practice in how to design and build systems, learning from what you know works well.
  • You will design and deploy monitoring products, creating bespoke tools where required, to provide comprehensive and intelligent observations to meet the customer requirements and demonstrate the improvements the team are making on a daily basis.
  • You will be well versed in the relationship between software and infrastructure, understanding the characteristics of systems that enable them to be scalable and resilient to failure, and how to get the best out of the infrastructure they are deployed to.
  • Participating in the wider DevOps/SRE community within the organisation.

Competencies

  • Software development in web technologies and object-oriented programming
  • Database technologies such as Oracle SQL, MongoDB, Postgres
  • Familiarity with Linux and Windows command lines, e.g. Bash and PowerShell
  • Monitoring large systems using technologies such as Grafana, Prometheus, ELK, Splunk
  • Experience of working in Agile teams, and the tooling that supports it, e.g. Atlassian
  • Diagnosing and troubleshooting application issues resulting in service outages
  • Troubleshooting skills across different levels of the stack
  • Understanding of ITIL
  • Microservices architectures, Docker and container platforms such as OpenShift, Kubernetes

Security Clearance

Due to the nature of our work, candidates must already hold an active eDV clearance when applying for this role.

Life at BAE Systems Digital Intelligence

We are embracing Hybrid Working. This means you and your colleagues may be working in different locations, such as from home, another BAE Systems office or client site, some or all of the time, and work might be going on at different times of the day. By embracing technology, we can interact, collaborate and create together, even when we’re working remotely from one another. Hybrid Working allows for increased flexibility in when and where we work, helping us to balance our work and personal life more effectively, and enhance well-being.

Division overview: Capabilities

At BAE Systems Digital Intelligence, we pride ourselves on being a leader in the cyber defence industry, and Capabilities is the engine that keeps the business moving forward. It is the largest area of Digital Intelligence, containing our Engineering, Consulting and Project Management teams that design and implement the defence solutions and digital transformation projects that make us a globally recognised brand in both the public and private sector. As a member of the Capabilities team, you will be creating and managing the solutions that earn us our place in an ever-changing digital world. We all have a role to play in defending our clients, and this is yours.

Site Reliability Engineer - NS London employer: BAE Systems

BAE Systems Digital Intelligence is an exceptional employer, offering a dynamic work culture that fosters collaboration and innovation among its 4,500 experts across 10 countries. With a strong commitment to employee growth through training and knowledge sharing, the company embraces hybrid working to enhance work-life balance while ensuring that every team member plays a vital role in national security. Join us to be part of a forward-thinking team that values diverse perspectives and is dedicated to revolutionising the way critical systems are built and maintained.

Contact Details:

BAE Systems Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Site Reliability Engineer - NS London

✨Tip Number 1

Familiarise yourself with the specific tools and technologies mentioned in the job description, such as Grafana, Prometheus, and Docker. Having hands-on experience or projects showcasing your skills with these tools can set you apart from other candidates.

✨Tip Number 2

Engage with the DevOps and SRE communities online. Participate in forums, attend webinars, or join relevant groups on platforms like LinkedIn. This not only helps you learn but also shows your enthusiasm for the field, which is highly valued by employers.

✨Tip Number 3

Demonstrate your problem-solving skills by preparing examples of past experiences where you automated processes or improved system reliability. Be ready to discuss these during interviews to showcase your proactive approach to challenges.

✨Tip Number 4

Research BAE Systems Digital Intelligence and their projects related to national security. Understanding their mission and values will help you tailor your conversations and show that you're genuinely interested in contributing to their goals.

We think you need these skills to ace Site Reliability Engineer - NS London

Software Development in Web Technologies
Object-Oriented Programming
Database Technologies (Oracle SQL, MongoDB, Postgres)
Linux and Windows Command Line Proficiency (Bash, PowerShell)
Monitoring Large Systems (Grafana, Prometheus, ELK, Splunk)
Agile Methodologies and Tools (e.g., Atlassian)
Diagnosing and Troubleshooting Application Issues
Troubleshooting Skills Across Different Levels of the Stack
Understanding of ITIL Framework
Microservices Architectures
Docker and Container Platforms (OpenShift, Kubernetes)
Automation Skills
Problem-Solving Skills
Enthusiasm for Learning New Technologies
Ability to Work in a Hybrid Environment

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in Site Reliability Engineering, software development, and any specific technologies mentioned in the job description. Use keywords from the job posting to ensure your application stands out.

Craft a Compelling Cover Letter: Write a cover letter that showcases your enthusiasm for the role and the company. Mention specific projects or experiences that demonstrate your problem-solving skills and ability to work with both software and infrastructure.

Showcase Your Technical Skills: In your application, clearly outline your technical skills related to web technologies, database management, and monitoring systems. Provide examples of how you've used these skills in past roles to improve system reliability.

Highlight Your Teamwork Experience: Since the role involves collaboration with development teams, emphasise your experience working in Agile environments. Share examples of how you’ve contributed to team success and improved processes through collaboration.

How to prepare for a job interview at BAE Systems

✨Show Your Passion for Learning

As a Site Reliability Engineer, enthusiasm for learning new technologies is crucial. Be prepared to discuss how you've tackled challenging problems in the past and your approach to continuous improvement.

✨Demonstrate Your Technical Skills

Familiarise yourself with the key technologies mentioned in the job description, such as Linux command lines, monitoring tools like Grafana and Prometheus, and database technologies. Be ready to provide examples of how you've used these in previous roles.

✨Highlight Your Problem-Solving Abilities

The role requires innovative solutions to complex issues. Prepare to share specific instances where you've automated processes or improved system reliability, showcasing your ability to think outside the box.

✨Understand the Company Culture

BAE Systems values inclusion and collaboration. Research their culture and be ready to discuss how you can contribute to a diverse team environment, especially in a hybrid working model.
