At a Glance
- Tasks: Support critical services and enhance system reliability while collaborating with development teams.
- Company: Join a dynamic team focused on national security and innovative technology solutions.
- Benefits: Enjoy competitive pay, overtime opportunities, and a chance to work in central London.
- Why this job: Be part of a mission-driven culture that values continuous improvement and cutting-edge tech.
- Qualifications: Experience in software development, database technologies, and familiarity with monitoring tools required.
- Other info: Must have UK Enhanced DV clearance; 24/7 on-call participation is essential.
The predicted salary is between 36000 - 60000 £ per year.
UK Enhanced DV clearance essential
Start: ASAP
Duration: initial 12-month contract
Pay: inside IR35, negotiable
Location: full time on site in central London (5-days in office)
Role Description:
In this role you’ll be at the forefront of delivering enhanced reliability, performance, and quality to a key national security customer. Joining a growing team, you’ll help create a culture of continuous improvement and play a pivotal role in revolutionising how systems are developed and supported. This role combines operational support with software engineering, allowing you to design tools and applications that monitor and improve system health. As part of a wider programme, you'll be integral to supporting the customer's critical mission.
Key Responsibilities:
- Support and maintain critical services, enhancing the availability, performance, and stability of core mission applications.
- Participate in the 24/7 on-call rota (one week in 5 with overtime rate TBC), supporting production systems outside business hours, with additional on-call allowances and overtime benefits.
- Focus on automation to reduce manual operations work (e.g. incident tickets, on-call) to improve efficiency.
- Collaborate with development teams, advising on best practices for system design and implementation.
- Design and deploy monitoring tools to provide intelligent insights into system health, customising tools where necessary.
- Understand the relationship between software and infrastructure, ensuring systems are scalable and resilient to failure.
- Participate in the wider DevOps/SRE community, sharing knowledge and best practices across the organisation.
Key Skills & Experience:
- Experience or enthusiasm for software development in web technologies and object-oriented programming.
- Familiarity with database technologies such as Oracle SQL, MongoDB, or Postgres.
- Proficiency with Linux and Windows command lines (e.g. Bash, PowerShell).
- Experience with monitoring large systems using tools like Grafana, Prometheus, ELK, and Splunk.
- Knowledge of Agile methodologies and tools like Atlassian.
- Strong troubleshooting skills across various levels of the application stack.
- Familiarity with ITIL processes.
- Experience with microservices architectures and container platforms like Docker, Kubernetes, and OpenShift.
- A passion for learning new technologies and solving complex problems.
- Awareness of emerging tech trends and tools in the SRE space.
Interested in this role? Please apply directly to this advert with an updated CV to be considered for the role.
Site Reliability Engineer (City of London) employer: Stott and May
Contact Detail:
Stott and May Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (City of London)
✨Tip Number 1
Familiarise yourself with the specific tools and technologies mentioned in the job description, such as Grafana, Prometheus, and Docker. Having hands-on experience or even personal projects showcasing these skills can set you apart during discussions.
✨Tip Number 2
Network with professionals in the Site Reliability Engineering field, especially those who have experience in national security sectors. Engaging in relevant online communities or attending meetups can provide insights and potentially lead to referrals.
✨Tip Number 3
Prepare to discuss your troubleshooting experiences in detail. Be ready to share specific examples of how you've resolved complex issues in previous roles, as this will demonstrate your problem-solving abilities and technical expertise.
✨Tip Number 4
Showcase your passion for continuous improvement and learning. Be prepared to discuss any recent technologies or methodologies you've explored, as well as how you’ve applied them to enhance system reliability or performance in past projects.
We think you need these skills to ace Site Reliability Engineer (City of London)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience and skills that align with the Site Reliability Engineer role. Emphasise your software development experience, familiarity with monitoring tools, and any knowledge of Agile methodologies.
Craft a Strong Cover Letter: Write a cover letter that showcases your enthusiasm for the role and the company. Mention specific projects or experiences that demonstrate your ability to enhance system reliability and performance, as well as your passion for continuous improvement.
Highlight Relevant Skills: In your application, clearly outline your proficiency with Linux and Windows command lines, database technologies, and any experience with microservices architectures. This will help you stand out as a candidate who meets the key skills and experience required.
Showcase Problem-Solving Abilities: Provide examples in your application that illustrate your strong troubleshooting skills and your ability to solve complex problems. This is crucial for a role that involves supporting critical services and ensuring system health.
How to prepare for a job interview at Stott and May
✨Showcase Your Technical Skills
Be prepared to discuss your experience with web technologies, object-oriented programming, and database technologies. Highlight specific projects where you've used tools like Grafana or Prometheus to monitor systems.
✨Demonstrate Problem-Solving Abilities
Expect to face scenario-based questions that assess your troubleshooting skills. Prepare examples of complex problems you've solved in previous roles, particularly those related to system reliability and performance.
✨Emphasise Collaboration and Communication
Since the role involves working closely with development teams, be ready to discuss how you've collaborated in the past. Share experiences where you advised on best practices for system design and implementation.
✨Express Your Passion for Continuous Learning
The company values a culture of continuous improvement. Talk about your enthusiasm for learning new technologies and staying updated on emerging trends in the SRE space, as this will resonate well with the interviewers.