Senior Site Reliability Engineer, Production Engineering in London
Senior Site Reliability Engineer, Production Engineering

Senior Site Reliability Engineer, Production Engineering in London

London Full-Time 36000 - 60000 ÂŁ / year (est.) No home office possible
Go Premium
C

At a Glance

  • Tasks: Lead technical projects to enhance system reliability and performance across diverse environments.
  • Company: Join Cisco, a leader in digital experience assurance and innovative technology.
  • Benefits: Competitive salary, health benefits, and opportunities for professional growth.
  • Why this job: Make a real impact in the AI era with cutting-edge technology and collaborative teams.
  • Qualifications: Expertise in Kubernetes, software development, and cloud technologies required.
  • Other info: Dynamic work environment with limitless opportunities for career advancement.

The predicted salary is between 36000 - 60000 ÂŁ per year.

Join to apply for the Senior Site Reliability Engineer, Production Engineering role at Cisco.

Meet the Team:

Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues before they impact end‐user experiences. ThousandEyes is deeply integrated across the entire Cisco technology portfolio and beyond, helping customers deploy at scale while also delivering AI‐powered assurance insights within Cisco's leading Networking, Security, Collaboration, and Observability portfolios.

Your Impact:

  • Technical Leadership & Collaboration: Forge strong partnerships with cross‐functional stakeholders to identify requirements and deliver solutions that address project and departmental objectives.
  • Solution Design & Deployment: Architect and implement small to mid‐size or moderately complex solutions that elevate reliability, availability, latency, and performance across diverse environments and customer segments.
  • Automation & Service Reliability: Combine expertise in design, automation, deployment, and coding to enhance system reliability for new and existing platforms, tailoring approaches to regional, national, or customer‐specific needs.
  • High Availability & Disaster Recovery: Develop and validate automated high‐availability and disaster‐recovery mechanisms, ensuring systems are robust, scalable, and support rapid velocity in delivery. Take part in regular disaster‐recovery drills.
  • Capacity Planning & Reporting: Analyze resource usage and produce actionable reports to forecast and address capacity constraints, supporting proactive decision‐making and operational excellence.
  • Monitoring & Tooling: Design, build, and deploy tools that deliver comprehensive visibility into infrastructure performance and reliability. Automate key platform functions for efficiency and resilience.
  • Incident Response & Continuous Improvement: Monitor production environments, collaborate with Development and Operations to diagnose issues, and develop monitoring tools to preemptively identify and resolve problems. Serve as on‐call Site Reliability Engineer (SRE), lead post‐mortems, and deliver clear root‐cause analyses.
  • Security & Compliance: Embed strong security controls in architectural design, collaborate with security teams to enhance safeguards, and contribute to incident response efforts as needed. Work closely with various teams specializing in security to ensure platform components and infrastructure are secure at the highest possible level.

Minimum Qualifications:

  • Expert‐level knowledge of Kubernetes and its ecosystem.
  • Proficiency in software development with languages such as Python or Go.
  • In‐depth knowledge of cloud providers, preferably AWS.
  • Solid conceptual and practical knowledge in Web technologies, Networking, and Linux.
  • Knowledge of Site Reliability principles: Incident Response, Change Management, Distributed Systems, Deployment Strategies, and SLOs.

Preferred Qualifications:

  • Familiarity with best practices for operating a large‐scale, highly available enterprise platform.
  • 5+ years of experience in a related role.
  • Proven ability to build and implement scalable and well‐tested solutions.
  • Excellent communication and documentation skills.
  • Strong sense of ownership, drive, and attention to detail.

Why Cisco?

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Seniority level: Mid‐Senior level

Employment type: Full‐time

Job function: Engineering and Information Technology, Software Development

Senior Site Reliability Engineer, Production Engineering in London employer: Cisco

Cisco is an exceptional employer that fosters a collaborative and innovative work culture, empowering employees to make a significant impact in the AI era. With a strong focus on professional growth, Cisco offers extensive opportunities for skill development and career advancement, all while working on cutting-edge technology that shapes the future of digital experiences. Located in a dynamic environment, employees benefit from a supportive team atmosphere and the chance to contribute to meaningful projects that resonate globally.
C

Contact Detail:

Cisco Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Site Reliability Engineer, Production Engineering in London

✨Tip Number 1

Network like a pro! Reach out to current Cisco employees on LinkedIn or at industry events. A friendly chat can give us insider info and maybe even a referral, which can really boost our chances.

✨Tip Number 2

Show off your skills! Prepare a portfolio or a GitHub repository showcasing your projects, especially those related to Kubernetes, Python, or cloud solutions. This gives us a chance to demonstrate our expertise beyond the CV.

✨Tip Number 3

Ace the interview by practising common SRE scenarios. Think about how we would handle incidents or improve system reliability. Being ready with real-life examples will help us stand out as problem solvers.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure our application gets seen by the right people. Plus, it shows we’re genuinely interested in joining the Cisco team.

We think you need these skills to ace Senior Site Reliability Engineer, Production Engineering in London

Kubernetes
Python
Go
AWS
Web Technologies
Networking
Linux
Site Reliability Principles
Incident Response
Change Management
Distributed Systems
Deployment Strategies
SLOs
Communication Skills
Attention to Detail

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Senior Site Reliability Engineer role. Highlight your expertise in Kubernetes, cloud providers like AWS, and any relevant software development experience. We want to see how you can bring value to our team!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to tell us why you're passionate about Site Reliability Engineering and how your background aligns with our mission at Cisco. Be genuine and let your personality come through – we love seeing the real you!

Showcase Your Projects: If you've worked on any projects that demonstrate your skills in automation, incident response, or high-availability systems, make sure to mention them. We’re keen to see how you’ve tackled challenges and what solutions you’ve implemented in the past.

Apply Through Our Website: We encourage you to apply directly through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, it shows us you’re serious about joining our team!

How to prepare for a job interview at Cisco

✨Know Your Tech Inside Out

Make sure you brush up on your Kubernetes knowledge and be ready to discuss your experience with cloud providers like AWS. Be prepared to dive deep into your technical skills, especially around automation and service reliability.

✨Showcase Your Problem-Solving Skills

Prepare examples of how you've tackled complex issues in production environments. Think about specific incidents where you led post-mortems or developed monitoring tools that improved system reliability.

✨Communicate Clearly and Confidently

Since strong communication is key, practice explaining your past projects and solutions in a clear and concise manner. Use the STAR method (Situation, Task, Action, Result) to structure your responses.

✨Demonstrate Your Collaborative Spirit

Cisco values teamwork, so be ready to discuss how you've worked with cross-functional teams. Highlight any partnerships you've forged to deliver solutions that meet project objectives, showcasing your ability to collaborate effectively.

Senior Site Reliability Engineer, Production Engineering in London
Cisco
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

C
  • Senior Site Reliability Engineer, Production Engineering in London

    London
    Full-Time
    36000 - 60000 ÂŁ / year (est.)
  • C

    Cisco

    10000+
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>