Job Board

Companies

Cisco Systems Inc

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

London Full-Time 48000 - 72000 £ / year (est.) Home office (partial)

At a Glance

Tasks: Design and manage large-scale distributed systems, enhancing reliability and performance.
Company: Join Cisco ThousandEyes, a leader in Digital Experience Assurance.
Benefits: Competitive salary, health benefits, and opportunities for professional growth.
Why this job: Make a real impact on digital experiences with cutting-edge technology.
Qualifications: Expertise in Kubernetes, Python or Go, and cloud technologies required.
Other info: Collaborative environment with limitless growth opportunities.

The predicted salary is between 48000 - 72000 £ per year.

Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don’t own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues - before they impact end-user experiences. ThousandEyes is deeply integrated across the entire Cisco technology portfolio and beyond, helping customers deploy at scale while also delivering AI-powered assurance insights within Cisco’s leading Networking, Security, Collaboration, and Observability portfolios.

We are seeking a skilled Senior Site Reliability Engineer (SRE) in Production Engineering with a strong background in SaaS and operations. You will design and manage large-scale, highly available distributed systems in the cloud, collaborating directly with application development teams to enhance the reliability, performance, and security of our platform.

Technical Leadership & Collaboration: Forge strong partnerships with cross-functional stakeholders to identify requirements and deliver solutions that address project and departmental objectives.
Solution Design & Deployment: Architect and implement small to mid-size or moderately complex solutions that elevate reliability, availability, latency, and performance across diverse environments and customer segments.
Automation & Service Reliability: Combine expertise in design, automation, deployment, and coding to enhance system reliability for new and existing platforms, tailoring approaches to regional, national, or customer-specific needs.
High Availability & Disaster Recovery: Develop and validate automated high-availability and disaster recovery mechanisms, ensuring systems are robust, scalable, and support rapid velocity in delivery. Take part in regular disaster recovery drills.
Capacity Planning & Reporting: Analyze resource usage and produce actionable reports to forecast and address capacity constraints, supporting proactive decision-making and operational excellence.
Monitoring & Tooling: Design, build, and deploy tools that deliver comprehensive visibility into infrastructure performance and reliability. Automate key platform functions for efficiency and resilience.
Incident Response & Continuous Improvement: Monitor production environments, collaborate with Development and Operations to diagnose issues, and develop monitoring tools to preemptively identify and resolve problems. Serve as on-call Site Reliability Engineer (SRE), lead post-mortems, and deliver clear root cause analyses.
Security & Compliance: Embed strong security controls in architectural design, collaborate with security teams to enhance safeguards, and contribute to incident response efforts as needed. Work closely with various teams specializing in security, to ensure various platform components and infrastructure is secure at the highest possible level.

Minimum Qualifications:

Expert-level knowledge of Kubernetes and its ecosystem.
Proficiency in software development with languages such as Python or Go.
In-depth knowledge of cloud providers, preferably AWS.
Solid conceptual and practical knowledge in Web technologies, Networking, and Linux.
Knowledge of Site Reliability principles: Incident Response, Change Management, Distributed Systems, Deployment Strategies, and SLOs.

Preferred Qualifications:

Familiarity with best practices for operating a large-scale, highly available enterprise platform.
5+ years of experience in a related role.
Proven ability to build and implement scalable and well-tested solutions.
Excellent communication and documentation skills.
Strong sense of ownership, drive, and attention to detail.

At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint. Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere. We are Cisco, and our power starts with you.

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London employer: Cisco Systems Inc

Cisco ThousandEyes is an exceptional employer that fosters a collaborative and innovative work culture, empowering employees to make a significant impact in the realm of digital experience assurance. With a strong focus on professional growth, Cisco offers extensive opportunities for skill development and career advancement, all while working with cutting-edge technology in a supportive environment. Located in a vibrant tech hub, employees benefit from a dynamic atmosphere that encourages creativity and teamwork, making it an ideal place for those seeking meaningful and rewarding employment.

Contact Detail:

Cisco Systems Inc Recruiting Team

View Cisco Systems Inc Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

✨Tip Number 1

Network like a pro! Attend industry meetups, webinars, or tech conferences where you can connect with folks from Cisco and other companies. Building relationships can open doors that a CV just can't.

✨Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to Kubernetes, cloud systems, or automation. This gives potential employers a taste of what you can do.

✨Tip Number 3

Prepare for the interview like it’s a big game! Research Cisco ThousandEyes, understand their products, and be ready to discuss how your experience aligns with their needs. Tailor your answers to highlight your SRE expertise.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the team at Cisco.

We think you need these skills to ace Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

Kubernetes

Python

AWS

Web Technologies

Networking

Linux

Site Reliability Principles

Incident Response

Change Management

Distributed Systems

Deployment Strategies

SLOs

Automation

Monitoring Tools

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that match the Senior Site Reliability Engineer role. Highlight your expertise in Kubernetes, cloud providers like AWS, and any relevant SaaS experience to catch our eye!

Craft a Compelling Cover Letter: Use your cover letter to tell us why you’re passionate about Site Reliability Engineering. Share specific examples of how you've improved system reliability or performance in past roles – we love hearing about real-world impacts!

Showcase Your Technical Skills: Don’t hold back on showcasing your technical prowess! Mention your proficiency in Python or Go, and any experience with automation and monitoring tools. We want to see how you can contribute to our mission of delivering flawless digital experiences.

Apply Through Our Website: We encourage you to apply directly through our website for the best chance of getting noticed. It’s the easiest way for us to keep track of your application and ensure it reaches the right team!

How to prepare for a job interview at Cisco Systems Inc

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of Kubernetes, Python, and AWS. Be ready to discuss how you've used these technologies in past projects, especially in relation to building scalable systems and ensuring high availability.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled incidents in production environments. Highlight your experience with incident response and how you've contributed to continuous improvement in system reliability.

✨Understand the Company’s Vision

Familiarise yourself with Cisco ThousandEyes and its role in digital experience assurance. Be ready to discuss how your skills can contribute to their mission of delivering flawless digital experiences and enhancing system performance.

✨Emphasise Collaboration and Communication

Since this role involves working closely with cross-functional teams, be prepared to talk about your experience in collaborating with different stakeholders. Share examples that demonstrate your ability to communicate technical concepts clearly and effectively.

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

Cisco Systems Inc

Location: London

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

London

Full-Time

48000 - 72000 £ / year (est.)
Cisco Systems Inc

10,000+

View Cisco Systems Inc Profile

Similar positions in other companies

UK’s top job board for Gen Z

Discover now

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

At a Glance

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London employer: Cisco Systems Inc

StudySmarter Expert Advice 🤫

✨Tip Number 1

✨Tip Number 2

✨Tip Number 3

✨Tip Number 4

We think you need these skills to ace Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

Some tips for your application 🫡

How to prepare for a job interview at Cisco Systems Inc

Senior Site Reliability Engineer, Production Engineering - Cisco ThousandEyes in London

Land your dream job quicker with Premium

Similar positions in other companies

UK’s top job board for Gen Z