Director of Site Reliability Engineering, NADP Bangalore, India
Director of Site Reliability Engineering, NADP Bangalore, India

Director of Site Reliability Engineering, NADP Bangalore, India

Full-Time 43200 - 72000 £ / year (est.) No home office possible
T

At a Glance

  • Tasks: Lead and inspire a team of site reliability engineers to ensure system reliability and security.
  • Company: Join ThousandEyes, a Cisco company, revolutionizing cloud and network visibility.
  • Benefits: Enjoy a collaborative culture, professional growth opportunities, and the chance to work on cutting-edge technology.
  • Why this job: Shape the future of cloud infrastructure while driving innovation and operational excellence.
  • Qualifications: Deep understanding of SRE principles and extensive experience in cloud and ML/AI infrastructure required.
  • Other info: Diverse candidates are encouraged to apply; we value potential over perfection.

The predicted salary is between 43200 - 72000 £ per year.

Director of Site Reliability Engineering, NADP

Bangalore, India or Lisbon, Portugal or London, UK

Who We Are

The name ThousandEyes was born from two big ideas: the power to see things not ordinarily possible and the ability to collect insights from a multitude of vantage points. As the world continues its digital transformation and relies more on cloud services and the Internet, the “network,” which is now both public and private, has become a black box our customers cannot see or understand.

Our Internet and cloud intelligence platform delivers the only collectively powered real-time view of the Internet and private networks, cloud, and SaaS platforms, helping enterprises and service providers identify problems before they impact revenue, damage brand reputation, or halt employee productivity.

In August 2020, Cisco Systems completed the acquisition of ThousandEyes, which now forms the ThousandEyes Business Unit within the Cisco Networking Business Group and is the Network Assurance solution for Cisco across the Cisco Networking Cloud and Cisco Security Cloud. ThousandEyes is also a foundational component of Cisco’s growing Full-Stack Observability (“FSO”) business.

About the role

As the Director of Site Reliability Engineering, Network Assurance Data Platform you will play a critical role in shaping and executing our cloud and big data, ML/AI infrastructure strategy, driving operational excellence, and ensuring the highest levels of system reliability and security. You will lead teams of talented engineers and collaborate closely with cross-functional teams, including software development, operations, and security, to design, build, and maintain our infrastructure, cloud platforms, and security practices, operating at a multi-region scale.

What You’ll Do

  • Lead and inspire a talented team of site reliability engineers, fostering a culture of innovation, collaboration, and excellence in development and operation of infrastructure platforms
  • Drive the strategic vision for the development, implementation, and management of cloud, data, ML/AI platforms.
  • Collaborate closely with cross-functional teams, including development, product management, and security to define and implement reliable, secure, and scalable infrastructure platforms
  • Provide oversight and direction in the development and operation of cloud platforms, ensuring high-quality, scalable, and reliable solutions that meet customer needs
  • Drive operational excellence in operations and security processes
  • Mentor and develop engineering talent, fostering a culture of continuous learning and professional growth within the site reliability engineering group

Qualifications

  • You have a deep understanding of the distributed systems design, cloud technology and their components, dependencies, and code that define infrastructure
  • You possess a deep understanding of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts
  • Extensive hands-on experience building cloud, big data and/or ML/AI infrastructure (e.g. EMR, Airflow, Comet ML, AWS SageMaker, Spark, etc)
  • Extensive hands-on experience operating mission-critical services in production environments which are required to have high availability and reliability.
  • Proven ability to think strategically and align technical initiatives with business objectives
  • Can provide a strong technical vision for your teams and ensure consistent delivery of objectives
  • Have experience formulating a team’s technical strategy and roadmap; you’ve collaborated and partnered effectively with several other teams to execute shared goals
  • Understand how to balance tactical needs with strategic growth and quality-based initiatives that can span multiple quarters
  • Proven site reliability engineering management experience leading multiple teams

Cisco values the perspectives and skills that emerge from employees with diverse backgrounds. That’s why Cisco is expanding the boundaries of discovering top talent by not only focusing on candidates with educational degrees and experience but also placing more emphasis on unlocking potential. We believe that everyone has something to offer and that diverse teams are better equipped to solve problems, innovate, and create a positive impact.

We encourage you to apply even if you do not believe you meet every single qualification . Not all strong candidates will meet every single qualification. Research shows that people from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy. We urge you not to prematurely exclude yourself and to apply if you’re interested in this work.

Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis. Cisco will consider for employment, on a case by case basis, qualified applicants with arrest and conviction records.

#J-18808-Ljbffr

Director of Site Reliability Engineering, NADP Bangalore, India employer: ThousandEyes

At ThousandEyes, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. Located in Bangalore, India, our team enjoys access to cutting-edge technology and the opportunity to lead impactful projects in cloud and big data infrastructure. We are committed to employee growth, providing mentorship and continuous learning opportunities, ensuring that every team member can thrive and contribute to our mission of delivering unparalleled network visibility.
T

Contact Detail:

ThousandEyes Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Director of Site Reliability Engineering, NADP Bangalore, India

Tip Number 1

Familiarize yourself with the latest trends in cloud technology and SRE principles. Understanding distributed systems design and having hands-on experience with tools like AWS SageMaker or Spark will give you a competitive edge.

Tip Number 2

Showcase your leadership skills by discussing past experiences where you led teams in high-pressure environments. Highlight how you fostered collaboration and innovation within your team to achieve operational excellence.

Tip Number 3

Prepare to discuss your strategic vision for cloud and big data infrastructure. Be ready to explain how you align technical initiatives with business objectives, as this is crucial for the role.

Tip Number 4

Network with professionals in the field of Site Reliability Engineering. Engaging with others can provide insights into the company culture and expectations, which can be beneficial during interviews.

We think you need these skills to ace Director of Site Reliability Engineering, NADP Bangalore, India

Deep understanding of distributed systems design
Expertise in cloud technology and its components
Proficiency in SRE principles (monitoring, alerting, error budgets, fault analysis)
Extensive hands-on experience with cloud, big data, and ML/AI infrastructure (e.g. EMR, Airflow, Comet ML, AWS SageMaker, Spark)
Experience operating mission-critical services in production environments
Strategic thinking and alignment of technical initiatives with business objectives
Strong technical vision and consistent delivery of objectives
Ability to formulate technical strategy and roadmap for teams
Collaboration and partnership with cross-functional teams
Balancing tactical needs with strategic growth and quality-based initiatives
Proven management experience in site reliability engineering leading multiple teams
Mentoring and developing engineering talent
Fostering a culture of continuous learning and professional growth

Some tips for your application 🫡

Understand the Role: Before applying, make sure you fully understand the responsibilities and qualifications required for the Director of Site Reliability Engineering position. Tailor your application to highlight relevant experiences that align with the job description.

Highlight Relevant Experience: In your CV and cover letter, emphasize your hands-on experience with cloud technologies, big data, and ML/AI infrastructure. Provide specific examples of how you've led teams and driven operational excellence in previous roles.

Showcase Leadership Skills: Since this role involves leading a team, be sure to include examples of your leadership style and how you've fostered a culture of innovation and collaboration in past positions. Mention any mentoring or development initiatives you've implemented.

Tailor Your Application: Customize your cover letter to reflect your understanding of ThousandEyes and Cisco's mission. Discuss how your technical vision aligns with their goals and how you can contribute to their cloud and data strategy.

How to prepare for a job interview at ThousandEyes

Show Your Leadership Skills

As a Director of Site Reliability Engineering, it's crucial to demonstrate your ability to lead and inspire teams. Share examples of how you've fostered a culture of innovation and collaboration in previous roles.

Discuss Your Technical Vision

Be prepared to articulate your technical vision for cloud and big data infrastructure. Discuss how you align technical initiatives with business objectives and provide specific examples of successful strategies you've implemented.

Highlight Your Experience with SRE Principles

Make sure to discuss your deep understanding of SRE principles such as monitoring, alerting, and fault analysis. Provide concrete examples of how you've applied these concepts to improve system reliability in past projects.

Emphasize Collaboration Across Teams

Collaboration is key in this role. Share experiences where you've effectively partnered with cross-functional teams, including development and security, to achieve shared goals and deliver high-quality solutions.

Director of Site Reliability Engineering, NADP Bangalore, India
ThousandEyes
T
  • Director of Site Reliability Engineering, NADP Bangalore, India

    Full-Time
    43200 - 72000 £ / year (est.)

    Application deadline: 2027-01-31

  • T

    ThousandEyes

Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>