At a Glance
- Tasks: Design and develop monitoring dashboards, automate workflows, and ensure platform stability.
- Company: Join TCS, a purpose-led transformation company making a meaningful impact globally.
- Benefits: Competitive salary, health care, life assurance, and extensive training resources.
- Why this job: Be part of innovative projects that shape large-scale digital platforms and drive continuous improvement.
- Qualifications: Experience in Site Reliability Engineering and strong communication skills required.
- Other info: Diverse and inclusive workplace with excellent career growth opportunities.
The predicted salary is between £60,000 and £80,000 per year.
Ready to apply your experience and expertise in Site Reliability Engineering? We have an exciting opportunity for you.
TCS is a purpose-led transformation company, built on belief. We do not just help businesses to transform through technology. We support them in making a meaningful difference to the people and communities they serve - our clients include some of the biggest brands in the UK and worldwide. For you, it means the chance to make an impact that matters, through challenging projects that demand ambitious innovation and thought leadership.
Collaborate within dynamic, high‑performing engineering teams. Tackle technical challenges that demand innovation and resilience. Deliver impactful solutions that help shape large‑scale digital platforms.
The Role
As a Site Reliability Engineer, you will combine software engineering and operations expertise to ensure platforms are scalable, resilient and highly available. You will lead automation initiatives, enhance observability, support production systems and collaborate closely with engineering and operations teams to drive continuous improvement across reliability, performance and stability.
Key responsibilities:
- Design and develop dashboards to monitor application health, performance and key metrics.
- Implement SLAs, SLOs and SLIs across microservices and data pipelines.
- Support large‑scale systems handling high traffic volumes, ensuring availability and resilience.
- Automate monitoring, alerting, reporting and observability workflows.
- Collaborate with engineering, operations and business teams to ensure platform stability.
- Analyse performance trends and provide recommendations for continuous improvement.
- Support capacity planning, disaster recovery and compliance activities.
- Implement tooling for improved incident triage, granular alerting, runbooks and auto‑remediation.
Your Profile
- Strong experience in Site Reliability Engineering and production support.
- Expertise with Dynatrace monitoring and observability tooling.
- Experience designing journey‑level metrics and synthetic monitoring (canaries) in test/production.
- Strong understanding of SLIs, SLOs, error budgets and availability modelling.
- Experience implementing SRE practices in regulated environments (e.g., banking, payments).
- Experience working with engineering teams to identify pain points and reduce MTTR/MTTD.
- Experience with additional observability tools or service‑mesh architectures.
- Strong stakeholder‑communication and coordination skills.
TCS is consistently voted a Top Employer in the UK and globally. Our competitive salary packages feature pension, health care, life assurance, laptop, phone, access to extensive training resources and discounts within the larger Tata network. We offer health & wellness initiatives and sports events; we are the proud sponsor of the London Marathon.
Diversity, Inclusion and Wellbeing
Tata Consultancy Services UK&I is committed to meeting the accessibility needs of all individuals in accordance with the UK Equality Act 2010 and the UK Human Rights Act 1998. We welcome and embrace diversity in race, nationality, ethnicity, disability, neurodiversity, gender identity, age, physical ability, gender reassignment and sexual orientation. We are a disability-inclusive employer and encourage disabled people to apply for this role. As a Disability Confident Employer, we offer an interview to applicants with disabilities or long-term conditions who meet the minimum criteria for the role.
If you are an applicant who needs any adjustments to the application process or interview, please contact us to request an adjustment. We welcome requests prior to you completing the application and at any stage of the recruitment process.
Site Reliability Engineer in London employer: Tata Consultancy Services
Contact Details:
Tata Consultancy Services Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Site Reliability Engineer role in London
✨Tip Number 1
Network like a pro! Reach out to current or former Site Reliability Engineers on LinkedIn. Ask them about their experiences at TCS and any tips they might have for the interview process. Personal connections can give you insights that job descriptions just can't.
✨Tip Number 2
Prepare for technical interviews by brushing up on your SRE skills. Practise coding challenges and system design questions that are relevant to the role. We recommend using platforms like LeetCode or HackerRank to get in the zone!
✨Tip Number 3
Showcase your passion for reliability engineering during interviews. Share specific examples of how you've tackled challenges in past roles, especially around automation and observability. This will help you stand out as someone who truly understands the field.
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining TCS and making an impact in the world of Site Reliability Engineering.
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with monitoring tools like Dynatrace and any SRE practices you've implemented. We want to see how your skills align with what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about Site Reliability Engineering and how you can contribute to our mission at TCS. Keep it concise but impactful, and let your personality show through.
Showcase Your Projects: If you've worked on relevant projects, don't hesitate to showcase them! Whether it's automating workflows or enhancing observability, we love seeing real-world examples of your work. It helps us understand your hands-on experience better.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the easiest way for us to receive your application and ensures you’re considered for the role. Plus, you’ll find all the details you need right there!
How to prepare for a job interview at Tata Consultancy Services
✨Know Your SRE Fundamentals
Brush up on your understanding of SLIs, SLOs, and error budgets. Be ready to discuss how you've implemented these concepts in past roles, especially in regulated environments. This shows you’re not just familiar with the theory but have practical experience too.
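If you want a quick refresher before the interview, the arithmetic behind an error budget can be sketched in a few lines of Python. This is only an illustration and is not taken from TCS or from any monitoring tool: the `ErrorBudget` class, its field names, the request-based SLI and the 30-day window are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudget:
    """Hypothetical request-based error budget for a single SLO window."""

    slo_target: float      # e.g. 0.999 means 99.9% of requests should succeed
    total_requests: int    # requests observed in the window
    failed_requests: int   # requests that violated the SLI in the window

    @property
    def sli(self) -> float:
        """Measured success ratio (the SLI) for the window."""
        if self.total_requests == 0:
            return 1.0  # no traffic, so nothing has failed
        return 1 - self.failed_requests / self.total_requests

    @property
    def allowed_failures(self) -> float:
        """Number of failures the SLO tolerates in this window."""
        return (1 - self.slo_target) * self.total_requests

    @property
    def budget_remaining(self) -> float:
        """Fraction of the error budget left; negative once the SLO is breached."""
        allowed = self.allowed_failures
        if allowed == 0:
            return 0.0 if self.failed_requests else 1.0
        return 1 - self.failed_requests / allowed


def allowed_downtime_minutes(slo_target: float, window_days: int = 30) -> float:
    """Time-based view: minutes of full outage the SLO permits per window."""
    return (1 - slo_target) * window_days * 24 * 60


if __name__ == "__main__":
    budget = ErrorBudget(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
    print(f"SLI: {budget.sli:.5f}")
    print(f"Budget remaining: {budget.budget_remaining:.0%}")
    print(f"Allowed downtime: {allowed_downtime_minutes(0.999):.1f} min / 30 days")
```

With a 99.9% target, one million requests allow roughly 1,000 failures, so 250 failures leave about three quarters of the budget. Being able to walk through numbers like these is a simple way to show you understand how SLIs, SLOs and error budgets relate.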
✨Showcase Your Automation Skills
Prepare examples of how you've led automation initiatives in previous positions. Discuss specific tools you've used for monitoring and observability, like Dynatrace, and how they improved system reliability. This will highlight your technical expertise and problem-solving abilities.
✨Collaboration is Key
Be ready to talk about your experience working with cross-functional teams. Share instances where you collaborated with engineering and operations to enhance platform stability. This demonstrates your ability to communicate effectively and work well in a team setting.
✨Prepare for Scenario Questions
Think through potential scenarios related to high traffic volumes or system failures. Prepare to explain how you would approach these challenges, focusing on your analytical skills and decision-making process. This will show that you can think on your feet and handle pressure.