At a Glance
- Tasks: Lead observability projects, ensuring system reliability and performance across clients.
- Company: Join a forward-thinking tech company focused on innovation and collaboration.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Dynamic team environment with a focus on continuous improvement and operational excellence.
- Why this job: Make a real impact by driving observability strategies and mentoring future engineers.
- Qualifications: 5+ years in SRE or DevOps with expertise in APM and IaC.
The predicted salary is between 60000 - 80000 € per year.
Requirements
- Deep expertise in Application Performance Monitoring (APM), Infrastructure as Code (IaC), automation, and distributed tracing using OpenTelemetry
- 5+ years of experience in SRE, Observability, or DevOps roles, with leadership responsibilities
- Proven expertise with Application Performance Monitoring (APM) tools such as New Relic, Datadog, AppDynamics, or Dynatrace
- Hands‑on experience with OpenTelemetry (OTel) for distributed tracing and observability instrumentation
- Strong proficiency in Infrastructure as Code (IaC) using Terraform
- Solid understanding of cloud platforms including AWS, GCP, or Azure
- Experience with automation/configuration management tools like Ansible, Chef, or Puppet
- Deep knowledge of CI/CD pipelines and tools such as GitHub Actions, Jenkins, or Azure DevOps
- Experience managing Kubernetes and containerized environments (Docker, Helm)
- Familiarity with log aggregation and analysis platforms like ELK Stack or Splunk
- Excellent leadership, communication, and collaboration skills
What the job involves
- Drive the strategy and execution of observability and reliability projects across our clients
- Guide the design, implementation, and continuous improvement of observability solutions, ensuring system reliability, performance, and scalability while fostering best practices in SRE and DevOps
- Lead the strategic development and management of observability and reliability frameworks across the organization, ensuring alignment with business goals and technical requirements
- Design and implement monitoring and observability solutions, collaborating with engineering teams to define standards and best practices
- Manage Infrastructure as Code (IaC) initiatives using Terraform, coordinating with cloud and infrastructure teams to ensure scalable and secure deployments
- Drive automation strategies for monitoring, alerting, and logging pipelines, focusing on process improvements and operational efficiency
- Develop and maintain comprehensive observability roadmaps, including distributed tracing, logging, and metrics collection strategies
- Collaborate with product management, sales, and pre‑sales teams to provide technical expertise and support during solution design and customer engagements
- Lead cross‑functional teams to enhance CI/CD pipelines and deployment reliability, ensuring smooth integration of observability tools and practices
- Engage with vendors and strategic partners to evaluate, select, and integrate observability and monitoring solutions, ensuring alignment with organizational needs and fostering strong collaborative relationships
- Mentor and develop junior engineers and analysts, fostering a culture of reliability, observability, and operational excellence
Site Reliability Engineer (SRE / Observability Technical Lead) in London employer: Deepstreamtech
Join a forward-thinking company that prioritises innovation and employee development, offering a dynamic work culture where collaboration and continuous learning are at the forefront. As a Site Reliability Engineer (SRE) / Observability Technical Lead, you will not only lead critical projects but also have access to extensive growth opportunities, mentorship, and cutting-edge technologies in a supportive environment that values your contributions. Located in a vibrant area, our company provides a unique blend of professional challenges and a fulfilling work-life balance, making it an exceptional place to advance your career.
StudySmarter Expert Advice🤫
We think this is how you could land Site Reliability Engineer (SRE / Observability Technical Lead) in London
✨Tip Number 1
Network like a pro! Reach out to your connections in the industry, attend meetups, and join online forums. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects related to APM, IaC, and automation. This gives potential employers a tangible look at what you can do, especially in SRE and observability.
✨Tip Number 3
Prepare for interviews by brushing up on your technical knowledge and soft skills. Practice common SRE scenarios and be ready to discuss how you've tackled challenges in previous roles. Confidence is key!
✨Tip Number 4
Don’t forget to apply through our website! We love seeing candidates who are genuinely interested in joining our team. Plus, it’s a great way to ensure your application gets the attention it deserves.
We think you need these skills to ace Site Reliability Engineer (SRE / Observability Technical Lead) in London
Some tips for your application 🫡
Tailor Your CV:Make sure your CV reflects the skills and experiences that match our job description. Highlight your expertise in APM, IaC, and automation tools, as these are key for us.
Craft a Compelling Cover Letter:Use your cover letter to tell us why you're the perfect fit for the SRE role. Share specific examples of your leadership experience and how you've driven observability projects in the past.
Showcase Your Technical Skills:Don’t shy away from listing your technical proficiencies! Mention your hands-on experience with OpenTelemetry, Terraform, and any CI/CD tools you’ve worked with. We love seeing those details!
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you don’t miss out on any important updates from our team.
How to prepare for a job interview at Deepstreamtech
✨Know Your Tools Inside Out
Make sure you’re well-versed in the APM tools mentioned in the job description, like New Relic or Datadog. Be ready to discuss your hands-on experience with these tools and how you've used them to improve application performance.
✨Showcase Your Leadership Skills
As a Site Reliability Engineer, you'll be expected to lead projects and teams. Prepare examples of past leadership experiences where you guided a team through challenges, especially in SRE or DevOps roles. Highlight your communication and collaboration skills.
✨Demonstrate Your IaC Expertise
Since Infrastructure as Code is crucial for this role, brush up on Terraform and be prepared to discuss specific projects where you implemented IaC. Share insights on how it improved deployment processes or system reliability.
✨Prepare for Technical Questions
Expect technical questions around distributed tracing and observability. Review OpenTelemetry and be ready to explain how you’ve used it in previous roles. Also, think about CI/CD pipelines and how you’ve enhanced them in your past work.