Site Reliability Engineer (SRE / Observability Technical Lead)

Site Reliability Engineer (SRE / Observability Technical Lead)

Full-Time 70000 - 90000 € / year (est.) Home office (partial)
Deepstreamtech

At a Glance

  • Tasks: Lead observability projects, ensuring system reliability and performance across clients.
  • Company: Join a forward-thinking tech company focused on innovation and collaboration.
  • Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
  • Other info: Dynamic team environment with a focus on continuous improvement and operational excellence.
  • Why this job: Make a real impact by driving observability strategies and mentoring future engineers.
  • Qualifications: 5+ years in SRE or DevOps with expertise in APM and IaC.

The predicted salary is between 70000 - 90000 € per year.

Requirements

  • Deep expertise in Application Performance Monitoring (APM), Infrastructure as Code (IaC), automation, and distributed tracing using OpenTelemetry
  • 5+ years of experience in SRE, Observability, or DevOps roles, with leadership responsibilities
  • Proven expertise with Application Performance Monitoring (APM) tools such as New Relic, Datadog, AppDynamics, or Dynatrace
  • Hands‑on experience with OpenTelemetry (OTel) for distributed tracing and observability instrumentation
  • Strong proficiency in Infrastructure as Code (IaC) using Terraform
  • Solid understanding of cloud platforms including AWS, GCP, or Azure
  • Experience with automation/configuration management tools like Ansible, Chef, or Puppet
  • Deep knowledge of CI/CD pipelines and tools such as GitHub Actions, Jenkins, or Azure DevOps
  • Experience managing Kubernetes and containerized environments (Docker, Helm)
  • Familiarity with log aggregation and analysis platforms like ELK Stack or Splunk
  • Excellent leadership, communication, and collaboration skills

What the job involves

  • Drive the strategy and execution of observability and reliability projects across our clients
  • Guide the design, implementation, and continuous improvement of observability solutions, ensuring system reliability, performance, and scalability while fostering best practices in SRE and DevOps
  • Lead the strategic development and management of observability and reliability frameworks across the organization, ensuring alignment with business goals and technical requirements
  • Design and implement monitoring and observability solutions, collaborating with engineering teams to define standards and best practices
  • Manage Infrastructure as Code (IaC) initiatives using Terraform, coordinating with cloud and infrastructure teams to ensure scalable and secure deployments
  • Drive automation strategies for monitoring, alerting, and logging pipelines, focusing on process improvements and operational efficiency
  • Develop and maintain comprehensive observability roadmaps, including distributed tracing, logging, and metrics collection strategies
  • Collaborate with product management, sales, and pre‑sales teams to provide technical expertise and support during solution design and customer engagements
  • Lead cross‑functional teams to enhance CI/CD pipelines and deployment reliability, ensuring smooth integration of observability tools and practices
  • Engage with vendors and strategic partners to evaluate, select, and integrate observability and monitoring solutions, ensuring alignment with organizational needs and fostering strong collaborative relationships
  • Mentor and develop junior engineers and analysts, fostering a culture of reliability, observability, and operational excellence

Site Reliability Engineer (SRE / Observability Technical Lead) employer: Deepstreamtech

Join a forward-thinking company that prioritises innovation and excellence in the field of Site Reliability Engineering. With a strong commitment to employee growth, we offer extensive training opportunities, a collaborative work culture, and the chance to lead impactful projects that enhance system reliability and performance. Located in a vibrant tech hub, our team enjoys a dynamic environment that fosters creativity and professional development, making it an ideal place for those seeking meaningful and rewarding employment.

Deepstreamtech

Contact Detail:

Deepstreamtech Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Site Reliability Engineer (SRE / Observability Technical Lead)

Tip Number 1

Network like a pro! Attend meetups, webinars, or tech conferences related to SRE and observability. It's a great way to meet industry folks and get your name out there. Plus, you never know who might be hiring!

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those involving APM tools or IaC with Terraform. This gives potential employers a taste of what you can do and sets you apart from the crowd.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and soft skills. Practice common SRE scenarios and be ready to discuss your experience with tools like OpenTelemetry and CI/CD pipelines. Confidence is key!

Tip Number 4

Don't forget to apply through our website! We love seeing candidates who are genuinely interested in joining our team. Tailor your application to highlight your leadership experience and expertise in observability—let's make it happen!

We think you need these skills to ace Site Reliability Engineer (SRE / Observability Technical Lead)

Application Performance Monitoring (APM)
OpenTelemetry
Infrastructure as Code (IaC)
Terraform
Cloud Platforms (AWS, GCP, Azure)
Automation/Configuration Management (Ansible, Chef, Puppet)
CI/CD Pipelines (GitHub Actions, Jenkins, Azure DevOps)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match our job description. Highlight your expertise in APM, IaC, and any leadership roles you've held. We want to see how you fit into our vision!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about SRE and observability. Share specific examples of your past work that align with what we’re looking for, and don’t forget to show your enthusiasm for joining our team.

Showcase Your Technical Skills:When listing your technical skills, be specific! Mention the tools and technologies you’ve used, like Terraform, OpenTelemetry, or any APM tools. We love seeing hands-on experience, so don’t hold back on the details!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining StudySmarter!

How to prepare for a job interview at Deepstreamtech

Know Your Tools Inside Out

Make sure you’re well-versed in the APM tools mentioned in the job description, like New Relic or Datadog. Be ready to discuss your hands-on experience with these tools and how you've used them to improve application performance.

Showcase Your Leadership Skills

As a Site Reliability Engineer, you'll be expected to lead projects and teams. Prepare examples of past leadership experiences, focusing on how you guided teams through challenges and implemented best practices in SRE and DevOps.

Demonstrate Your Automation Expertise

Be prepared to talk about your experience with Infrastructure as Code (IaC) using Terraform and automation tools like Ansible or Chef. Share specific instances where your automation strategies improved operational efficiency.

Engage in Technical Discussions

Expect technical questions around distributed tracing and observability. Brush up on OpenTelemetry and be ready to discuss how you’ve implemented monitoring solutions in previous roles. Engaging confidently in these discussions will show your depth of knowledge.