Senior Platform Engineer - Observability

Senior Platform Engineer - Observability

Full-Time 70000 - 90000 £ / year (est.) Home office (partial)
Teya

At a Glance

  • Tasks: Design and build observability products that enhance performance and reliability across teams.
  • Company: Join Teya, a forward-thinking tech company with a focus on innovation and collaboration.
  • Benefits: Enjoy flexible hours, health support, generous leave, and a vibrant office environment.
  • Other info: Be part of a diverse team committed to inclusivity and professional growth.
  • Why this job: Make a real impact by improving systems that help hundreds of teams work faster and smarter.
  • Qualifications: Strong software engineering skills, especially in Golang, and experience with observability tools.

The predicted salary is between 70000 - 90000 £ per year.

Teya is hiring a Senior Platform Engineer to join the Observability team within Cloud Platform. This team builds and operates the observability products and foundations used by engineering across Teya. The role is not a traditional operations or support position. It is a senior engineering role focused on building reliable, scalable and self-service observability capabilities that help hundreds of teams move faster while improving reliability, performance and visibility. You will work on systems with meaningful scale and company-wide impact. Cloud Platform supports self-service software delivery across the organisation, and the observability stack underpins monitoring, alerting, logging, tracing and dashboards used by teams across multiple scopes and environments.

What this team owns

  • The platform currently operates at large scale, including 13 monitoring stacks, hundreds of thousands of metric samples per second, tens of thousands of traces per second, and significant long-term storage footprints across metrics, logs and traces.
  • The Observability team is responsible for core platform capabilities across:
    • Monitoring built around Prometheus, Thanos, exporters, recording rules and alerting.
    • Logging built around Loki and associated querying, storage and alerting workflows.
    • Tracing built around Tempo and OpenTelemetry-compatible ingestion paths.
    • Visualisation and user workflows through Grafana.
    • Self-service observability for product teams, including metric collection, dashboards, recording rules, alerting definitions and integrations with Slack and incident tooling.

Responsibilities

  • Design, build, deploy and improve observability products used across Cloud Platform and engineering teams.
  • Improve the scalability, reliability and performance of monitoring, logging, tracing and alerting systems.
  • Build self-service abstractions that let teams instrument services, define alerts, create dashboards and troubleshoot systems with minimal platform dependency.
  • Help define and evangelise best practices around observability, reliability and performance engineering.
  • Work closely with engineers across the company to improve telemetry quality, reduce toil and raise the standard of operational excellence.
  • Contribute to platform-wide engineering work in areas such as automation, Kubernetes, cloud infrastructure, incident learning and internal tooling.

What we are looking for

  • Strong software engineering background, ideally with significant experience in Golang and the broader software development lifecycle.
  • Strong understanding of distributed systems, reliability concerns and performance trade-offs at scale.
  • Hands‑on experience with modern observability tooling such as Prometheus, Grafana, Loki, Tempo, Thanos, OpenTelemetry or similar systems.
  • Experience with cloud platforms and their APIs, plus infrastructure patterns in environments such as AWS.
  • Experience working with Kubernetes and container‑based platforms.
  • Ability to operate with high ownership in a senior team and make sound engineering decisions in systems with large organisational blast radius.
  • Deep understanding of PromQL and LogQL.
  • Good intuition for latency, tail latency, sampling, percentiles and telemetry trade-offs between logs, metrics and traces.
  • Experience designing or operating observability systems for many teams, not only for a single service or product.
  • Experience improving developer experience through automation and platform abstractions.

The Perks

  • We trust you, so we offer flexible working hours, as long it suits both you and your team.
  • Physical and mental health support through our partnership with GymPass giving free access to over 1,500 gyms in the UK, 1‑1 therapy, meditation sessions, digital fitness and nutrition apps.
  • Our company offers extended and improved maternity and paternity leave choices, giving employees more flexibility and support.
  • Cycle‑to‑Work Scheme.
  • Health and Life Insurance.
  • Pension Scheme.
  • 25 days of Annual Leave (+ Bank Holidays).
  • Office snacks every day.
  • Friendly, comfortable and informal office environment in Central London.

Teya is proud to be an equal opportunity employer. We are committed to creating an inclusive environment where everyone regardless of race, ethnicity, gender identity or expression, sexual orientation, age, disability, religion, or background can thrive and do their best work. We believe that a diverse team leads to better ideas, stronger outcomes, and a more supportive workplace for all. If you require any reasonable adjustments at any stage of the recruitment process whether for interviews, assessments, or other parts of the application—we encourage you to let us know. We are committed to ensuring that every candidate has a fair and accessible experience with us.

Senior Platform Engineer - Observability employer: Teya

Teya is an exceptional employer that fosters a culture of innovation and collaboration, particularly within the dynamic environment of Central London. With a strong emphasis on employee well-being, we offer flexible working hours, comprehensive health support, and generous leave policies, ensuring our team members can thrive both personally and professionally. Our commitment to diversity and inclusion, alongside opportunities for growth in cutting-edge observability technologies, makes Teya a rewarding place to advance your career.

Teya

Contact Details:

Teya Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Senior Platform Engineer - Observability

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Teya or similar companies. A friendly chat can open doors and give you insights that a job description just can't.

Tip Number 2

Show off your skills! If you've got a GitHub or portfolio, make sure it's up to date. Share projects that highlight your experience with observability tools like Prometheus or Grafana. Let your work speak for itself!

Tip Number 3

Prepare for the interview by diving deep into Teya's tech stack. Understand how they use tools like Loki and Tempo. This shows you're genuinely interested and ready to contribute from day one.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re keen on joining the team directly. Don’t miss out on this opportunity!

We think you need these skills to ace Senior Platform Engineer - Observability

Golang
Distributed Systems
Observability Tooling
Prometheus
Grafana
Loki
Tempo

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that align with the Senior Platform Engineer role. Highlight your experience with observability tools like Prometheus and Grafana, and don’t forget to mention any relevant projects you've worked on!

Craft a Compelling Cover Letter:Your cover letter is your chance to show us your personality and passion for the role. Share why you’re excited about working in observability and how your background makes you a great fit for our team at Teya.

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled challenges in previous roles, especially those related to scalability and reliability. We love seeing how you think through problems and come up with innovative solutions!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands and shows us you’re serious about joining our team!

How to prepare for a job interview at Teya

Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, like Prometheus, Grafana, and Golang. Brush up on your understanding of distributed systems and be ready to discuss how you've used these tools in past projects.

Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled challenges related to observability, reliability, and performance. Think about situations where you improved a system's scalability or reduced latency, and be ready to explain your thought process.

Understand the Bigger Picture

Demonstrate your knowledge of how observability impacts the entire engineering team. Be prepared to discuss how your work can help improve developer experience and operational excellence across multiple teams, not just within a single service.

Ask Insightful Questions

Prepare thoughtful questions that show your interest in the role and the company. Inquire about the current challenges the Observability team is facing or how they measure success in their observability initiatives. This shows you're engaged and thinking critically about the position.