Observability Platform Engineer UK

Full-Time, £60,000 - £80,000 / year (est.), Home office (partial)
Nscale Ltd.

At a Glance

  • Tasks: Design and manage observability platforms for our global AI datacentre infrastructure.
  • Company: Join Nscale, the GPU cloud engineered for AI with a culture of innovation.
  • Benefits: Competitive salary, inclusive environment, and opportunities for professional growth.
  • Other info: Diverse workplace encouraging applications from all backgrounds.
  • Why this job: Make a real impact in AI development while working with cutting-edge technology.
  • Qualifications: Experience in observability platforms and strong collaboration skills required.

The predicted salary is between £60,000 and £80,000 per year.

Nscale is the GPU cloud engineered for AI. We provide cost‑effective, high-performance infrastructure for AI start‑ups and large enterprise customers. Nscale enables AI‑focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.

We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you’ll be contributing to building the technology that powers the future.

About The Role

Nscale is seeking an Observability Platform Engineer to design, deploy, and operate the monitoring, logging, and tracing systems that power observability across our global AI datacentre infrastructure. You will focus on scalability, automation, and integration of observability platforms, ensuring that metrics, logs, and traces are accurate, accessible, and actionable.

You’ll work closely with SRE, Infrastructure, and Engineering teams to ensure our GPU‑powered cloud is fully instrumented for reliability and performance.

What You’ll Be Doing

  • Design, build, and maintain observability platforms (monitoring, logging, tracing, alerting) at global scale.
  • Deploy and manage tools such as Prometheus, Grafana, Datadog, ELK/OpenSearch, OpenTelemetry, and Jaeger.
  • Automate observability infrastructure using Infrastructure‑as‑Code and CI/CD pipelines.
  • Partner with engineering and SRE teams to instrument applications and systems for telemetry.
  • Develop dashboards, alerts, and analytics to provide real‑time visibility into infrastructure health.
  • Ensure observability data is accurate, reliable, and retained per compliance requirements.
  • Troubleshoot observability platform issues, ensuring high availability and performance.
  • Drive adoption of best practices for monitoring, logging, and tracing across the company.
  • Contribute to continuous improvement of incident detection, response, and resolution.
  • Document observability standards, tools, and processes.

About You (Skills / Qualifications)

  • Strong experience in designing and operating observability platforms at scale.
  • Hands‑on expertise with monitoring, logging, and tracing tools (Prometheus, Grafana, Datadog, ELK/OpenSearch, Splunk, OpenTelemetry, Jaeger).
  • Experience with cloud‑native infrastructure (Kubernetes, containers, service meshes).
  • Proficiency in scripting/automation (e.g., Python, Go, Bash).
  • Knowledge of Infrastructure‑as‑Code (Terraform, Ansible, Pulumi) and CI/CD practices.
  • Strong understanding of distributed systems reliability and incident management.
  • Excellent problem‑solving skills with the ability to diagnose performance issues across systems.
  • Good collaboration skills to work with engineering, operations, and product teams.

Nice to have:

  • Experience with AI/ML workload observability.
  • Familiarity with hyperscale datacentre environments.
  • Knowledge of AIOps and advanced telemetry analytics.
  • Exposure to sustainability monitoring (e.g., power usage effectiveness, efficiency metrics).

At Nscale, we are committed to fostering an inclusive, diverse, and equitable workplace. We believe that a variety of perspectives enriches our work environment, and we encourage applications from candidates of all backgrounds, experiences, and abilities. We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio‑economic backgrounds.

For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice.

Observability Platform Engineer UK employer: Nscale Ltd.

Nscale is an exceptional employer, offering a dynamic work environment where innovation and accountability are at the forefront. As an Observability Platform Engineer, you will have the opportunity to work with cutting-edge technology in a culture that values transparency and collaboration, while also benefiting from professional growth opportunities and a commitment to diversity and inclusion. Located in the UK, Nscale provides a unique chance to contribute to the future of AI infrastructure, all while being part of a team that prioritises excellence and environmental responsibility.

Nscale Ltd.

Contact Details:

Nscale Ltd. Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Observability Platform Engineer UK

Join Local Tech Meetups

Get out there and mingle with fellow developers by joining local tech meetups. It’s a fantastic way to meet people who might be working at Nscale Ltd. or know someone who does. Plus, you can pick up new tech skills and learn about current trends while you're at it!

Contribute to Open Source Projects

Show off your coding chops by jumping into open-source projects. Not only does this give you practical experience, but it also gets you noticed in the dev community. You'll create a killer portfolio that speaks volumes about your skills to Nscale Ltd.

Tap into Online Developer Communities

Don’t underestimate the power of online developer communities like GitHub, Stack Overflow, and even Reddit. Participate in discussions, share your projects, and build your visibility. You can often find opportunities through these channels that lead to a full-time gig at companies like Nscale Ltd.

Explore Job Boards Specifically for Tech Roles

Keep your eyes peeled on job boards that focus on tech roles. Sites like TechCareers or Stack Overflow Jobs can often have listings for companies like Nscale Ltd. that might not show up on broader job sites. Make it a habit to check these regularly, and don’t hesitate to apply directly through our website!

We think you need these skills to ace Observability Platform Engineer UK

Observability Platforms Design
Monitoring Tools (Prometheus, Grafana, Datadog, ELK/OpenSearch, Splunk, OpenTelemetry, Jaeger)
Cloud-Native Infrastructure (Kubernetes, containers, service meshes)
Scripting/Automation (Python, Go, Bash)
Infrastructure-as-Code (Terraform, Ansible, Pulumi)
CI/CD Practices
Distributed Systems Reliability

Some tips for your application 🫡

Show off your coding skills: When applying for a software engineering role, it's super important to showcase your coding skills. Make sure your CV includes your tech stack, any relevant programming languages you’re comfortable with, and examples of projects you've worked on. If you have a GitHub profile, link it up! We love to see code in action.

Tailor your portfolio: For a full-time role, we’d expect to see some solid examples of your work in your portfolio. Make sure to include at least two or three projects that highlight your problem-solving skills and your ability to work with different technologies. Focus on the projects that are most relevant to the position at Nscale Ltd.

Craft a killer cover letter: Your cover letter is your chance to stand out, so make it personal! Explain why you want to work at Nscale Ltd. and how your skills align with the role. Show us your passion for software development. We dig enthusiastic candidates who understand the value of collaboration and continuous learning!

Be clear and concise: When it comes to writing your CV and cover letter, clarity is key. Avoid jargon that could confuse us and stick to simple, direct language. Highlight your achievements with quantifiable results where possible, and keep everything easy to read. A well-organised application goes a long way!

How to prepare for a job interview at Nscale Ltd.

Brush Up on Your Coding Skills

For a full-time software engineering role, it's crucial to keep your coding abilities sharp. Expect technical questions that might involve solving problems on the spot or discussing algorithms. Practise on platforms like LeetCode or HackerRank to get comfortable with the types of questions that often come up.

Know Your Tools and Frameworks

Make sure you’re well acquainted with the tools and technologies listed in the job description, and familiarise yourself with any specific frameworks or programming languages mentioned. If Nscale Ltd. uses React or Node.js, for instance, be ready to discuss how you’ve used them in previous projects or coursework.

Showcase Your Projects

Bring along a portfolio that highlights your best work. This could be code samples, GitHub repositories, or any side projects you’ve built. Make sure you can talk through your thought process for each project, especially the challenges you faced and how you solved them; this shows your problem-solving skills in action.

Prepare for Behavioural Questions

While technical skills are key, full-time positions also require cultural fit. Be ready to discuss your previous experiences and how you handle teamwork, conflict, and deadlines. Brush up on the STAR method (Situation, Task, Action, Result) to clearly articulate your past experiences when discussing how you've contributed to a team.