Observability Platform Engineer
Observability Platform Engineer

Observability Platform Engineer

London Full-Time 36000 - 60000 ÂŁ / year (est.) No home office possible
Go Premium
N

At a Glance

  • Tasks: Design and manage observability systems for AI workloads, ensuring robust monitoring and alerting.
  • Company: Nscale, a cutting-edge GPU cloud company focused on AI infrastructure.
  • Benefits: Competitive salary, inclusive culture, and opportunities for professional growth.
  • Why this job: Join a team driving innovation in AI technology and make a real impact.
  • Qualifications: 2-5 years in software engineering or related fields; scripting skills required.
  • Other info: Dynamic, collaborative environment with a focus on sustainability and diversity.

The predicted salary is between 36000 - 60000 ÂŁ per year.

Join to apply for the Observability Platform Engineer role at NscaleAbout Nscale Nscale is the GPU cloud engineered for AI. We offer high‐performance, cost‐efficient infrastructure designed for modern AI workloads, blending the power of bespoke supercomputers with the flexibility of cloud services. Our vertically integrated platform spans GPU‐dense, energy‐efficient data centres through Kubernetes and Slurm orchestration to AI‐ready services.We thrive on a culture of relentless innovation, ownership, and accountability, where every team member takes pride in their work and drives it with excellence and urgency. As an Nscaler, you\’ll build trust through openness and transparency, where everyone is inspired to do their best work. If you join our team, you\’ll be contributing to building the technology that powers the future.About The Role (Job Purpose) As an Observability Platform Engineer, you will design, build, and manage the systems that surface deep visibility into Nscale\’s infrastructure and AI workloads. You\’ll treat observability as a product, partnering with engineering and SRE teams to ensure our monitoring, logging, tracing, and alerting platforms are robust, scalable, and easy to use.This role requires hands‐on engineering experience combined with empathy for how other teams consume observability data. You\’ll ensure infrastructure health, reliability, and performance by enabling proactive insights and reducing operational friction.What You\’ll Do Design, build, and support scalable observability infrastructure (metrics, logs, traces, alerts).Collaborate with internal teams to embed observability as a seamless product across GPU clusters, Kubernetes, Slurm, and AI services.Implement and refine monitoring and alerting patterns to enhance system reliability and reliability culture.Maintain production and pre‐production observability clusters and help others adopt best practices.Automate observability pipelines using IaC tools and scripting for repeatability and consistency.Troubleshoot observability platform issues and support incident remediation efforts.Serve as an advocate for observability best practices, training teams on effective usage and instrumentation.About You Skills / Experience 2–5 years of experience in Software Engineering, SRE, DevOps, or observability‐related roles.Proficiency in at least one scripting or programming language (Python, Go, Bash).Experience with Kubernetes or containerised environments.Familiarity with on‐call responsibilities, triaging, and escalating live production issues.Comfortable with observability tooling, Grafana, Prometheus, Loki, OpenTelemetry, ClickHouse, Elastic, Thanos, VictoriaMetrics, etc.Strong communication and collaboration skills, able to empathise with users of observability systems and translate needs into solutions.Preferred Hands‐on experience operating observability infrastructure at scale.Knowledge of Infrastructure‐as‐Code (e.g. Terraform) to automate deployments.Exposure to streaming systems or pipelines for observability data.In All We Do, Our Core Values Guide Us Relentless Innovation At Nscale, we constantly push the boundaries of innovation, embracing creative risks to shape the future. Our aim is to deliver products that not only meet but exceed today\’s expectations, setting new standards for tomorrow.Ownership and Accountability Every Nscaler is fully accountable for their work, driving it with excellence and urgency. We set high standards, ensuring that our contributions are not just good but exceptional.Openness and Transparency We believe trust and transparency are key to our success. We maintain open communication within our teams and with stakeholders, sharing both successes and challenges. Our open‐source approach allows customers to explore our technology, building trust and ensuring our solutions are both innovative, secure, and reliable.Customer‐Centric Focus Our customers are central to our mission, and we are committed to delivering impactful solutions that drive real‐world success. We focus on deeply understanding their needs and challenges, striving to exceed expectations in both product quality and service.Sustainability We are dedicated to considering the long‐term environmental and societal impacts of our technologies. By integrating sustainability into our operations and product development, we ensure that our innovations are both effective and responsible, contributing positively to the world around us.Full‐Speed Collaboration Collaboration at Nscale is fast, efficient, and respectful. We work together seamlessly, with clear communication and mutual respect, ensuring our shared goals are met with high standards and impactful outcomes.Equal Opportunities Statement We strongly encourage applications from people of colour, the LGBTQ+ community, people with disabilities, neurodivergent people, parents, carers, and people from lower socio‐economic backgrounds. If there\’s anything we can do to accommodate your specific situation, please let us know.The responsibilities outlined in this job description are not exhaustive and are intended to provide a general overview of the position. The employee may be required to perform additional duties, tasks, and responsibilities as assigned by management, consistent with the skills and qualifications required for the role.For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice: Here.London, England, United Kingdom

#J-18808-Ljbffr

Observability Platform Engineer employer: Nscale

Nscale is an exceptional employer, offering a dynamic work environment in London where innovation and accountability are at the forefront of our culture. As an Observability Platform Engineer, you will have the opportunity to collaborate with talented teams, driving impactful solutions while enjoying a commitment to employee growth and inclusivity. With a focus on sustainability and open communication, Nscale empowers its employees to excel and contribute to cutting-edge technology that shapes the future of AI.
N

Contact Detail:

Nscale Recruiting Team

StudySmarter Expert Advice đŸ€«

We think this is how you could land Observability Platform Engineer

✹Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Nscale. A friendly chat can open doors and give you insights that a job description just can't.

✹Tip Number 2

Show off your skills! Create a portfolio or GitHub repo showcasing your projects related to observability. This is your chance to demonstrate your hands-on experience and creativity.

✹Tip Number 3

Prepare for the interview by understanding Nscale's culture and values. Be ready to discuss how your experience aligns with their focus on innovation, ownership, and collaboration.

✹Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining the Nscale team.

We think you need these skills to ace Observability Platform Engineer

Observability Infrastructure Design
Monitoring and Alerting Patterns
Kubernetes
Scripting (Python, Go, Bash)
Infrastructure-as-Code (Terraform)
Troubleshooting Skills
Collaboration Skills
Empathy for User Needs
Grafana
Prometheus
OpenTelemetry
Elastic
Incident Remediation
Automation of Observability Pipelines
Streaming Systems Knowledge

Some tips for your application đŸ«Ą

Tailor Your CV: Make sure your CV speaks directly to the role of Observability Platform Engineer. Highlight your experience with observability tools and any relevant projects you've worked on. We want to see how your skills align with our needs!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for observability and how you can contribute to Nscale's mission. Be sure to mention specific experiences that showcase your problem-solving skills and teamwork.

Showcase Your Technical Skills: Don’t forget to highlight your proficiency in scripting languages and any hands-on experience with Kubernetes or observability tooling. We love seeing candidates who can demonstrate their technical chops clearly and confidently.

Apply Through Our Website: We encourage you to apply through our website for a smoother application process. It helps us keep track of your application and ensures you don’t miss out on any important updates from us!

How to prepare for a job interview at Nscale

✹Know Your Tools

Make sure you’re familiar with the observability tools mentioned in the job description, like Grafana, Prometheus, and OpenTelemetry. Brush up on how they work and be ready to discuss your experience with them during the interview.

✹Showcase Your Collaboration Skills

Since this role involves working closely with engineering and SRE teams, prepare examples of how you've successfully collaborated in the past. Highlight any instances where you’ve translated user needs into effective observability solutions.

✹Demonstrate Problem-Solving Abilities

Be ready to talk about specific challenges you’ve faced in observability or related roles. Discuss how you approached troubleshooting and what steps you took to resolve issues, showcasing your hands-on engineering experience.

✹Emphasise Continuous Learning

Nscale values relentless innovation, so share how you stay updated with industry trends and new technologies. Mention any recent projects or learning experiences that demonstrate your commitment to growth in the observability space.

Observability Platform Engineer
Nscale
Location: London
Go Premium

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

N
  • Observability Platform Engineer

    London
    Full-Time
    36000 - 60000 ÂŁ / year (est.)
  • N

    Nscale

    50-100
Similar positions in other companies
UK’s top job board for Gen Z
discover-jobs-cta
Discover now
>