Remote Backend Engineer - Platform - Stacks | UK | Remote in Milton Keynes

Remote Backend Engineer - Platform - Stacks | UK | Remote in Milton Keynes

Milton Keynes Full-Time No working from home possible
Grafana Labs

At a Glance

  • Tasks: Join us in building and maintaining critical systems for Grafana Cloud.
  • Company: Grafana Labs, a remote-first open-source powerhouse with a global collaborative culture.
  • Benefits: Enjoy competitive salary, flexible remote work, and opportunities for professional growth.
  • Other info: Be part of a diverse team that values innovation and embraces AI-assisted development.
  • Why this job: Make a real impact by solving complex problems in a dynamic tech environment.
  • Qualifications: Experience in backend development, particularly with Golang and distributed systems.

Grafana Labs is a remote-first, open-source powerhouse. There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a NASA launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps more than 3,000 companies -- including Bloomberg, JPMorgan Chase, and eBay -- manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo). We’re scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.

The Opportunity: Application Core Services (AppCore) is a group within Platform, in the Foundations department. Foundations produces the Internal Engineering Platform (IEP) and partners closely with our Cloud, Enterprise, and Grafana teams. Our team develops the essential systems driving Grafana's business operations. We utilize the grafana.com platform to engineer bespoke integrations and solutions that unify the diverse technical ecosystem of a modern software enterprise. The team owns important domain areas that help keep both our customer workflows and internal business processes running smoothly. AppCore is made up of multiple squads, each focused on one or more of these domains. Our work includes maintaining the billing engine responsible for customer usage calculation, automating provisioning after a customer signs a contract, integrating with cloud marketplaces such as AWS, Azure, and GCP, and building and maintaining the user portal our customers rely on to manage their accounts. This is a team working at the intersection of product, platform, and business operations. The systems we build are critical to how Grafana scales.

We are looking for engineers who enjoy solving complex workflow and systems problems, improving reliability and developer experience, and building software that directly supports both customers and internal stakeholders. As a company we are remote-first and global, we embrace people of different experiences and backgrounds to build diverse teams where every person brings a unique perspective to the software. Engineers at Grafana also have the opportunity to contribute to Open Source communities and collaborate across teams beyond their immediate scope.

What You’ll Be Doing: The AppCore Stacks squad owns the systems that create, configure, reconcile, migrate, and operate Grafana Cloud stacks at scale. A stack is the customer-facing Grafana Cloud environment that connects an organization to Grafana and the backend services it uses, including Mimir, Loki, Tempo, plugins, dashboards, data sources, and stack-level configuration. Our work sits at the intersection of product, platform, and operations. We build the control-plane services and workflows that keep stack state aligned across grafana.com, Stack State Service (SSS), Hosted Grafana, cloud regions, and the underlying Grafana Cloud infrastructure. When this domain works well, customers get reliable stack creation, safe configuration rollout, predictable migrations, and fewer manual operational interventions.

  • Design, build, and operate reconciliation systems, including the SSS backend, to track desired stack state, detect and repair drift across stack templates, grafana.com state, Hosted Grafana, and actual customer stack configuration.
  • Collaborate across SSS, grafana.com, and deployment configurations to ensure stack lifecycle workflows remain reliable, observable, and resilient.
  • Improve operational efficiency by reducing deployment complexity (e.g., aiming for single PR regional SSS deployment) and contributing to the Stack Config Reconciliation project.
  • Manage rollout mechanisms for provisioned plugins, dashboards, data sources, Grafana versions, release channels, and stack-level configuration.
  • Support new region and cluster rollouts, including the operational paths required to bring stacks online safely in new Grafana Cloud regions.
  • Improve incident response and recovery paths for stack misalignment, reconciliation failures, plugin rollout issues, and Hosted Grafana integration failures.
  • Partner with Product, Hosted Grafana, Infrastructure, Support, and adjacent AppCore squads on customer-impacting stack lifecycle work.
  • Contribute to roadmap planning, technical design, OnCall improvements, and long-term simplification of stack operations.

You will help own the production behavior of the systems you build. That includes improving runbooks, dashboards, alerts, reconciliation safety, rollout controls, and recovery procedures. You should be comfortable debugging across service boundaries and making careful changes in systems that affect customer stacks.

What Makes You a Great Fit: At Grafana, we actively embrace AI-assisted and agentic development practices, integrating these technologies into both our engineering workflows and the systems we deliver. We encourage our engineers to thoughtfully leverage AI tools to enhance every stage of the lifecycle, from design and implementation to testing, documentation, and operations. We also look for strategic opportunities to embed agentic capabilities within our services to eliminate toil, bolster reliability, and ensure that complex customer workflows remain resilient and safe.

We are seeking a Backend Engineer who thrives on building production systems where correctness, scalability, and operational clarity are paramount. As a remote-first organization, you should be comfortable collaborating asynchronously across time zones and taking full ownership of the critical systems powering Grafana Cloud. Our team is small and operates with a high degree of independence; you will be expected to lead major projects, coordinate across service boundaries, and help define the technical direction for our domain. You will be particularly successful in this role if you enjoy solving challenges related to stateful systems, eventual consistency, and reconciliation loops. We value engineers who can take ambiguous lifecycle requirements and transform them into explicit, modular solutions. You should be adept at breaking down complex systems work into safe, iterative increments while clearly communicating technical tradeoffs to both internal stakeholders and adjacent product teams.

Some things you might be expected to do could include:

  • Writing efficient, readable, and easy to maintain code.
  • Implementing new microservices or systems.
  • Collaborating with teammates and other departments to reach consensus on proposed solutions.
  • Coordinating with product and UX when needed.
  • Responding to customer requests and feedback.
  • When ready, participating in our follow-the-sun OnCall rotation.
  • Participating in team decisions, such as roadmap planning and prioritization.

Requirements:

  • You have at least 1 year of fully remote work experience.
  • You have some experience working on a SaaS platform and are familiar with common distributed systems concepts (e.g., scalability, multi-tenancy, HA).
  • Have professional experience with Golang and be willing to work across both backend service and application code.
  • Care deeply about developer and user experience and the quality of the products that you work on.
  • Have some experience contributing to the delivery of projects, from initial brainstorming to shipping a product to the customer.
  • You write clean, well-tested software that other engineers can understand, operate, and maintain.
  • Can take on well-defined tasks, break them down, and execute iteratively to deliver working solutions and gather feedback.
  • You are willing to collaborate across teams and ensure your work is aligned with the needs of other squads and external stakeholders.
  • Familiarity with Kubernetes in AWS, GCP, or Azure, and exposure to infrastructure-as-code tooling (Helm, Terraform, Jsonnet, etc.).
  • Experience participating in blameless incident response and contributing to post-incident reviews.

Bonus Points For:

  • Experience with TypeScript/Node.js.
  • Experience with Kubernetes control-plane patterns, operators, reconcilers, or desired-state systems.
  • Experience with Jsonnet/Tanka, Terraform, Flux, Argo, or similar deployment/configuration tooling.
  • Experience working on SaaS provisioning, tenancy, regional expansion, plugin rollout, or customer lifecycle systems.
  • Experience with incident response involving configuration drift, partial failure, or cross-service state mismatch.

Remote Backend Engineer - Platform - Stacks | UK | Remote in Milton Keynes employer: Grafana Labs

Grafana Labs is an exceptional employer that champions a remote-first, collaborative culture, allowing engineers to thrive in an innovation-driven environment. With a commitment to open-source values and meaningful work, employees enjoy opportunities for professional growth while contributing to impactful projects that support a diverse global community. The company fosters transparency and autonomy, ensuring that every team member can make significant contributions to the success of Grafana Cloud.

Grafana Labs

Contact Details:

Grafana Labs Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Remote Backend Engineer - Platform - Stacks | UK | Remote in Milton Keynes

Tip Number 1

Network like a pro! Reach out to folks in your industry on LinkedIn or join relevant online communities. Engaging with others can lead to referrals and insider info about job openings that might not be advertised.

Tip Number 2

Prepare for interviews by practising common questions and scenarios related to backend engineering. We recommend doing mock interviews with friends or using platforms that offer interview prep to boost your confidence.

Tip Number 3

Showcase your skills through personal projects or contributions to open-source. This not only demonstrates your technical abilities but also your passion for the field, which is something companies like Grafana Labs really value.

Tip Number 4

Don’t hesitate to apply directly through our website! Even if you don’t meet every single requirement, if the role excites you, go for it. We love seeing enthusiastic candidates who are eager to learn and grow.

We think you need these skills to ace Remote Backend Engineer - Platform - Stacks | UK | Remote in Milton Keynes

Golang
SaaS Platform Experience
Distributed Systems Concepts
Microservices Development
Kubernetes
Infrastructure-as-Code Tooling
Incident Response

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter for the Backend Engineer role. Highlight your experience with SaaS platforms and distributed systems, and don’t forget to mention any relevant projects you've worked on that align with our mission at Grafana Labs.

Show Off Your Skills:When detailing your experience, focus on your proficiency in Golang and any work you've done with Kubernetes or infrastructure-as-code tools. We want to see how you can contribute to our team, so be specific about your technical skills and achievements!

Be Authentic:Don’t stress if you don’t meet every single requirement listed in the job description. If this role excites you, let us know why! Share your passion for open-source and how you can bring a unique perspective to our diverse team.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity. Plus, it shows you’re keen on joining our remote-first culture!

How to prepare for a job interview at Grafana Labs

Know Your Tech Stack

Make sure you’re familiar with the technologies mentioned in the job description, especially Golang and distributed systems concepts. Brush up on your knowledge of Kubernetes and infrastructure-as-code tools like Terraform or Helm, as these will likely come up during technical discussions.

Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've tackled complex systems problems or improved operational efficiency. Grafana Labs values engineers who can break down ambiguous requirements into clear, modular solutions, so be ready to demonstrate your thought process.

Emphasise Collaboration

Since this is a remote-first role, highlight your experience working asynchronously across teams. Share examples of how you’ve successfully collaborated with product, UX, and other departments to deliver projects, ensuring everyone’s needs are met.

Be Ready for Scenario Questions

Expect questions that assess your ability to handle real-world scenarios, such as incident response or configuration drift. Think about past experiences where you’ve had to respond to issues and how you ensured system reliability and clarity in your solutions.