Technical Operations Manager - AI in London
Technical Operations Manager - AI

Technical Operations Manager - AI in London

London Full-Time 60000 - 80000 ÂŁ / year (est.) No home office possible
E

At a Glance

  • Tasks: Lead the operations of a cutting-edge Technical Operation Centre focused on AI infrastructure.
  • Company: Join Era4, a mission-driven start-up transforming energy sites into modern data centres.
  • Benefits: Enjoy autonomy, high visibility with leadership, and the chance to shape operations at scale.
  • Other info: Be part of a dynamic team driving innovation in critical national infrastructure.
  • Why this job: Make a real impact in AI infrastructure while promoting renewable energy solutions.
  • Qualifications: Experience in infrastructure operations, SRE, or managed services is essential.

The predicted salary is between 60000 - 80000 ÂŁ per year.

Era4 develops, owns and operates AI infrastructure across the UK, powered by renewable energy. Converting legacy industrial and energy sites into modern data‑centre facilities, Era4 is combining brownfield regeneration opportunities with cleaner, efficient, scalable compute capacity for healthcare, research, finance, enterprise, and public‑sector organisations.

The Technical Operations Manager is responsible for the implementation and day‑to‑day running of a new, greenfield Technical Operation Centre, encompassing Client Support, SRE / AIOps and Automation. This role ensures that Era4’s sovereign AI/HPC infrastructure is supported, monitored and delivered to contracted SLA targets from day one.

You will design and shape the function, embed SLO‑driven thinking, agentic approaches, own escalation pathways, and translate complex infrastructure events into clear customer communications. This is a foundational role with a direct line to leadership and genuine scope to shape how the function operates at scale.

Key Responsibilities
  • Own the end‑to‑end operational performance of the 24x7 Operations Centre: incident management, change management and problem management.
  • Serve as the primary escalation point for P1/P2 incidents, providing incident command and coordinating resolution across SRE, Service Desk, DevOps partners and third‑party vendors.
  • Maintain and continuously improve operational runbooks, SOPs and the post‑mortem‑to‑runbook learning pipeline.
  • Lead regular operational reviews (weekly, monthly) and produce management‑ready performance reports covering MTTR, SLA adherence, error budget consumption and incident trends.
  • Manage on‑call rotas and escalation schedules across the Operations Centre; coordinate overnight cover and hand‑off procedures.
  • Own the change advisory process, ensuring all infrastructure changes are risk‑assessed, scheduled and communicated appropriately.
  • Line‑manage the Service Desk function: own ticket triage workflows, SLA timers, first‑contact resolution targets and customer communication standards within the ITSM.
  • Champion a customer‑first culture across Service Desk.
  • Own the SLO/error budget framework at a programme level: hold the team accountable to error budget targets, use burn‑rate data to drive prioritisation decisions and escape when automation investment needs to be throttled or accelerated.
  • Provide operational context in sprint planning and backlog prioritisation; ensure the SRE team’s roadmap is anchored to customer experience, customer‑impacting risk reduction and compliance milestones, not engineering preference alone.
  • Manage and develop 3rd‑party integrations, at both a Service and Technical level.
Required Experience & Skills
  • Comfortable and confident dealing directly with clients, from technical support tickets to service reviews with senior leadership.
  • Proven background within infrastructure operations, HPC, SRE, NOC, managed services or equivalent mission‑critical environment in a management or senior lead role.
  • Demonstrated experience across at least two of the three domains: NOC/incident operations, service management (ITSM, SLA governance) and SRE/platform engineering, with sufficient working knowledge of the third to operate effectively as an escalation point.
  • Working knowledge of observability tooling, Grafana, Prometheus or equivalent; able to read dashboards, interrogate alert logic and hold meaningful conversations with engineering teams and third‑party vendors.
  • Fluency with SLA/SLO frameworks: designing, implementing and reporting against contractual and internal service targets.
  • Strong Linux, container and infrastructure knowledge, specifically supporting GPU and HPC workloads in production.
One or More Would Be An Advantage
  • Operational experience with GPU infrastructure (NVIDIA HGX, DGX, InfiniBand) or AI/HPC compute environments.
  • Familiarity with DCGM Exporter, GPU telemetry or equivalent high‑density compute monitoring.
  • Experience with integration and automation into ticket platforms (Halo, ServiceNow, Freshservice or equivalent) and ITIL‑based incident, problem and change management.
  • Hands‑on experience with GitLab, GitOps workflows and infrastructure‑as‑code (Terraform, Ansible or AWX).
  • Exposure to agentic remediation / AIOps tooling, automated alerting, event correlation or self‑healing runbooks.
  • Exposure to one or more of Python, Go, Bash, PromQL.
  • Experience in a data centre, hosting, cloud, colocation, managed hosting or sovereign cloud environment.

Why Join Era4: You’ll be joining a mission‑driven start‑up building critical national infrastructure, where operational excellence directly enables growth. This role offers high visibility with leadership, real autonomy and the chance to shape how a next‑generation company operates at scale.

Technical Operations Manager - AI in London employer: Era4

Era4 is an exceptional employer, offering a unique opportunity to lead the development of a greenfield Technical Operations Centre in the UK, where innovation meets sustainability. With a strong focus on operational excellence and a customer-first culture, employees are empowered to shape their roles and contribute to meaningful projects that support critical national infrastructure. The company fosters a collaborative work environment, providing ample opportunities for professional growth and development while championing renewable energy initiatives.
E

Contact Detail:

Era4 Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Technical Operations Manager - AI in London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

✨Tip Number 2

Prepare for interviews by practising common questions and scenarios related to Technical Operations. Think about how your experience aligns with the role at Era4 and be ready to showcase your problem-solving skills.

✨Tip Number 3

Don’t just wait for job postings! Be proactive and reach out directly to companies you’re interested in, like Era4. Express your enthusiasm for their mission and how you can contribute to their AI infrastructure goals.

✨Tip Number 4

Follow up after interviews with a thank-you note. It’s a simple gesture that shows your appreciation and keeps you fresh in their minds. Plus, it’s a great opportunity to reiterate your interest in the role!

We think you need these skills to ace Technical Operations Manager - AI in London

Client Support
SRE / AIOps
Automation
Incident Management
Change Management
Problem Management
Operational Performance Management
Service Desk Management
SLA/SLO Frameworks
Observability Tooling (Grafana, Prometheus)
Linux Knowledge
GPU Infrastructure Support
Integration and Automation into Ticket Platforms
Infrastructure-as-Code (Terraform, Ansible)
Programming (Python, Go, Bash, PromQL)

Some tips for your application 🫡

Tailor Your Application: Make sure to customise your CV and cover letter to highlight your experience in infrastructure operations and SRE. We want to see how your skills align with the role of Technical Operations Manager, so don’t hold back on showcasing your relevant achievements!

Showcase Your Technical Skills: Don’t forget to mention your hands-on experience with tools like Grafana, Prometheus, and any automation frameworks you’ve used. We’re looking for someone who can dive into the technical details, so let us know what you’ve worked with!

Communicate Clearly: Since this role involves translating complex infrastructure events into clear customer communications, make sure your application reflects your ability to communicate effectively. Use straightforward language and avoid jargon where possible.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at Era4!

How to prepare for a job interview at Era4

✨Know Your Tech Inside Out

Make sure you brush up on your knowledge of observability tools like Grafana and Prometheus. Be ready to discuss how you've used these in past roles, especially in relation to incident management and SRE practices.

✨Showcase Your Leadership Skills

Since this role involves managing the Service Desk function, prepare examples of how you've led teams in high-pressure environments. Highlight your experience with incident command and how you've coordinated resolutions across different teams.

✨Understand SLA/SLO Frameworks

Familiarise yourself with SLA and SLO concepts, as you'll need to demonstrate your ability to design and report against these targets. Be prepared to discuss how you've implemented these frameworks in previous roles.

✨Communicate Clearly and Confidently

As a Technical Operations Manager, clear communication is key. Practice explaining complex technical issues in simple terms, as you'll need to translate infrastructure events into customer communications effectively.

Technical Operations Manager - AI in London
Era4
Location: London

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>