Site Reliability Engineer in Manchester

Manchester · Full-Time · £49,600 - £74,400 / year (est.) · Home office (partial)

At a Glance

  • Tasks: Design self-healing systems and optimise cloud infrastructure for high performance.
  • Company: Join Matillion, a global leader in data solutions with a collaborative culture.
  • Benefits: Enjoy flexible working, 30 days holiday, health insurance, and company equity.
  • Why this job: Be at the forefront of AI-driven operations and make a real impact.
  • Qualifications: Experience with Kubernetes, cloud services, and programming skills are essential.
  • Other info: Diverse and inclusive environment with excellent career growth opportunities.

The predicted salary is between £49,600 and £74,400 per year.

About the Role

We are looking for a Site Reliability Engineer who views "manual effort" as a bug to be fixed. In this role, you won’t just be keeping the lights on; you will be the architect of our system’s resilience. We need a proactive engineer who is obsessed with Kubernetes and Cloud infrastructure, but also has a visionary streak—someone eager to experiment with AI-driven operations (AIOps) to predict failures and automate responses. If you enjoy building self-healing systems and staying ahead of the tech curve, this is the place for you.

What you will be doing

  • Engineering Reliability: Designing and implementing self-healing infrastructure using Kubernetes to maintain high uptime and system integrity.
  • Scaling Cloud Ecosystems: Optimizing our cloud footprint (AWS/GCP/Azure) to ensure our platforms can handle rapid growth without breaking a sweat.
  • Innovating with AI: Proactively identifying opportunities to integrate AI tools into our observability stack to automate incident detection and root-cause analysis.
  • Eliminating Toil: Writing clean, efficient code to automate repetitive operational tasks, turning manual workflows into seamless "set and forget" processes.
  • Defining Observability: Building advanced monitoring and alerting frameworks that provide deep insights into system health and performance.

What we are looking for

  • Kubernetes Power User: Extensive experience managing production-grade K8s environments, including ingress, service mesh, and container security.
  • Cloud Infrastructure Expert: A deep understanding of cloud networking, storage, and compute services within a major provider (AWS, Azure, or GCP).
  • Proactive Mindset: An engineer who doesn’t wait for a ticket; you naturally seek out system weaknesses and build solutions to strengthen them.
  • AI Curiosity: An active interest in the AI landscape and a desire to leverage LLMs or machine learning to improve SRE workflows.
  • Programming Literacy: Ideally experience with at least one language (such as Java, Python, Go, or Ruby) to bridge the gap between software engineering and operations.

At Matillion, we are committed to providing competitive salaries in line with market standards. Our estimated compensation range for this position is £49,600 - £74,400, but the final salary will be based on your relevant skills, experience and qualifications demonstrated in the hiring process.

At Matillion, we’re here to do something hard - change the way the world works with data, and build a great company along the way. Big, bold goals aren’t for the faint-hearted, and we don’t shy away from them. But we don’t do it alone. No egos, no politics - just great people working together, guided by our six core values:

  • Confidence without arrogance
  • Working with integrity
  • Customer obsessed
  • Innovate and demand quality
  • Bias for action
  • We care

We operate a flexible working culture that promotes work-life balance, with benefits including:

  • Company Equity
  • 30 days holiday + bank holidays
  • 5 days paid volunteering leave
  • Health insurance
  • Life Insurance
  • Pension
  • Access to mental health support

More about Matillion

Thousands of enterprises including Cisco, London Stock Exchange Group, EDF and Slack trust Matillion for a wide range of use cases, from insights and operational analytics to data science, machine learning and AI. We are a truly global workforce, dual-headquartered in Manchester, UK and Denver, Colorado, with expanding offices in Hyderabad, India, along with valuable remote colleagues around the world.

We are keen to hear from prospective Matillioners, so even if you don’t feel you match all the criteria please apply and a member of our Talent Acquisition team will be in touch. Alternatively, if you’re interested in Matillion but don’t see a suitable role, please email talent@matillion.com.

Matillion is an equal opportunity employer. We celebrate diversity and we are committed to creating an inclusive environment for all of our team. Matillion prohibits discrimination and harassment of any type. Matillion does not discriminate on the basis of race, colour, religion, age, sex, national origin, disability status, genetics, sexual orientation, gender identity or expression, or any other characteristic protected by law.

Site Reliability Engineer in Manchester employer: Matillion

At Matillion, we pride ourselves on being an exceptional employer that fosters a culture of innovation and collaboration. Our flexible working environment promotes a healthy work-life balance, complemented by generous benefits such as company equity, extensive holiday allowances, and access to mental health support. With a commitment to employee growth and a focus on cutting-edge technologies like AI and Kubernetes, we empower our Site Reliability Engineers to thrive in their roles while contributing to transformative projects that shape the future of data management.

Contact Details:

Matillion Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Site Reliability Engineer role in Manchester

✨Tip Number 1

Network like a pro! Reach out to current or former employees at Matillion on LinkedIn. A friendly chat can give you insider info and might just get your foot in the door.

✨Tip Number 2

Show off your skills! If you’ve got a GitHub or personal project showcasing your Kubernetes or AI-driven solutions, share it during interviews. It’s a great way to demonstrate your hands-on experience.

✨Tip Number 3

Prepare for technical challenges! Brush up on your coding skills and be ready to tackle some real-world problems during the interview. We love seeing how you think on your feet!

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we’re always on the lookout for proactive engineers like you!

We think you need these skills to ace the Site Reliability Engineer role in Manchester

Kubernetes
Cloud Infrastructure (AWS/GCP/Azure)
AI-driven Operations (AIOps)
Incident Detection Automation
Root-Cause Analysis
Programming (Java, Python, Go, Ruby)
Observability Frameworks
System Resilience Engineering
Monitoring and Alerting
Proactive Problem-Solving
Container Security
Cloud Networking
Storage and Compute Services
Self-Healing Systems
Efficiency in Code Writing

Some tips for your application 🫡

Show Your Passion for Reliability: When writing your application, let us see your enthusiasm for Site Reliability Engineering. Share examples of how you've tackled manual processes and turned them into automated solutions. We love seeing candidates who are proactive and eager to innovate!

Highlight Your Kubernetes Expertise: Make sure to showcase your experience with Kubernetes in your application. Whether it's managing production-grade environments or implementing service meshes, we want to know how you've used K8s to enhance system resilience. Don't hold back on the details!

Demonstrate Your AI Curiosity: If you've dabbled in AI-driven operations or have ideas on integrating AI tools into observability, mention it! We’re looking for someone who’s excited about leveraging technology to improve workflows. Let us know how you stay ahead of the tech curve.

Keep It Clean and Concise: While we appreciate detail, clarity is key! Make sure your application is well-structured and easy to read. Use bullet points where necessary and keep your language straightforward. Remember, we want to get to know you quickly, so make every word count!

How to prepare for a job interview at Matillion

✨Know Your Kubernetes Inside Out

Make sure you can talk confidently about your experience with Kubernetes. Be ready to discuss specific projects where you've managed production-grade K8s environments, including any challenges you faced and how you overcame them.

✨Show Off Your Cloud Savvy

Brush up on your knowledge of cloud infrastructure, especially AWS, GCP, or Azure. Prepare to explain how you've optimised cloud ecosystems in the past and how you would approach scaling for rapid growth.

✨Embrace AI and Automation

Since this role involves integrating AI into operations, be prepared to share your thoughts on AI-driven tools. Discuss any relevant experience you have with machine learning or LLMs, and how you envision using them to enhance SRE workflows.

✨Demonstrate a Proactive Mindset

Highlight your ability to identify system weaknesses before they become issues. Share examples of how you've proactively built solutions to strengthen system reliability, showcasing your problem-solving skills and initiative.

