At a Glance
- Tasks: Lead the development of containerisation solutions and empower engineering teams.
- Company: Join a dynamic tech company focused on innovation and collaboration.
- Benefits: Enjoy 25 days holiday, enhanced parental leave, private medical insurance, and wellness programs.
- Other info: Be part of a diverse team committed to inclusion and personal growth.
- Why this job: Shape the future of technology while mentoring a diverse team in a fast-paced environment.
- Qualifications: Proficient in cloud technologies, containerisation, and strong problem-solving skills required.
The predicted salary is between 80000 - 100000 ÂŁ per year.
This position is an exciting opportunity to help Partnerize scale its entire platform portfolio across on‑prem datacentres and AWS, including our legacy systems, the BrandVerity, Ascend and recently acquired Konnecto. You will lead the development of an enterprise on‑prem containerisation solution to shift our engineering culture toward a "you build it, you own it" model and enable rapid, independent deployment for our teams.
The Team
You will manage and develop a diverse group of technical generalists, specialists and junior engineers, acting as a player/coach who mentors, up‑skills and guides career paths as we transition to a DevOps‑centric operating model.
Operational Reality
The role operates in a fast‑paced, high‑velocity environment where you will shape the architectural future of the business. You will apply modern incident‑management frameworks to troubleshoot and manage tickets, ensuring all issues across our estate are resolved decisively and efficiently.
As a Lead SRE, You Will:
- Strategic & Operational Management
- Developer Empowerment & Containerisation – collaborate on the design, build and rollout of a robust containerisation strategy (Kubernetes/Docker) so Engineering teams can own code from build to deployment.
- Reliability & Error Budgets – define Service Level Indicators (SLIs), set Service Level Objectives (SLOs) and manage error budgets to balance feature velocity and platform stability.
- Hybrid Platform Engineering & Konnecto – build software and systems to manage infrastructure on‑prem and AWS, lead integration and modernisation of Konnecto’s data ingestion and AI layers.
- FinOps / Cloud Cost Optimisation – monitor and optimise cloud spend across hybrid environments while ensuring high performance and cost effectiveness.
- CI/CD Pipeline Responsibility – continuously improve delivery pipelines to facilitate rapid engineering velocity.
- Mentorship – deliver coaching sessions, act as a technical escalation point and foster knowledge sharing.
- Workload Management – scope incoming work, prioritise maintenance vs. project delivery and delegate tasks to ensure timely resolution.
- Design & Threat Modelling – produce production‑grade application security designs and perform threat modelling.
- Security Strategy – drive security improvements through planning, vulnerability assessments and testing.
- Toil Reduction – automate repetitive operational work, systematically engineering it out of existence.
- Act as the ultimate escalation point for complex incidents, lead blameless post‑mortems, conduct root cause analysis and drive metrics such as MTTR.
- Consulting & Planning – participate in system design, platform management, and capacity planning.
- Escalation Support – serve as escalation point for complex incidents while maintaining a high level of quality.
- On‑Call – participate in the on‑call rotation.
Essential Knowledge, Skills and Experience
Core Competencies
- Technical Ability – a highly proficient SME capable of applying technical methods, leading cultural shifts such as DevOps adoption and developing skills in colleagues.
- Problem Solving & Decision Making – make quick, decisive decisions, weighing options and applying methodical, innovative problem‑solving.
- Communication & Influence – effectively communicate initiatives to all stakeholders and secure buy‑in for transformational projects.
Technical Competencies
- Cloud, Hybrid & Containerisation – essential knowledge of hybrid architectures, AWS and on‑prem environments, and extensive hands‑on experience with Docker, Kubernetes, Argo Workflows.
- Konnecto Tech Stack & Data Pipelines – experience with MongoDB, Snowflake, clickdata streams, S3 ingestion and Airflow ETL.
- Programming & Automation – proficiency in Python or Bash, deep understanding of GitHub, AI coding tools and practices, Terraform and Ansible.
- Security & Observability – experience managing DevOps security, observability stacks such as Prometheus, Grafana and Loki.
- Operations & Troubleshooting – exceptional Linux administration skills, incident management expertise and ability to diagnose and resolve issues independently.
Desirable Knowledge, Skills and Experience
- Innovation & Debt Management – interest in new technologies and refactoring technical debt.
- Legacy Databases – strong experience with MySQL, PostgreSQL, Redis.
- Data Streaming – experience with Kafka, Druid and other streaming/queuing technologies.
- Web & Storage – knowledge of Nginx and storage technologies like Gluster.
UK Benefits & Perks
- 25 days holiday + bank holidays
- Enhanced parental leave: 6 months full pay for birth parent; 4 weeks full pay for non‑birth parent after 1 year employment
- 5 extra 'Partnerize Parental Days' each year
- Private medical insurance via Vitality
- Enhanced pension contributions
- Cycle to Work scheme
- Eye care vouchers
- Life assurance
- Enhanced wellness program – access to EAP, Wellness Coaching & Wellness Fridays
- Regular company events and activities
Our Commitment to Diversity & Inclusion
Partnerize is an equal‑opportunity employer and is committed to attracting, developing and advancing outstanding team members regardless of race, ethnic identity, sexual orientation, religion, age, gender, gender identity, physical abilities or any other dimension of diversity. We foster an environment where individuals can be authentic, raise concerns and innovate without fear.
Lead Site Reliability Engineer (Snowflake/Terraform/Linux) in Cambridge employer: Partnerize
Contact Detail:
Partnerize Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Lead Site Reliability Engineer (Snowflake/Terraform/Linux) in Cambridge
✨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. A friendly chat can lead to opportunities that aren’t even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects, especially those related to containerisation and cloud tech. It’s a great way to demonstrate your expertise beyond the CV.
✨Tip Number 3
Prepare for interviews by practising common SRE scenarios. Think about how you’d handle incidents or optimise cloud costs. We want to see your problem-solving skills in action!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive!
We think you need these skills to ace Lead Site Reliability Engineer (Snowflake/Terraform/Linux) in Cambridge
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Lead Site Reliability Engineer role. Highlight your experience with containerisation, cloud environments, and any relevant technical skills that match the job description. We want to see how you can contribute to our team!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about this role and how your background aligns with our goals at Partnerize. Don’t forget to mention your leadership style and how you empower teams.
Showcase Your Problem-Solving Skills: In your application, include examples of how you've tackled complex issues in past roles. We love candidates who can think on their feet and come up with innovative solutions, especially in fast-paced environments like ours.
Apply Through Our Website: We encourage you to apply directly through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we love seeing applications come in through our own platform!
How to prepare for a job interview at Partnerize
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like Docker, Kubernetes, and AWS. Brush up on your knowledge of Snowflake and Terraform too, as these will likely come up during technical discussions.
✨Showcase Your Leadership Skills
As a Lead Site Reliability Engineer, you'll be expected to mentor and guide others. Prepare examples of how you've successfully led teams or projects in the past, focusing on your ability to empower others and foster a collaborative environment.
✨Prepare for Incident Management Scenarios
Expect questions around incident management and troubleshooting. Think of specific incidents you've managed, how you approached them, and what the outcomes were. Highlight your experience with post-mortems and root cause analysis.
✨Communicate Clearly and Confidently
Effective communication is key in this role. Practice explaining complex technical concepts in simple terms, as you’ll need to secure buy-in from various stakeholders. Be ready to discuss how you’ve influenced change in previous roles.