Manager, System and Platform Operations

Manager, System and Platform Operations

Full-Time 60000 - 80000 € / year (est.) Home office (partial)
UNAVAILABLE

At a Glance

  • Tasks: Lead the support and reliability of Epsilon's production systems in a dynamic digital marketing environment.
  • Company: Join Epsilon, a key player in multi-channel marketing services and technologies.
  • Benefits: Enjoy competitive pay, great benefits, and hybrid working options at a vibrant office.
  • Other info: Be part of an inclusive team dedicated to professional growth and innovation.
  • Why this job: Make a real impact in digital marketing while solving complex challenges with cutting-edge technology.
  • Qualifications: 5+ years in Site Reliability, strong tech skills, and leadership experience required.

The predicted salary is between 60000 - 80000 € per year.

A subsidiary of Publicis Groupe, Epsilon is a leading provider of multi‑channel marketing services, technologies, and database solutions. We do more than collect and store data, and we might be the most important Internet company you’ve never heard of. Join our team for your chance to work in the digital marketing space and solve meaningful problems on a massive scale—and have fun doing it.

The System and Platform Operations Manager is a technical leadership role that is responsible for the support, reliability and stability of Epsilon Retail Media production systems, environments, and offerings. The team owns the reliability vision for the company, driving continuous improvement through a combination of development and operations initiatives as well as process excellence. This position and its team have solid‑line responsibility for operations including deployment, management, monitoring, reporting, troubleshooting, and repair of production systems. Core to the success of the role is to provide a premium customer support experience focused on a “center of excellence” that allows for a full‑service delivery support cycle.

This role is responsible for managing the Platform Operations Team centralized within a single geo‑region, orchestrating regional teamwork, serving with both technical and professional support, and championing the company values. The Platform Operations Engineer works closely with the Engineering team to ensure ongoing system stability and supports the Technical Account Managers from an environmental perspective. The Platform Operations team is responsible for supporting all retailers once they are live. Critically important is how this team collaborates and liaises with other teams such as Customer Support, Technical Account Management, Engineering and Customer Success teams.

Responsibilities

What you’ll do:

  • Establish and manage operational practices and ensure we design, implement and operate a support model that is fit for purpose for our future.
  • Adopt a “Measure Everything” approach to ensure that internal service level objectives and customer service level agreements are exceeded, including executive‑level reporting on operational health metrics such as SLAs, incident resolution, performance, availability, reliability, capacity, etc.
  • Take ownership of complex issues related to performance, reliability, and scalability and lead resolution of serious incidents and events, including communications with customers and wider stakeholders.
  • Provide insight and expertise on how customers will perceive the changes or impacts to customers to drive customer organization change management and communication.
  • Empower the Delivery teams to release new products, features, updates and fixes quickly, while ensuring platforms remain reliable and stable.
  • Work with the wider Engineering, Product, Delivery and Security teams to ensure that appropriate attention is given to production/system reliability.
  • Identify the capabilities needed to meet current and emerging business needs of a significant function.
  • As subject‑matter expert on the team, maintain understanding of current technology, database management, reliability practices, and future trends through ongoing education, conference attendance and industry press.
Qualifications

Who You Are:

  • At least 5 years of hands‑on experience in Site Reliability focused positions.
  • Strong knowledge of containerization technologies (Docker, Kubernetes).
  • Experience with infrastructure as code (Terraform).
  • Solid understanding of networking, security, and system architecture.
  • Proficient in scripting languages (Java, Golang, Python, Bash, or similar).
  • Experience with monitoring and observability tools (DataDog, Prometheus, Grafana).
  • Knowledge of database management systems (PostgreSQL, Bigtable).
  • Understanding of API and microservices architecture.
  • Strong people‑leadership skills with at least a year of leading and driving high‑performance technical teams.
  • Experience with operations teams within enterprise environments with knowledge of DevOps, ITIL, Cloud Services, IT Infrastructure and Operations supporting and maintaining production and development environments and building cloud services that are secure, reliable, scalable and observable.
  • Experience with establishing Service Delivery strategies that align to new ways of work methods, including Agile.
  • Experience establishing and delivering IT support services in a high‑availability (HA) environment such as 24/7 operations.

We know that we have some of the brightest and most talented employees in the world, and we believe in rewarding them accordingly. If you work here, expect competitive compensation, a great benefits package and endless opportunities to advance your career. We offer hybrid working opportunities, with our office space located in the Iconic Television Centre, White City.

As part of our dedication to enhance our inclusive and diverse workforce, Epsilon is committed to equal access to opportunity for people without regard to race, age, sex, disability, neurodiversity, sexual orientation, gender identity, pregnancy and maternity, marriage and civil partnership, or religion or belief. We are committed to providing reasonable adjustments for candidates in our application process.

Manager, System and Platform Operations employer: UNAVAILABLE

Epsilon is an exceptional employer that fosters a dynamic work culture focused on innovation and collaboration, particularly in the vibrant setting of the Iconic Television Centre, White City. Employees benefit from competitive compensation, a comprehensive benefits package, and ample opportunities for professional growth, all while being part of a diverse and inclusive team dedicated to excellence in digital marketing solutions.

UNAVAILABLE

Contact Detail:

UNAVAILABLE Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Manager, System and Platform Operations

Tip Number 1

Network like a pro! Reach out to current employees at Epsilon on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for landing the Manager, System and Platform Operations role.

Tip Number 2

Prepare for the interview by brushing up on your technical skills. Make sure you can confidently discuss containerization technologies and scripting languages. We want to see that you can handle the nitty-gritty of the role!

Tip Number 3

Showcase your leadership experience! Be ready to share examples of how you've led high-performance teams in the past. We love candidates who can inspire and drive their teams to success.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, it shows you’re serious about joining our awesome team at Epsilon.

We think you need these skills to ace Manager, System and Platform Operations

Site Reliability Engineering
Containerization Technologies (Docker, Kubernetes)
Infrastructure as Code (Terraform)
Networking
Security
System Architecture
Scripting Languages (Java, Golang, Python, Bash)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the role of Manager, System and Platform Operations. Highlight your experience with containerization technologies and any relevant leadership roles you've held. We want to see how your skills align with our needs!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about digital marketing and how your background makes you a perfect fit for our team. Don’t forget to mention your experience in high-availability environments!

Showcase Your Technical Skills:In your application, be sure to showcase your technical skills, especially in scripting languages and monitoring tools. We love candidates who can demonstrate their hands-on experience and knowledge in these areas, so don’t hold back!

Apply Through Our Website:We encourage you to apply through our website for a smoother application process. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team!

How to prepare for a job interview at UNAVAILABLE

Know Your Tech Inside Out

Make sure you brush up on your knowledge of containerization technologies like Docker and Kubernetes, as well as infrastructure as code tools like Terraform. Be ready to discuss how you've used these in past roles, especially in relation to system reliability and performance.

Showcase Your Leadership Skills

Since this role involves managing a team, be prepared to share examples of how you've led high-performance technical teams. Highlight your people-leadership skills and any strategies you've implemented to empower your team and drive success.

Understand the Customer Perspective

Epsilon values a premium customer support experience, so think about how you've previously managed customer communications during incidents. Be ready to explain how you ensure customers perceive changes positively and how you handle change management.

Prepare for Scenario-Based Questions

Expect questions that assess your problem-solving abilities, especially around complex issues related to performance and reliability. Prepare scenarios where you've successfully resolved serious incidents, detailing your approach and the outcomes.