Job Board

Companies

Universal Music Group

Service Reliability Eng – Kings Cross, London

London Full-Time 36000 - 60000 £ / year (est.) No home office possible

Apply now

At a Glance

Tasks: Ensure the reliability and performance of critical systems that connect artists and fans.
Company: Join Universal Music, the world's leading music company with a passion for innovation.
Benefits: Inclusive culture, career growth opportunities, and a chance to work in the music industry.
Why this job: Make a real impact on global music services while working with cutting-edge technology.
Qualifications: Experience in systems administration and proficiency in programming languages like Python or Java.
Other info: Dynamic team environment with a commitment to diversity and inclusion.

The predicted salary is between 36000 - 60000 £ per year.

Music is Universal It’s the passionate and dedicated team at Universal Music who help make us the world’s leading music company. From A&R to finance, legal to digital, sales to marketing, Universal Music is the place to grow and develop your career within a truly commercial and innovative business that leads in everything it does.Everyone is welcome to apply for our roles, and we are determined to ensure that no applicant or employee receives less favourable treatment because of gender, race, disability, sexual orientation, religion, belief, age, marital status, background, pregnancy, or caring responsibilities. We also recognise the importance of diversity of thought within our teams and are fully committed to embracing the talents of people with autism, dyslexia, ADHD, and other forms of neurocognitive will always seek to make appropriate adjustments to recruitment, workplaces, and work processes to be fully inclusive to people with different needs and working styles. If you need us to make any reasonable adjustments for you from application onwards, including alternatives to the online form or to disclose a neurocognitive condition, please email .**Job Summary:**We are UMG, the Universal Music Group. We are the world’s leading music company. In everything we do, we are committed to artistry, innovation and entrepreneurship. We own and operate a broad array of businesses engaged in recorded music, music publishing, merchandising, and audiovisual content in more than 60 countries. We identify and develop recording artists and songwriters, and we produce, distribute and promote the most critically acclaimed and commercially successful music to delight and entertain fans around the world.As a key member of our Global Technical Operations team, you will be responsible for the reliability, scalability, and performance of the critical systems that power a global enterprise. By blending a software engineering mindset with operational expertise, you will engineer solutions that improve system reliability, automate complex processes, and reduce manual toil. You will be an essential partner to our development, infrastructure, and security teams, driving a culture of resilience and continuous improvement across the a Site Reliability Engineer, you won\’t just be supporting systems; you\’ll be ensuring the services that connect artists and fans around the globe are always on.**Job Functions:**Key Responsibilities:System Reliability & Performance:* Design, build, and maintain the availability, scalability, and performance of critical services.* Develop and maintain robust monitoring, alerting, and observability systems (e.g., using AWS CloudWatch, Dynatrace) to ensure rapid issue detection and resolution.* Monitor infrastructure capacity and performance, providing analysis and suggestions for service delivery improvement.Automation & Efficiency:* Drive the automation of repetitive operational tasks, including infrastructure provisioning, deployments, and scaling.* Create and maintain scripts and custom code to support and enhance our operational toolset.* Support and optimize CI/CD pipelines to improve deployment speed and reliability.Incident Management & Collaboration:* Participate in an on-call rotation to troubleshoot and mitigate production incidents.* Lead post-incident reviews and root cause analyses to implement lasting solutions.* Partner with engineering and IT stakeholders to embed SRE best practices (SLOs, error budgets) into the design and development lifecycle.**Job Requirements:**Required Experience & Skills:* A strong background in systems administration (Linux/Windows) in a large-scale environment.* Proficiency in at least one programming language (e.g., Python, Go, Java).* Hands-on experience with a major cloud platform (AWS, GCP, or Azure), with a high preference for AWS.* Solid understanding of networking, containers (Docker, Kubernetes), and Infrastructure as Code (e.g., Terraform, Ansible).* Experience with modern monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk, Dynatrace).* Proven analytical and problem-solving abilities with experience in a high-pressure environment.* Excellent communication skills and the ability to foster a collaborative team environment.Preferred Experience & Skills:* Bachelor\’s degree in an IT-related field.* Experience managing large-scale, distributed systems for a global organization.* Familiarity with IT governance standards like ITIL.* Direct experience with ServiceNow for IT service management.* Knowledge of chaos engineering, resilience testing, and advanced capacity planning.Just So You Know…The company presents this job description as a guide to the major areas and duties for which the jobholder is accountable. However, the business operates in an environment that demands change and the jobholder\’s specific responsibilities and activities will vary and develop. Therefore, the job description should be seen as indicative and not as a permanent, definitive, and exhaustive statement.## **Job Category:**Universal Music Group #J-18808-Ljbffr

Service Reliability Eng – Kings Cross, London employer: Universal Music Group

At Universal Music, we pride ourselves on being an inclusive and innovative employer, offering a vibrant work culture that fosters creativity and collaboration. Located in the heart of Kings Cross, London, our team enjoys access to diverse career growth opportunities within the dynamic music industry, alongside comprehensive benefits that support both personal and professional development. Join us to be part of a passionate community dedicated to connecting artists and fans worldwide while embracing diversity and neurocognitive variation.

Contact Detail:

Universal Music Group Recruiting Team

View Universal Music Group Profile

StudySmarter Expert Advice 🤫

We think this is how you could land Service Reliability Eng – Kings Cross, London

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend events, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

✨Tip Number 2

Prepare for interviews by researching the company and its culture. Understand their values and how they align with your own. This will help you stand out and show that you're genuinely interested in being part of their team.

✨Tip Number 3

Practice your technical skills! For a role like Service Reliability Engineer, brush up on your programming languages and cloud platforms. Consider doing mock interviews or coding challenges to get comfortable with the types of questions you might face.

✨Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're serious about joining the Universal Music family. Good luck!

We think you need these skills to ace Service Reliability Eng – Kings Cross, London

Systems Administration (Linux/Windows)

Programming (Python, Go, Java)

Cloud Platform Experience (AWS, GCP, Azure)

Networking Knowledge

Container Management (Docker, Kubernetes)

Infrastructure as Code (Terraform, Ansible)

Monitoring and Observability Tools (Prometheus, Grafana, Datadog, Splunk, Dynatrace)

Analytical Skills

Problem-Solving Skills

Communication Skills

Collaboration Skills

Incident Management

CI/CD Pipeline Optimization

ServiceNow for IT Service Management

IT Governance Standards (ITIL)

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the Service Reliability Engineer role. Highlight your experience with systems administration, cloud platforms, and any relevant programming skills. We want to see how your background aligns with what we do at Universal Music!

Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Share your passion for music and technology, and explain why you’re excited about the opportunity to work with us. Let’s see your personality come through while keeping it professional.

Showcase Your Problem-Solving Skills: In your application, don’t forget to mention specific examples of how you've tackled challenges in high-pressure environments. We love seeing analytical minds at work, so share those stories that demonstrate your problem-solving abilities!

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, you’ll find all the details you need about the role and our company culture there!

How to prepare for a job interview at Universal Music Group

✨Know Your Tech Inside Out

Make sure you brush up on your systems administration skills, especially with Linux and Windows. Be ready to discuss your experience with cloud platforms like AWS, and don’t forget to highlight any hands-on work you've done with monitoring tools like Dynatrace or Prometheus.

✨Showcase Your Problem-Solving Skills

Prepare to share specific examples of how you've tackled complex issues in high-pressure environments. Think about incidents you've managed and how you led post-incident reviews to implement lasting solutions. This will demonstrate your analytical abilities and resilience.

✨Emphasise Collaboration

Universal Music values teamwork, so be ready to talk about how you've partnered with engineering and IT teams in the past. Highlight any experiences where you’ve embedded SRE best practices into projects, as this shows you understand the importance of collaboration in achieving system reliability.

✨Be Ready for Technical Questions

Expect technical questions that test your knowledge of programming languages like Python or Go, and your understanding of containers and Infrastructure as Code. Practise explaining your thought process clearly, as communication is key in this role.

Service Reliability Eng – Kings Cross, London

Universal Music Group

Location: London

Apply now

Service Reliability Eng – Kings Cross, London

At a Glance

Service Reliability Eng – Kings Cross, London employer: Universal Music Group

StudySmarter Expert Advice 🤫

✨Tip Number 1

✨Tip Number 2

✨Tip Number 3

✨Tip Number 4

We think you need these skills to ace Service Reliability Eng – Kings Cross, London

Some tips for your application 🫡

How to prepare for a job interview at Universal Music Group

Service Reliability Eng – Kings Cross, London

Land your dream job quicker with Premium