Sr. Observability Engineer

Full-Time, £70,000 - £90,000 / year (est.), no working from home possible
Dormont Manufacturing Co

At a Glance

  • Tasks: Lead the design and implementation of observability strategies for critical IT systems.
  • Company: Join Universal Music Group, the world's leading music company with a vibrant culture.
  • Benefits: Enjoy competitive salary, health benefits, and opportunities for professional growth.
  • Other info: Inclusive environment that values diversity and encourages continuous learning.
  • Why this job: Make an impact in the music industry while working with cutting-edge technology.
  • Qualifications: 5-7 years in Observability or SRE roles with strong technical leadership skills.

The predicted salary is between £70,000 and £90,000 per year.

Music is Universal. It’s the passionate and dedicated team at Universal Music who help make us the world’s leading music company. From A&R to finance, legal to digital, sales to marketing, Universal Music is the place to grow and develop your career within a truly commercial and innovative business that leads in everything it does.

Everyone is welcome to apply for our roles, and we are determined to ensure that no applicant or employee receives less favourable treatment because of gender, race, disability, sexual orientation, religion, belief, age, marital status, background, pregnancy, or caring responsibilities. We also recognise the importance of diversity of thought within our teams and are fully committed to embracing the talents of people with autism, dyslexia, ADHD, and other forms of neurocognitive variation. We will always seek to make appropriate adjustments to recruitment, workplaces, and work processes to be fully inclusive to people with different needs and working styles.

If you need us to make any reasonable adjustments for you from application onwards, including alternatives to the online form or to disclose a neurocognitive condition, please email UniversalMusicCareers@umusic.com.

We are UMG, the Universal Music Group. We are the world’s leading music company. In everything we do, we are committed to artistry, innovation and entrepreneurship. We own and operate a broad array of businesses engaged in recorded music, music publishing, merchandising, and audiovisual content in more than 60 countries. We identify and develop recording artists and songwriters, and we produce, distribute and promote the most critically acclaimed and commercially successful music to delight and entertain fans around the world.

As a Senior Observability Engineer, you will be a driving force for technical excellence and strategic vision within our global team. You will be instrumental in architecting, building, and leading our comprehensive observability strategy to ensure the reliability, performance, and scalability of our critical IT systems. This senior role demands a passion for data-driven strategy, a commitment to automation, and the ability to mentor and lead. You will not only solve complex technical challenges but also influence the direction of observability practices across UMG globally, ensuring our technology landscape is as world-class as our music.

Job Functions

  • Architecture & Strategy: Lead the architectural design and strategic roadmap for our observability stack. Drive the vision for world-class monitoring, logging, tracing, and alerting solutions across our hybrid and cloud-native environments.
  • Innovate & Automate: Spearhead the evaluation, selection, and implementation of cutting-edge observability tools and platforms (e.g., Dynatrace, OpenTelemetry, Prometheus, Grafana). Architect and build robust, automated observability pipelines. Take an active part in documenting and defining processes and best practice.
  • Optimize & Analyze: Conduct deep-dive analysis of telemetry data to proactively identify performance bottlenecks, optimize resource utilization, and guide capacity planning.
  • Lead & Mentor: Act as a technical leader and mentor for the observability team and wider engineering groups. Champion and enforce best practices, fostering a culture of proactive and data-informed decision-making.
  • Drive Incident & Problem Management: Work with Operations teams on high-priority incident resolution efforts, utilising deep analysis of telemetry data for swift root cause identification. Drive post-incident reviews and implement long-term solutions to enhance system resilience.
  • Collaborate & Influence: Partner with Development, SRE, and Infrastructure leaders to embed observability into the entire technology lifecycle. Influence and drive the adoption of observability best practices across the global organization.
  • Make UMG the place to be: Mentor, manage and genuinely lead the Observability team in a way that attracts and retains the best talent. UMG is a place where everyone can bring themselves fully to work and thrive; as a Leader you are a key part of this.

Job Requirements

Essential Qualifications

  • Experience: 5-7+ years of hands-on experience in an Observability, Site Reliability Engineering (SRE), or DevOps role, with a proven track record of leading complex projects.
  • Technical Leadership: Demonstrated experience in architecting and designing large-scale monitoring and observability solutions.
  • Expert-Level Tooling: Deep expertise with modern observability platforms (e.g., Dynatrace, AWS CloudWatch, Prometheus, Grafana, ELK Stack, Splunk, OpenTelemetry).
  • Cloud & Infrastructure: Advanced knowledge of major cloud platforms (AWS, Azure, GCP), containerization (Docker, Kubernetes), and Infrastructure as Code (Terraform, Ansible).
  • Programming & Automation: Strong programming and scripting skills (e.g., Python, Go, Shell) with a focus on creating scalable automation and custom tooling.
  • Problem-Solving: Exceptional analytical and strategic problem-solving skills, with the ability to lead through complex technical challenges.
  • Data Analysis: Expertise in analysing and visualising telemetry data into meaningful information to drive actions.
  • Hands-on: Demonstrable hands-on engineering and coding experience, ability to deep-dive into existing and emerging technologies to identify opportunities and solutions.
  • Containerization and Orchestration: Understanding of container technologies (e.g., Docker) and container orchestration platforms (e.g., Kubernetes) to monitor and manage containerized applications.
  • Networking Knowledge: Understanding of networking principles and protocols to effectively monitor and troubleshoot network-related issues.
  • Security Awareness: Awareness of security best practices and the ability to integrate security monitoring into observability processes.
  • Communication & Influence: Excellent communication and interpersonal skills, capable of articulating a technical vision to diverse audiences and influencing senior stakeholders. Ability to collaborate with cross-functional teams, convey findings, and discuss improvements with developers and operations teams.
  • Continuous Learning: Given the dynamic nature of technology, a commitment to continuous learning and staying updated on the latest trends in observability and monitoring. Self-motivated with a high degree of initiative and excellent follow-up skills, along with strong analytical and problem-solving skills.

Travel may be required but is not part of the regular work schedule. Bachelor’s degree in a technology-related field, as well as 5+ years of relevant experience in the Observability field.

Desired Qualifications

  • Advanced Concepts: Proven experience with Chaos Engineering, AI-driven analytics, defining SLOs/SLIs, and advanced deployment strategies (Canary/Blue-Green).
  • Software Engineering Foundation: Strong background in software engineering principles, database administration, and distributed systems architecture.
  • Certifications: Relevant senior-level industry certifications (e.g., AWS Certified DevOps Engineer - Professional, Certified Kubernetes Administrator).

The company presents this job description as a guide to the major areas and duties for which the jobholder is accountable. However, the business operates in an environment that demands change and the jobholder’s specific responsibilities and activities will vary and develop. Therefore, the job description should be seen as indicative and not as a permanent, definitive, and exhaustive statement.

Sr. Observability Engineer employer: Dormont Manufacturing Co

Universal Music Group is an exceptional employer that fosters a vibrant and inclusive work culture, where creativity and innovation thrive. As a Senior Observability Engineer, you will not only lead cutting-edge projects but also benefit from extensive professional development opportunities in a globally recognised music company. With a commitment to diversity and employee well-being, UMG ensures that every team member can contribute their unique talents while enjoying a supportive environment that champions growth and collaboration.

Contact Details:

Dormont Manufacturing Co Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Sr. Observability Engineer

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This is your chance to demonstrate your technical prowess and passion for observability.

Tip Number 3

Prepare for interviews by brushing up on common technical questions and scenarios related to observability. Practice explaining your thought process clearly, as communication is key in this role.

Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of the Universal Music family.

We think you need these skills to ace Sr. Observability Engineer

Observability
Site Reliability Engineering (SRE)
DevOps
Architecting Monitoring Solutions
Dynatrace
AWS CloudWatch
Prometheus

Some tips for your application 🫡

Tailor Your CV: Make sure your CV reflects the skills and experiences that align with the Sr. Observability Engineer role. Highlight your hands-on experience with observability tools and any leadership roles you've had in past projects.

Craft a Compelling Cover Letter: Use your cover letter to tell us why you're passionate about observability and how your background makes you a great fit for our team. Don’t forget to mention any innovative solutions you've implemented in previous roles!

Showcase Your Technical Skills: Be specific about your technical expertise, especially with tools like Dynatrace, Prometheus, and AWS. We want to see how you've used these technologies to solve complex problems in your previous positions.

Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our amazing team!

How to prepare for a job interview at Dormont Manufacturing Co

Know Your Tech Inside Out

As a Senior Observability Engineer, you’ll need to showcase your expertise with tools like Dynatrace, Prometheus, and Grafana. Brush up on your knowledge of these platforms and be ready to discuss how you've used them in past projects. Prepare specific examples that highlight your problem-solving skills and technical leadership.

Showcase Your Strategic Vision

This role is all about driving the observability strategy. Be prepared to talk about your vision for monitoring and logging solutions. Think about how you would architect a comprehensive observability stack and be ready to share your ideas on optimising performance and scalability in a hybrid environment.

Demonstrate Your Mentorship Skills

Since mentoring is a key part of this role, think of examples where you've led teams or guided colleagues. Discuss how you foster a culture of learning and best practices within your team. Highlight any experiences where you’ve influenced others to adopt new technologies or processes.

Prepare for Scenario-Based Questions

Expect questions that assess your analytical and strategic problem-solving abilities. Prepare for scenarios where you might need to resolve complex incidents or optimise resource utilisation. Practise articulating your thought process clearly, as communication is crucial in collaborating with cross-functional teams.