Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

Luton Full-Time 36000 - 60000 £ / year (est.) No home office possible

At a Glance

  • Tasks: Administer and optimize Datadog and SolarWinds for application monitoring and observability.
  • Company: Join Iron Mountain, a global leader in technology and innovation.
  • Benefits: Enjoy remote work flexibility, competitive rewards, and opportunities for personal growth.
  • Why this job: Be part of a diverse team driving innovation and making a real impact.
  • Qualifications: Must have UK residency, SC clearance, and experience with Datadog and SolarWinds.
  • Other info: Work in a collaborative environment with a focus on continuous improvement.

The predicted salary is between £36,000 and £60,000 per year.

THE OPPORTUNITY

Title: Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

Location: UK, 100% remote

Full time, permanent role

SC requirements: Must hold a UK passport, have been UK-based for more than five consecutive years, and be able to obtain SC security clearance

Global Technology and Innovation:

Driving performance and growth through people, innovation, security, and new ways of working, Global Technology and Innovation provides secure and stable infrastructure, competitively differentiated solutions, innovative technology platforms, and business operations for Iron Mountain.

Job summary:

This role is responsible for the comprehensive administration, configuration, and optimization of Datadog and SolarWinds monitoring platforms to ensure the health, performance, and availability of diverse applications and infrastructure.

The Monitoring and Observability Platform Engineer will leverage deep technical expertise in instrumenting applications, configuring infrastructure, network, and application monitoring, and establishing centralized logging solutions. This position requires a strong understanding of monitoring protocols, event correlation, and data trend analysis to provide end-to-end observability. The engineer will collaborate with cross-functional teams to integrate monitoring data with other critical platforms, support critical production issues, and contribute to the continuous improvement of monitoring strategies and tools. This role also includes the installation, maintenance, and upgrade of monitoring systems, as well as the creation of insightful dashboards and visualizations to drive proactive problem resolution and informed decision-making.

Your role in our mission:

  • Motivated self-starter with the ability to work on individual and team tasks
  • Engineer must be able to work effectively with the Enterprise Architects, OS engineers, and operation support teams to provide training, develop guidelines, and serve as a subject matter expert
  • Ability to share knowledge of monitoring best practices with system owners and system administrators to enhance overall monitoring and alerting posture
  • Ability to plan and execute system and software installations, upgrades, and changes across the organization
  • Identify risks/roadblocks and mitigate them throughout all projects and tasks while ensuring major design flaws are addressed
  • Ability to prioritize competing demands and maintain a backlog
  • Experience with gathering and organizing large amounts of data to use for instrumentation into an Enterprise monitoring solution

People/Leadership:

  • On-call and flexible working schedule
  • Strong communication skills to relate technical details to non-technical leaders and stakeholders
  • Promote a positive working environment for the team and stakeholders
  • Enthusiastic about working with cross-functional teams and feel ownership over the success of each project
  • Experience working in a collaborative environment and promoting a teamwork mentality
  • Excellent time management and organizational skills and experience establishing guidelines in these areas for others
  • Situationally Aware – Must be the first to notice differences and issues as they arise and escalate them to management
  • Conflict resolution – Must be able to facilitate discussion and propose alternatives or different approaches.

Required Skills and Experience:

  • This role requires the candidate to be resident in the UK. UK Government SC clearance is required.
  • British national who has lived in the UK for more than 5 consecutive years and is able to pass a Home Office Security Clearance check (SC)

Must have:

  • Datadog
  • SolarWinds
  • Python, Ansible, or PowerShell scripting

Broader/General:

  • Application performance monitoring, network monitoring, or log monitoring
  • Browser tests, synthetic monitoring, or real user monitoring
  • Log configuration, log aggregation, or log formatting
  • Event correlation
  • End-to-end observability

Nice to have:

  • SIEM tools: SolarWinds SEM, Chronicle, or Nagios
  • Coding expertise in Ansible, Python, or PowerShell
  • Ability to create and execute complex SQL queries for reporting, alerting, correlation, etc.

Minimum Skills & Qualifications:

Minimum of four years of hands-on experience in the following:

  • Demonstrated expertise in administering Datadog and SolarWinds platforms by instrumenting diversified applications/solutions
  • Proficient in configuring Infrastructure Monitoring, Network Monitoring, Centralized Logging, and App monitoring (browser tests, API tests, APM, and synthetics) in Datadog and SolarWinds
  • Knowledge of the monitoring configuration protocols (SNMP v2/v3, SSH, WinRM, WMI, JMX) and event correlation
  • Working expertise in alerts, dashboards, and data trend analysis in a performance monitoring tool
  • Hands-on experience in monitoring a variety of end devices – routers, switches, firewalls, F5 Load balancer, Infoblox, storage, virtual, Windows servers, Linux servers, and UNIX servers
  • Working expertise in implementing end-to-end observability by enriching the monitoring data with other platform data such as CMDB/ServiceNow ticketing platform, and other vendor platforms
  • Responsibilities encompass script development, installation, management, and maintenance of monitoring tools, along with seamless integrations with other systems and collaboration across teams/platforms
  • Configuration of centralized logging, aggregating application logs from diverse sources such as WebSphere, Tomcat, and IIS web servers, as well as security and infrastructure logs, into Datadog/SolarWinds, with expertise in handling various log formats, including JSON payloads
  • Proficient in instrumenting diverse applications within Datadog and SolarWinds, setting up health rules, and optimizing monitoring settings
  • Implementation of End User Monitoring and Real User Monitoring using Datadog and SolarWinds, including the injection of required scripts
  • Support for critical production issues, including data gathering, performance analysis, solution recommendations, and issuing comprehensive issue reports
  • Install and apply SolarWinds upgrades/patches
  • Creation of data visualization dashboards in Datadog and SolarWinds
  • Collaboration with Systems and Application Architecture teams to include systems monitoring requirements in the migration/implementation process
  • Coordination with project teams to ensure the availability of monitoring for applications before their release into production
  • Contribution to the review and analysis of business and system requirements, specifically focusing on systems monitoring tool protocols and future tool utilization
  • Ability to implement and support a highly available continuous monitoring platform to be utilized by 24×7 operations and cross-functional teams
  • Knowledgeable in SSL setup and proficient in the installation and management of monitoring infrastructure certificates
  • Working expertise in automating infrastructure as code/operations using appropriate automation tools, preferably Ansible and Python, to establish event correlation
  • Expertise in recommending baseline monitoring thresholds, performance monitoring KPIs and SLAs, and monitoring tool infrastructure
  • Working expertise in a ticketing/CMDB platform, preferably ServiceNow (SNOW), although other tools such as Remedy or Assyst are acceptable
  • Diploma or Bachelor’s degree in computer science, information technology or a related field

Discover what awaits you:

Discover Limitless Possibilities: Embark on an exciting journey with Iron Mountain, a global organization that embraces transformation and innovation.

Empowering Inclusion: Join a supportive environment where everyone’s voice is heard, opinions are valued, and feedback is encouraged, fostering an atmosphere of inclusion and belonging.

Global Connectivity: Connect with 26,000+ talented individuals from 59 countries, opening doors to diverse cultures and fostering global learning opportunities.

Championing Individuality: Be part of a winning team that celebrates diversity and encourages individual differences to drive greatness.

Competitive Total Rewards: Support for your career at Iron Mountain, your family, and your personal wellness and wellbeing. (Local benefits may vary based on country-specific policies.)

Embrace Flexibility: Experience the freedom of remote/hybrid work, enabling a harmonious work-life balance (dependent on role).

Unleash Your Potential: Access abundant opportunities for personal and professional growth, preparing you for a digitalized future.

Valuing Every Contribution: Join a workplace that actively encourages and supports all talents, recognizing the unique impact of each individual.

Pioneering Sustainability: Contribute to our vision of fostering a sustainable and thriving workforce, leaving an enduring legacy for generations to come.

Monitoring and Observability Platform Engineer (Datadog and Solarwinds) employer: Iron Mountain

Iron Mountain is an exceptional employer that champions innovation and inclusivity, providing a supportive remote work environment for the Monitoring and Observability Platform Engineer role. With a commitment to employee growth, you will have access to abundant opportunities for professional development while collaborating with a diverse team of over 26,000 talented individuals globally. Enjoy competitive rewards and a flexible work-life balance, all while contributing to a sustainable future.

Contact Details:

Iron Mountain Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Monitoring and Observability Platform Engineer (Datadog and Solarwinds) role

✨Tip Number 1

Familiarize yourself with Datadog and SolarWinds by exploring their documentation and community forums. This will not only enhance your technical knowledge but also show your genuine interest in the tools during interviews.

✨Tip Number 2

Network with professionals in the field through LinkedIn or relevant tech meetups. Engaging with others who work with monitoring platforms can provide insights and potentially lead to referrals.

✨Tip Number 3

Consider contributing to open-source projects or creating your own projects that utilize Datadog and SolarWinds. This hands-on experience can be a great talking point in interviews and demonstrate your practical skills.

✨Tip Number 4

Prepare for potential technical interviews by practicing common scenarios related to monitoring and observability. Be ready to discuss how you would approach specific challenges using Datadog and SolarWinds.

We think you need these skills to ace the Monitoring and Observability Platform Engineer (Datadog and Solarwinds) role

Datadog Administration
SolarWinds Administration
Python Scripting
Ansible Scripting
PowerShell Scripting
Application Performance Monitoring
Network Monitoring
Log Monitoring
Event Correlation
End-to-End Observability
Centralized Logging Configuration
Data Visualization in Datadog and SolarWinds
SQL Query Development
Monitoring Protocols (SNMP, SSH, WinRM, WMI, JMX)
Performance Analysis
Collaboration with Cross-Functional Teams
Installation and Maintenance of Monitoring Tools
SSL Setup and Management
Automation using Ansible and Python
Knowledge of Ticketing/CMDB Platforms (e.g., ServiceNow)

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with Datadog and SolarWinds. Include specific examples of how you've administered these platforms, configured monitoring solutions, and optimized performance.

Craft a Strong Cover Letter: In your cover letter, express your enthusiasm for the role and the company. Mention your understanding of monitoring protocols and your ability to collaborate with cross-functional teams, as these are key aspects of the job.

Showcase Relevant Skills: Clearly list your technical skills related to Python, Ansible, or PowerShell scripting. Highlight your hands-on experience with application performance monitoring and end-to-end observability, as these are crucial for the position.

Demonstrate Problem-Solving Abilities: Provide examples in your application that demonstrate your ability to identify risks and resolve conflicts. This will show your potential employer that you can handle critical production issues effectively.

How to prepare for a job interview at Iron Mountain

✨Showcase Your Technical Expertise

Be prepared to discuss your hands-on experience with Datadog and SolarWinds. Highlight specific projects where you configured monitoring systems, optimized performance, or resolved critical issues. This will demonstrate your deep technical knowledge and problem-solving skills.

✨Communicate Clearly with Non-Technical Stakeholders

Since the role requires strong communication skills, practice explaining complex technical concepts in simple terms. Think of examples where you've successfully communicated with non-technical team members or stakeholders, as this will show your ability to bridge the gap between technical and non-technical audiences.

✨Demonstrate Collaboration Skills

Prepare to discuss your experience working in cross-functional teams. Share examples of how you've collaborated with Enterprise Architects, OS engineers, and operation support teams to enhance monitoring strategies or resolve production issues. This will highlight your teamwork mentality and ability to promote a positive working environment.

✨Prepare for Scenario-Based Questions

Expect scenario-based questions that assess your ability to identify risks, prioritize tasks, and manage competing priorities. Think of past experiences where you successfully navigated challenges in monitoring or observability projects, and be ready to explain your thought process and decision-making.

Application deadline: 2027-03-27