Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

Woking
Full-Time
£36,000 - £60,000 / year (est.)
No home office possible

At a Glance

  • Tasks: Administer and optimize Datadog and SolarWinds for application monitoring and observability.
  • Company: Join Iron Mountain, a global leader in technology and innovation.
  • Benefits: Enjoy remote work flexibility, competitive rewards, and opportunities for personal growth.
  • Why this job: Be part of a diverse team driving innovation and making a real impact.
  • Qualifications: Must have UK residency, SC clearance, and experience with Datadog and SolarWinds.
  • Other info: Work in a collaborative environment with a focus on continuous improvement.

The predicted salary is between £36,000 and £60,000 per year.

THE OPPORTUNITY

Title: Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

Location: UK, 100% remote

Full time, permanent role

SC requirements: Must hold a UK passport, have been UK-based for more than five consecutive years, and be able to obtain SC security clearance

Global Technology and Innovation:

Driving performance and growth through people, innovation, security, and new ways of working, Global Technology and Innovation provides secure and stable infrastructure, competitively differentiated solutions, innovative technology platforms, and business operations for Iron Mountain.

Job summary:

This role is responsible for the comprehensive administration, configuration, and optimization of Datadog and SolarWinds monitoring platforms to ensure the health, performance, and availability of diverse applications and infrastructure.

The Monitoring and Observability Platform Engineer will leverage deep technical expertise in instrumenting applications, configuring infrastructure, network, and application monitoring, and establishing centralized logging solutions. This position requires a strong understanding of monitoring protocols, event correlation, and data trend analysis to provide end-to-end observability. The engineer will collaborate with cross-functional teams to integrate monitoring data with other critical platforms, support critical production issues, and contribute to the continuous improvement of monitoring strategies and tools. This role also includes the installation, maintenance, and upgrade of monitoring systems, as well as the creation of insightful dashboards and visualizations to drive proactive problem resolution and informed decision-making.

Your role in our mission:

  • Motivated self-starter with the ability to work on individual and team tasks
  • Engineer must be able to work effectively with the Enterprise Architects, OS engineers, and operation support teams to provide training, develop guidelines, and serve as a subject matter expert
  • Ability to share knowledge of monitoring best practices with system owners and system administrators to enhance overall monitoring and alerting posture
  • Ability to plan and execute system and software installations, upgrades, and changes across the organization
  • Identify risks/roadblocks and mitigate them throughout all projects and tasks while ensuring major design flaws are addressed
  • Ability to manage competing priorities and maintain a backlog
  • Experience with gathering and organizing large amounts of data to use for instrumentation into an Enterprise monitoring solution

People/Leadership:

  • On-call and flexible working schedule
  • Strong communication skills to relate technical details to non-technical leaders and stakeholders
  • Promote a positive working environment for the team and stakeholders
  • Enthusiastic about working with cross-functional teams and feel ownership over the success of each project
  • Working expertise in a collaborative environment and promoting a teamwork mentality
  • Excellent time management and organizational skills and experience establishing guidelines in these areas for others
  • Situational awareness – Must be the first to notice differences and issues as they arise and escalate them to management
  • Conflict resolution – Must be able to facilitate discussion and help identify alternatives or different approaches

Required Skills and Experience:

  • This role requires the candidate to be resident in the UK. UK Government SC clearance is required.
  • British national who has lived in the UK for more than 5 consecutive years and is able to pass a Home Office Security Clearance (SC) check

Must have:

  • Datadog
  • SolarWinds
  • Python, Ansible, or PowerShell scripting

Broader/General:

  • Application performance monitoring, network monitoring, or log monitoring
  • Browser tests, synthetic monitoring, or real user monitoring
  • Log configuration, log aggregation, or log formatting
  • Event correlation
  • End-to-end observability

Nice to have:

  • SIEM tools: SolarWinds SEM, Chronicle, or Nagios
  • Coding expertise in Ansible, Python, or PowerShell
  • Ability to create and execute complex SQL queries for reporting, alerting, correlation, etc.

Minimum Skills & Qualifications:

Minimum of four years of hands-on experience in the following:

  • Demonstrated expertise in administering Datadog and SolarWinds platforms by instrumenting diversified applications/solutions
  • Proficient in configuring Infrastructure Monitoring, Network Monitoring, Centralized Logging, and App monitoring (browser tests, API tests, APM, and synthetics) in Datadog and SolarWinds
  • Knowledge of the monitoring configuration protocols (SNMP v2/v3, SSH, WinRM, WMI, JMX) and event correlation
  • Working expertise in performance monitoring tool alerts, dashboards, and data trend analysis in a monitoring tool
  • Hands-on experience in monitoring a variety of end devices – routers, switches, firewalls, F5 load balancers, Infoblox, storage, virtual machines, Windows servers, Linux servers, and UNIX servers
  • Working expertise in implementing end-to-end observability by enriching the monitoring data with other platform data such as CMDB/ServiceNow ticketing platform, and other vendor platforms
  • Responsibilities encompass script development, installation, management, and maintenance of monitoring tools, along with seamless integrations with other systems and collaboration across teams/platforms
  • Configuration of centralized logging, aggregating logs from diverse sources such as WebSphere, Tomcat, and IIS web servers, as well as security and infrastructure logs, into Datadog/SolarWinds, with expertise in handling various log formats, including JSON payloads
  • Proficient in instrumenting diverse applications within Datadog and SolarWinds, setting up health rules, and optimizing monitoring settings
  • Implementation of End User Monitoring and Real User Monitoring using Datadog and SolarWinds, including the injection of required scripts
  • Support for critical production issues, including data gathering, performance analysis, solution recommendations, and issuing comprehensive issue reports
  • Install and perform SolarWinds upgrades/patches
  • Creation of data visualization dashboards in Datadog and SolarWinds
  • Collaboration with Systems and Application Architecture teams to include systems monitoring requirements in the migration/implementation process
  • Coordination with project teams to ensure the availability of monitoring for applications before their release into production
  • Contribution to the review and analysis of business and system requirements, specifically focusing on systems monitoring tool protocols and future tool utilization
  • Ability to implement and support a highly available continuous monitoring platform to be utilized by 24×7 operations and cross-functional teams
  • Knowledgeable in SSL setup and proficient in the installation and management of monitoring infrastructure certificates
  • Working expertise in automating infrastructure as code/operations using appropriate automation tools, preferably Ansible and Python, to establish event correlation
  • Expertise in recommending baseline monitoring thresholds, performance monitoring KPIs and SLAs, and monitoring tool infrastructure
  • Working expertise in a ticketing/CMDB platform, preferably ServiceNow (SNOW); other tools such as Remedy or Assyst are also acceptable
  • Diploma or Bachelor’s degree in computer science, information technology, or a related field

Discover what awaits you:

Discover Limitless Possibilities: Embark on an exciting journey with Iron Mountain, a global organization that embraces transformation and innovation.

Empowering Inclusion: Join a supportive environment where everyone’s voice is heard, opinions are valued, and feedback is encouraged, fostering an atmosphere of inclusion and belonging.

Global Connectivity: Connect with 26,000+ talented individuals from 59 countries, opening doors to diverse cultures and fostering global learning opportunities.

Championing Individuality: Be part of a winning team that celebrates diversity and encourages individual differences to drive greatness.

Competitive Total Rewards: Rewards that support your career at Iron Mountain, your family, and your personal wellness and wellbeing. (Local benefits may vary based on country-specific policies.)

Embrace Flexibility: Experience the freedom of remote/hybrid work, enabling a harmonious work-life balance (dependent on role).

Unleash Your Potential: Access abundant opportunities for personal and professional growth, preparing you for a digitalized future.

Valuing Every Contribution: Join a workplace that actively encourages and supports all talents, recognizing the unique impact of each individual.

Pioneering Sustainability: Contribute to our vision of fostering a sustainable and thriving workforce, leaving an enduring legacy for generations to come.

Monitoring and Observability Platform Engineer (Datadog and Solarwinds) employer: Iron Mountain

Iron Mountain is an exceptional employer that champions innovation and inclusivity, providing a supportive remote work environment for the Monitoring and Observability Platform Engineer role. With a commitment to employee growth, you will have access to abundant opportunities for professional development while collaborating with a diverse team of over 26,000 talented individuals globally. Enjoy competitive rewards and a flexible work-life balance as you contribute to pioneering sustainability and transformative solutions in a dynamic industry.

Contact Details:

Iron Mountain Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

✨Tip Number 1

Familiarize yourself with Datadog and SolarWinds by exploring their documentation and community forums. This will not only enhance your technical knowledge but also show your genuine interest in the tools during interviews.

✨Tip Number 2

Network with professionals in the field through platforms like LinkedIn. Engaging with others who work with monitoring tools can provide insights into best practices and may even lead to referrals.

✨Tip Number 3

Consider contributing to open-source projects or creating your own projects that utilize Datadog and SolarWinds. This hands-on experience can be a great talking point in interviews and demonstrates your proactive approach.

✨Tip Number 4

Prepare for potential technical interviews by practicing common scenarios related to monitoring and observability. Being able to discuss real-world applications of your skills will set you apart from other candidates.

We think you need these skills to ace Monitoring and Observability Platform Engineer (Datadog and Solarwinds)

Datadog Administration
SolarWinds Administration
Python Scripting
Ansible Scripting
PowerShell Scripting
Application Performance Monitoring
Network Monitoring
Log Monitoring
Event Correlation
End-to-End Observability
Centralized Logging Configuration
Data Visualization in Datadog and SolarWinds
SQL Query Development
Monitoring Protocols (SNMP, SSH, WinRM, WMI, JMX)
Performance Analysis
Collaboration with Cross-Functional Teams
Installation and Maintenance of Monitoring Tools
SSL Setup and Management
Automation using Ansible and Python
Knowledge of Ticketing/CMDB Platforms (ServiceNow, Remedy)

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights your experience with Datadog and SolarWinds, as well as your proficiency in Python, Ansible, or PowerShell scripting. Emphasize your hands-on experience in monitoring diverse applications and infrastructure.

Craft a Strong Cover Letter: In your cover letter, express your enthusiasm for the role and how your skills align with the responsibilities outlined in the job description. Mention specific projects where you successfully implemented monitoring solutions or improved observability.

Showcase Relevant Experience: When detailing your work history, focus on your experience with performance monitoring tools, event correlation, and data trend analysis. Provide examples of how you've contributed to system monitoring and optimization in previous roles.

Highlight Soft Skills: Don't forget to mention your strong communication skills and ability to work collaboratively with cross-functional teams. These are crucial for this role, so provide examples of how you've effectively communicated technical details to non-technical stakeholders.

How to prepare for a job interview at Iron Mountain

✨Showcase Your Technical Expertise

Be prepared to discuss your hands-on experience with Datadog and SolarWinds. Highlight specific projects where you configured monitoring systems, optimized performance, or resolved critical production issues.

✨Demonstrate Collaboration Skills

Since this role involves working with cross-functional teams, share examples of how you've effectively collaborated with others. Discuss any training or guidelines you've developed for team members to enhance monitoring practices.

✨Prepare for Scenario-Based Questions

Expect questions that assess your problem-solving abilities. Be ready to explain how you would handle specific monitoring challenges or system upgrades, including risk identification and mitigation strategies.

✨Communicate Clearly with Non-Technical Stakeholders

Strong communication skills are essential. Practice explaining complex technical concepts in simple terms, as you'll need to relate technical details to non-technical leaders and stakeholders during the interview.

Application deadline: 2027-03-27