At a Glance
- Tasks: Manage and optimize Datadog and SolarWinds for application and infrastructure monitoring.
- Company: Join Iron Mountain, a global leader in technology and innovation.
- Benefits: Enjoy remote work flexibility, competitive rewards, and opportunities for personal growth.
- Why this job: Be part of a diverse team driving innovation and sustainability in a supportive environment.
- Qualifications: Must have UK residency, SC clearance, and experience with Datadog, SolarWinds, and scripting.
- Other info: Work with cross-functional teams and contribute to critical production issues.
The predicted salary is between 36000 - 60000 £ per year.
THE OPPORTUNITY
Title: Monitoring and Observability Platform Engineer (Datadog and Solarwinds)
Location: UK, 100% remote
Full time, permanent role
SC requirements: Must have UK passport, be UK based for more than five consecutive years and able to obtain SC security clearance
Global Technology and Innovation:
Driving performance and growth through people, innovation, security, and new ways of working, Global Technology and Innovation provides secure and stable infrastructure, competitively differentiated solutions, innovative technology platforms, and business operations for Iron Mountain.
Job summary:
This role is responsible for the comprehensive administration, configuration, and optimization of Datadog and SolarWinds monitoring platforms to ensure the health, performance, and availability of diverse applications and infrastructure.
The Monitoring and Observability Platform Engineer will leverage deep technical expertise in instrumenting applications, configuring infrastructure, network, and application monitoring, and establishing centralized logging solutions. This position requires a strong understanding of monitoring protocols, event correlation, and data trend analysis to provide end-to-end observability. The engineer will collaborate with cross-functional teams to integrate monitoring data with other critical platforms, support critical production issues, and contribute to the continuous improvement of monitoring strategies and tools. This role also includes the installation, maintenance, and upgrade of monitoring systems, as well as the creation of insightful dashboards and visualizations to drive proactive problem resolution and informed decision-making.
Your role in our mission:
- Motivated self-starter with the ability to work on individual and team tasks
- Engineer must be able to work effectively with the Enterprise Architects, OS engineers, and operation support teams to provide training, develop guidelines, and serve as a subject matter expert
- Ability to share knowledge of monitoring best practices with system owners and system administrators to enhance overall monitoring and alerting posture
- Ability to plan and execute system and software installations upgrades and changes across the organization
- Identify risks/roadblocks and mitigate them throughout all projects and tasks while ensuring major design flaws are addressed
- Ability to prioritize competing priorities and maintain a backlog list
- Experience with gathering and organizing large amounts of data to use for instrumentation into an Enterprise monitoring solution
People/Leadership:
- On-call and flexible working schedule
- Strong communication skills to relate technical details to non-technical leaders and stakeholders
- Promote a positive working environment for the team and stakeholders
- Enthusiastic about working with cross-functional teams and feel ownership over the success of each project
- Working expertise in a collaborative environment and promoting a teamwork mentality
- Excellent time management and organizational skills and experience establishing guidelines in these areas for others
- Situationally Aware – Must be the first to notice differences and issues as they arise and elevate them to management
- Conflict resolution – Must be able to facilitate discussion and facilitate alternatives or different approaches.
Required Skills and Experience:
- This role requires the candidate to be resident in the UK. UK Government SC clearance is required.
- British National who has lived in the UK for more than 5 consecutive years and is able to pass a Home office Security Clearance check (SC)
Must have:
- Datadog
- Solarwinds
- Python or Ansible or Powershell scripting
Broader/General:
- application performance monitoring or network monitoring or log monitoring
- browser tests or synthetic monitoring or real user monitoring
- log configuration or log aggregation or log formatting
- event correlation
- end to end Observability
Nice to have: SIEM tools:
- Solarwinds SEM or Chronicle Nagios
- Coding expertise in Ansible or python or Powershell
- Ability to create and execute complex SQL queries for reporting, alerting, correlation, etc
Minimum Skills & Qualifications:
Minimum of four years of hands-on experience in the following:
- Demonstrated expertise in administering Datadog and SolarWinds platforms by instrumenting diversified applications/solutions
- Proficient in configuring Infrastructure Monitoring, Network Monitoring, Centralized Logging, and App monitoring (browser tests, API tests, APM, and synthetics) in Datadog and Solarwinds
- Knowledge of the monitoring configuration protocols (SNMP v2/v3, SSH, WinRM, WMI, JMX) and event correlation
- Working expertise in performance monitoring tool alerts, dashboards, and data trend analysis in a monitoring tool
- Hands-on experience in monitoring a variety of end devices – routers, switches, firewalls, F5 Load balancer, Infoblox, storage, virtual, Windows servers, Linux servers, and UNIX servers
- Working expertise in implementing end-to-end observability by enriching the monitoring data with other platform data such as CMDB/ServiceNow ticketing platform, and other vendor platforms
- Responsibilities encompass script development, installation, management, and maintenance of monitoring tools, along with seamless integrations with other systems and collaboration across teams/platforms
- Configuration of centralized logging, aggregating logs from diverse sources such as WebSphere, Tomcat, and IIS WebServers into Datadog/Solarwinds, security/infrastructure logs with expertise in handling various log formats, including JSON Payload
- Proficient in instrumenting diverse applications within Datadog and Solarwinds, setting up health rules, and optimizing monitoring settings
- Implementation of End User Monitoring and Real User Monitoring using Datadog and SolarWinds, including the injection of required scripts
- Support for critical production issues, includes data gathering, performance analysis, solution recommendations, and issuing comprehensive issue reports
- Install and perform Solarwinds upgrades/patches
- Creation of data visualization dashboards in Datadog and Solarwinds
- Collaboration with Systems and Application Architecture teams to have systems monitoring requirements in the migration/implementation process
- Coordination with project teams to ensure the availability of monitoring for applications before their release into production
- Contribution to the review and analysis of business and system requirements, specifically focusing on systems monitoring tool protocols and future tool utilization
- Ability to implement and support a highly available continuous monitoring platform to be utilized by 24×7 operations and cross-functional teams
- Knowledgeable in SSL setup and proficient in the installation and management of monitoring infrastructure certificates
- Working expertise in automating infrastructure as code/operations using appropriate automation tools. Preferably Ansible and Python platforms to establish event correlation
- Leverage expertise in recommending baseline monitoring thresholds, recommend performance monitoring KPIs and SLAs, and provide monitoring tool infrastructure recommendations
- Working expertise in a ticketing/CMDB platform. Preferably SNOW, but other tools acceptable such as Remedy, Assyst, etc
- Diploma or Bachelor\’s degree in computer science, information technology or a related field
Discover what awaits you:
Discover Limitless Possibilities: Embark on an exciting journey with Iron Mountain, a global organization that embraces transformation and innovation.
Empowering Inclusion: Join a supportive environment where everyone\’s voice is heard, opinions are valued, and feedback is encouraged, fostering an atmosphere of inclusion and belonging.
Global Connectivity: Connect with 26,000+ talented individuals from 59 countries, opening doors to diverse cultures and fostering global learning opportunities.
Championing Individuality: Be part of a winning team that celebrates diversity and encourages individual differences to drive greatness.
Competitive Total Rewards: supporting your career at Iron Mountain, family, personal wellness, and wellbeing. (Local benefits may vary based on country-specific policies.)
Embrace Flexibility: Experience the freedom of remote/hybrid work, enabling a harmonious work-life balance (dependent on role).
Unleash Your Potential: Access abundant opportunities for personal and professional growth, preparing you for a digitalized future.
Valuing Every Contribution: Join a workplace that actively encourages and supports all talents, recognizing the unique impact of each individual.
Pioneering Sustainability: Contribute to our vision of fostering a sustainable and thriving workforce, leaving an enduring legacy for generations to come.
Monitoring and Observability Platform Engineer (Datadog and Solarwinds) employer: Iron Mountain
Contact Detail:
Iron Mountain Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Monitoring and Observability Platform Engineer (Datadog and Solarwinds)
✨Tip Number 1
Make sure to showcase your hands-on experience with Datadog and SolarWinds in your conversations. Highlight specific projects where you configured monitoring solutions or optimized performance, as this will demonstrate your expertise directly related to the role.
✨Tip Number 2
Familiarize yourself with the latest trends and best practices in observability and monitoring. Being able to discuss current methodologies and tools during interviews can set you apart and show that you're proactive about staying updated in the field.
✨Tip Number 3
Network with professionals in the industry, especially those who work with Datadog and SolarWinds. Engaging in relevant online communities or attending webinars can provide insights and connections that may help you land the job.
✨Tip Number 4
Prepare to discuss how you've collaborated with cross-functional teams in past roles. This position emphasizes teamwork, so sharing examples of successful collaborations will demonstrate your ability to work effectively within diverse groups.
We think you need these skills to ace Monitoring and Observability Platform Engineer (Datadog and Solarwinds)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights your experience with Datadog and SolarWinds, as well as your proficiency in Python, Ansible, or PowerShell scripting. Emphasize your hands-on experience in application performance monitoring and end-to-end observability.
Craft a Strong Cover Letter: In your cover letter, express your enthusiasm for the role and how your skills align with the responsibilities outlined in the job description. Mention specific projects where you've successfully implemented monitoring solutions or collaborated with cross-functional teams.
Showcase Relevant Experience: When detailing your work history, focus on your experience with monitoring protocols, event correlation, and data trend analysis. Provide examples of how you've contributed to improving monitoring strategies and tools in previous roles.
Highlight Soft Skills: Don't forget to mention your strong communication skills and ability to work in a collaborative environment. Highlight instances where you've facilitated discussions or resolved conflicts, as these are crucial for the role.
How to prepare for a job interview at Iron Mountain
✨Showcase Your Technical Expertise
Be prepared to discuss your hands-on experience with Datadog and SolarWinds. Highlight specific projects where you configured monitoring solutions, optimized performance, or resolved critical production issues.
✨Demonstrate Collaboration Skills
Since this role involves working with cross-functional teams, share examples of how you've successfully collaborated with others. Emphasize your ability to communicate technical details to non-technical stakeholders.
✨Prepare for Scenario-Based Questions
Expect questions that assess your problem-solving skills in real-world scenarios. Think about past experiences where you identified risks, mitigated issues, or improved monitoring strategies, and be ready to discuss them.
✨Highlight Your Continuous Improvement Mindset
Discuss how you stay updated with the latest monitoring tools and best practices. Share any initiatives you've taken to enhance monitoring capabilities or streamline processes in previous roles.