SmartSearch’s distinctive Anti-Money Laundering verification software protects our clients by offering the most advanced and comprehensive features available from an AML provider. We are looking for a Site Reliability Engineer (SRE) who will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. This role focuses on maintaining and improving system observability, automating operations, and enhancing deployment practices to support business-critical services. Reporting directly to the Lead Site Reliability Engineer, you will be expected to work independently while collaborating closely with engineering and operations teams. You will be responsible for implementing and maintaining monitoring and logging solutions while producing clear documentation to support the cloud environment. Continuous learning and improving performance based on set targets will be expected. Please note, you\’ll be required to be within commutable distance to the Ilkley office for occasional office attendance. Ensuring system reliability, performance, and scalability through monitoring and automation Building and maintaining observability solutions using Grafana, Prometheus, Loki, OpenTelemetry Proactively identifying and resolving performance bottlenecks and infrastructure issues Automating infrastructure provisioning, configuration management, and deployments Implementing effective logging, monitoring, and alerting strategies Working with DevOps engineers to streamline CI/CD pipelines and automate testing Providing detailed documentation for cloud infrastructure, deployment processes, and best practices Actively participating in capacity planning and cloud architectural decisions Experience designing and implementing robust observability, monitoring and logging solutions Strong proficiency with observability and monitoring tools such as Grafana, Prometheus, and Loki An understanding of cloud networking architecture and load balancing techniques Good written and verbal communication skills, with a strong standard of English Desire to continuously learn and stay updated with technology advancements Several years’ experience in an SRE, DevOps, or similar role Knowledge of application performance monitoring solutions like DataDog or NewRelic Hands-on experience with DevOps practices, including CI/CD pipelines and automated deployments Understanding of software development, ideally with PHP Strong automation and scripting abilities with Python, Bash, or Go Proficiency in capacity planning and performance optimization We are a multi-award winning Tech company with an aspirational mentality Some of our most recent recognitions include: named in the renowned RegTech100 list for 2024 , listed in the Top 100 Fasted Growing Tech Companies by Northern Tech Awards 2024 as well as being named Technology Provider of the Year by Corporate Finance Awards 2024 There are excellent progression opportunities due to our growth and you will have personal development goals, regular feedback and support We are a diverse and inclusive team committed to promoting Diversity & Inclusion and Social Responsibility. Through our DE&I group, charitable initiatives and support for local schools, we actively foster a positive Impact on our community Our comprehensive benefit package includes: ~25 days holiday rising to 30 with each year of service ~ Private Medical Insurance covering dental and optical ~ Company pension scheme ~ Life Assurance – 4x your annual salary ~Employee Assistance Programme ~ Cycle to work scheme ~ On site gym
Contact Detail:
SmartSearch Recruiting Team