At a Glance
- Tasks: Lead the charge in ensuring our cloud platforms are reliable, scalable, and maintainable.
- Company: Join NiCE, a global leader in innovative software solutions for public safety.
- Benefits: Enjoy a competitive salary, flexible working options, and opportunities for professional growth.
- Other info: Be part of a dynamic team that values innovation and collaboration.
- Why this job: Make a real impact in public safety while working with cutting-edge technology.
- Qualifications: 6+ years in Site Reliability Engineering with strong technical and analytical skills.
The predicted salary is between 70000 - 90000 £ per year.
At NiCE, we don’t limit our challenges. We challenge our limits. Always. We’re ambitious. We’re game changers. And we play to win. We set the highest standards and execute beyond them. And if you’re like us, we can offer you the ultimate career opportunity that will light a fire within you.
Here at NICE Public Safety, we provide state of the art solutions for the Public Safety & Justice market, providing software as a service for multi-media evidence management and Emergency Contact Centres to a worldwide customer base. We are currently expanding our Cloud Platform Engineering team to ensure we continue to offer exemplary service to our customers. This is a very hands-on role. You will be involved in ensuring our cloud platforms are observable, measurable, reliable, scalable, and maintainable. It’s likely that the successful candidate will have significant experience in a DevOps, SRE, Cloud Engineer, or Cloud Development role.
How will you make an impact?
- Act as part of a team of SRE’s that act as the ‘gatekeepers’ of production and actively manage the work backlog and develop reliability improvements.
- Lead investigations into root cause outages, performance, and cost issues.
- Lead initiatives to develop the automation of low-value tasks balanced against project delivery demands.
- Provide technical leadership to wider Cloud Operations and Support teams along with providing oversight to the products and services they support.
- Collaborate with DevOps and engineering teams to establish and enforce SLOs, SLAs, and error budgets.
- Develop and configure monitoring dashboards and alerts in tools like Grafana and Azure Monitor.
- Installation and configuration of Observability Platform including tools like Grafana, Prometheus, Azure Monitor, Open telemetry etc.
- Developing bicep modules for monitoring infrastructure and deploy it.
- Optimize system performance, cost, and security through regular reviews and tuning.
Do you have what it takes?
- Must have 6+ years of experience in Site Reliability Engineering.
- Excellent technical, analytical and troubleshooting skills.
- Experience and in-depth knowledge of databases and data handling (MS-SQL, Elasticsearch, YML, JSON, XML).
- Experience with Azure cloud.
- Significant experience in programming or advanced scripting (Python, PowerShell, C# etc.).
- Experience with infrastructure/configuration as code and version control (ARM, BICEP, Git).
- Strong experience managing monitoring, alerting and dashboarding platforms (Azure Monitor, Prometheus, Grafana, Elasticsearch).
- Demonstrable experience of supporting live cloud services and platforms.
- Expert in developing queries for dashboards and alerting for microservices.
- Expertise in developing custom metrics for microservices.
- Production experience with Kubernetes and containerization (AKS).
- Exposure to Azure DevOps pipelines is desirable (CI/CD).
- Strong experience in infrastructure as a code, design and implementation strategies.
- Experience with AI (tools) to automate and accelerate is a plus.
- Efficient, effective, and respectful communication skills both with customers and within internal departments.
- Good listener, able to identify and validate assumptions.
- Able to use effective questioning to confirm understanding of a customer problem and then provide help to solve it.
- Methodical troubleshooting, technical skill and attention to detail used in diagnosing problems and reproducing issues in a local environment.
- Multi-tasking and time-management to prioritise and switch between varied tasks.
- Significant experience in platform engineering, observability, and provisioning.
- Proven ability to develop and implement a strategic vision for platform services, observability, and provisioning.
- Strong understanding of cyber security principles, governance, and compliance frameworks.
- Strong understanding and experience of cloud platforms, containerisation, and microservices architecture.
- Broad background across information technology with the ability to communicate clearly with non-security technical SMEs at a comfortable level.
- Strong proficiency in technical scoping, architecture design, and integration of security tools and processes.
- Ability to translate business needs into scalable, user-centric cloud solutions.
- Excellent communication and collaboration skills, with a focus on thought leadership and solution development.
- Experience in both operational and transformation roles or a clear working understanding of both perspectives.
- Knowledge of compliance with relevant frameworks, including ISO 27001, Cyber Essentials + or FEDRAMP.
Tooling:
- Kubernetes (Ideally AKS)
- Azure Devops Pipelines
- Elasticsearch and Cloud Observability Stacks
- IAC (Bicep/Terraform)
- Powershell / C#
Beneficial Certifications:
- AZ 104
- AZ 305
- AZ 500
- AZ 700
- CKA
NICE Ltd. (NASDAQ: NICE) software products are used by 25,000+ global businesses, including 85 of the Fortune 100 corporations, to deliver extraordinary customer experiences, fight financial crime and ensure public safety. Every day, NiCE software manages more than 120 million customer interactions and monitors 3+ billion financial transactions. Known as an innovation powerhouse that excels in AI, cloud and digital, NiCE is consistently recognized as the market leader in its domains, with over 8,500 employees across 30+ countries. NiCE is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, age, sex, marital status, ancestry, neurotype, physical or mental disability, veteran status, gender identity, sexual orientation or any other category protected by law.
Lead Site Reliability Engineer in Southampton employer: NICE
Contact Detail:
NICE Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Lead Site Reliability Engineer in Southampton
✨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. We all know that sometimes it’s not just what you know, but who you know that can land you that dream job.
✨Tip Number 2
Prepare for the interview by researching NiCE and its culture. Understand their products and how they impact public safety. This will help us show that you’re genuinely interested and ready to contribute.
✨Tip Number 3
Practice your technical skills! Brush up on your SRE knowledge and be ready to discuss your experience with cloud platforms and automation tools. We want to see your expertise shine during those technical interviews.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take that extra step!
We think you need these skills to ace Lead Site Reliability Engineer in Southampton
Some tips for your application 🫡
Show Your Passion: When writing your application, let your enthusiasm for the role shine through! We want to see how excited you are about Site Reliability Engineering and how you can contribute to our ambitious goals at NiCE.
Tailor Your Experience: Make sure to highlight your relevant experience in DevOps, SRE, or Cloud Engineering. We’re looking for specific examples that demonstrate your technical skills and how they align with the responsibilities outlined in the job description.
Be Clear and Concise: Keep your application straightforward and to the point. We appreciate clarity, so avoid jargon and focus on communicating your qualifications effectively. Remember, we want to understand your journey and what makes you a great fit!
Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensure you’re considered for this exciting opportunity. We can’t wait to hear from you!
How to prepare for a job interview at NICE
✨Know Your Tech Inside Out
Make sure you brush up on your technical skills, especially around Azure cloud, Kubernetes, and the tools mentioned in the job description like Grafana and Prometheus. Be ready to discuss your past experiences with these technologies and how you've used them to solve real-world problems.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of how you've tackled outages or performance issues in the past. Use the STAR method (Situation, Task, Action, Result) to structure your answers, highlighting your analytical and troubleshooting skills.
✨Understand Their Business
Research NiCE and their role in public safety and justice. Understand their products and services, and think about how your experience can contribute to their mission. This will show your genuine interest and help you connect your skills to their needs.
✨Communicate Effectively
Practice clear and concise communication. You’ll need to explain complex technical concepts to non-technical stakeholders, so be prepared to demonstrate your ability to simplify your language while still conveying important information.