Lead Site Reliability Engineer in Nottingham

Lead Site Reliability Engineer in Nottingham

Nottingham Full-Time 80000 - 100000 £ / year (est.) No working from home possible

At a Glance

  • Tasks: Lead the reliability of critical data pipelines and drive strategic initiatives for performance optimisation.
  • Company: Join a forward-thinking tech company focused on data operations and innovation.
  • Benefits: Competitive salary, flexible working options, and opportunities for professional growth.
  • Other info: Collaborative culture with opportunities to mentor and lead diverse teams.
  • Why this job: Make a real impact by ensuring data security and optimising performance in a dynamic environment.
  • Qualifications: Experience in site reliability or data operations with strong technical skills in AWS and automation.

The predicted salary is between 80000 - 100000 £ per year.

We are seeking a Lead Site Reliability Engineer (DataOps) to lead the charge on ensuring the health, reliability, and security of our critical data pipelines. This is a senior, hands-on technical role for an expert who is comfortable with mission-critical batch data pipelines in a cloud environment, integrating with numerous real-time data sources. You will be responsible for managing highly sensitive and critical data streams and driving strategic initiatives to minimize incidents, optimize performance, and build a resilient hybrid data environment. Your focus will be on proactive problem-solving, automation, and continuous improvement, transforming our operational processes from reactive to resilient.

What you’ll do

  • Production Support & Reliability: Act as the subject matter expert and technical lead for resolving the most complex, high-impact incidents affecting data pipelines. Manage multiple stakeholders for critical events. Perform in-depth root cause analysis to prevent recurrence, focusing on data pipelines, scheduling platforms such as Control-M and AWS-related services.
  • Data Security & Governance: Ensure the integrity and security of highly sensitive and critical data throughout the entire pipeline. Implement and enforce security best practices, including managing encryption at rest and in transit, access controls, and compliance.
  • Automation & Tooling: Develop and implement automation for common operational tasks to reduce manual toil. Focus on building tools and monitoring solutions that provide visibility into the end-to-end health of pipelines.
  • Performance Optimization: Proactively analyse and tune the performance of batch schedules and AWS resource utilization. Identify and implement optimizations to improve efficiency and reduce operational costs.
  • Collaboration & Leadership: Act as a technical leader and mentor for both onsite and offshore team members. Ensure seamless collaboration, clear communication, and consistent operational standards across a distributed team. Contribute to the long-term technical strategy for data operations including modernization efforts.

What we’re looking for

  • Demonstrable hands-on experience in a production support, site reliability, or data operations role within a large-scale data environment.
  • Experience with data distribution platforms (e.g. Ab Initio & Spark centric solutions like AWS Glue & EMR), including deep understanding of ETL/ELT workflows & integration into data platforms like Snowflake.
  • Extensive experience with scheduling platforms such as Control-M, including complex scheduling, dependencies, and managing a large batch environment.
  • Working knowledge of IBM Sterling File Gateway or similar file transfer (MFT) solutions would be beneficial (e.g. AWS Transfer Family).
  • Deep knowledge of AWS and its data-related services, including knowledge of open-source, cloud-first data-pipeline orchestration capabilities like Apache Airflow.
  • Proficiency in Shell scripting & Python for automation and system administration.
  • Proven ability to manage highly sensitive and critical data pipelines, with a strong understanding of security and compliance requirements.
  • Demonstrated experience working effectively with both onsite and offshore teams, ensuring seamless operational handoffs and knowledge sharing.
  • Excellent communication skills, with the ability to articulate complex technical issues to both technical teams and business stakeholders.
  • Experience with DevOps or DataOps principles and practices is essential.

Lead Site Reliability Engineer in Nottingham employer: 慨正橡扯

As a Lead Site Reliability Engineer at our company, you will thrive in a dynamic and innovative work culture that prioritises collaboration and continuous improvement. We offer competitive benefits, including professional development opportunities and a commitment to employee well-being, all within a cutting-edge cloud environment that fosters growth and resilience in data operations. Join us to make a meaningful impact while working alongside a talented team dedicated to excellence in data management.

Contact Details:

慨正橡扯 Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Lead Site Reliability Engineer in Nottingham

Tip Number 1

Network like a pro! Reach out to your connections in the industry, attend meetups, and engage in online forums. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to data pipelines and automation. This gives potential employers a tangible look at what you can do.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and problem-solving skills. Practice common SRE scenarios and be ready to discuss how you've tackled complex incidents in the past.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are proactive about their job search!

We think you need these skills to ace Lead Site Reliability Engineer in Nottingham

Site Reliability Engineering
DataOps
Cloud Environment Management
Batch Data Pipelines
Root Cause Analysis
Data Security
Automation

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Lead Site Reliability Engineer role. Highlight your hands-on experience with data pipelines, AWS services, and any relevant automation skills. We want to see how your background aligns with our needs!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about data reliability and how your experience makes you the perfect fit for our team. Don’t forget to mention specific projects or achievements that showcase your skills.

Showcase Your Problem-Solving Skills:In your application, be sure to highlight examples of how you've tackled complex incidents in the past. We love proactive problem-solvers, so share stories that demonstrate your ability to optimise performance and drive continuous improvement.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our awesome team at StudySmarter!

How to prepare for a job interview at 慨正橡扯

Know Your Tech Inside Out

Make sure you’re well-versed in the technologies mentioned in the job description, especially AWS services and data pipeline orchestration tools like Apache Airflow. Brush up on your knowledge of ETL/ELT workflows and be ready to discuss how you've applied these in real-world scenarios.

Prepare for Scenario-Based Questions

Expect questions that ask you to solve hypothetical problems related to data pipelines or incident management. Think about past experiences where you’ve had to perform root cause analysis or optimise performance, and be ready to share those stories with clear outcomes.

Showcase Your Leadership Skills

As a Lead Site Reliability Engineer, you’ll need to demonstrate your ability to mentor and lead teams. Prepare examples of how you’ve successfully collaborated with both onsite and offshore teams, and how you’ve contributed to strategic initiatives in previous roles.

Emphasise Security and Compliance Knowledge

Given the focus on data security and governance, be prepared to discuss best practices for managing sensitive data. Highlight any experience you have with encryption, access controls, and compliance measures, as this will show you understand the critical nature of the role.