Senior Data Reliability Engineer

Senior Data Reliability Engineer

Full-Time 36000 - 60000 £ / year (est.) No working from home possible
E

At a Glance

  • Tasks: Drive engagement with Site Reliability and ensure robust, reliable software across teams.
  • Company: Join a leading tech company focused on data quality and reliability in the crypto ecosystem.
  • Benefits: Hybrid work, £500 remote budget, $1,000 learning budget, and 25 days annual leave.
  • Other info: Dynamic team culture with opportunities for personal and professional growth.
  • Why this job: Make a real impact on enterprise-grade applications and lead the charge in data reliability.
  • Qualifications: Experience in site reliability, managing large datasets, and supporting distributed systems.

The predicted salary is between 36000 - 60000 £ per year.

Overview

The impact you will have: As Senior DRE, you will drive engagement with Site Reliability across the full breadth of engineering. You will hold every engineer and every team accountable in building highly-resilient, robust, reliable software. You will be part of a cross-functional, cross-discipline team of SMEs and on-callers, whose mission it is to keep our platform highly performant 24/7/365. Responsible for a diverse suite of products, you will oversee SR of enterprise grade applications that sit on the critical path running 1000s of QPS. Elliptic is known for its extensive and reliable datasets and you will play a critical role in defining and building out a market-leading foundation for data quality and control. This means building the processes, culture, and frameworks that will power observability, quality, data lineage, and remediation to form an essential pillar of our data & intelligence platform.

What you will do: This is a cross team role, and you will have the full support of leadership and engineering in carrying out your responsibilities - it’s not all down to you, but you will show the rest of us what good looks like.

  • Evangelise SRE & DRE across engineering
  • Lead the charge on building out a framework for data quality that will provide our customers with strong guarantees about the fidelity of our data as well support our marketing and revenue functions
  • SRE as a function define and own the on-call process:
    • Quickly establishing a strong working knowledge of our systems
    • Commanding incidents
    • Running mop-ups
    • Ensuring follow-up actions are completed to your schedule
    • Evaluating and improving our existing E2E on-call process
  • Take part in the on-call rotation, one week every 4-5 weeks (24x7x365 coverage)
  • Evaluate, manage and maintain our existing solutions for monitoring, alerting, paging, response, documentation
  • Report on uptime, availability, performance, etc across our product suite
  • Write post-mortems for both internal and external consumption
  • Represent our SRE & DRE function on sales calls with tier one enterprise financial institutions
  • Work with product, sales and customer service to define SLAs for different products and use cases
  • Work with internal product teams to define SLOs for internal consumption and measurement
  • Work with our engineering teams directly to embed DRE practices

You will be a great fit here if you:

  • Thrive under high pressure situations, and are able to make tough decisions quickly
  • Fail fast, own the failure; encourage a blame free engineering culture
  • Are an inspiring thought leader, and are able to take others with you on a journey
  • Aren’t afraid to get your hands dirty and dig into code across myriad technologies
  • Understand the importance of reliability in enterprise finance systems
  • Have strong opinions based on your experience that you evolve over time as you learn from others

Our ideal candidate has:

  • Proven experience at leveling up the quality and reliability of large datasets not just services and APIs
  • Experience leading site reliability for a high volume SaaS product
  • Supported distributed systems in AWS
  • The presence and empathy required to hold teams to account
  • Defined SLAs / SLOs both internal and client facing
  • Offered post mortems to enterprise clients (verbal and written)

Bonus Points for:

  • Having a genuine interest in the crypto ecosystem and being behind the mission of the company
  • Working knowledge of Kubernetes and the challenges presented

Job Benefits

How we work:

  • Hybrid working and the option to work from almost anywhere for up to 90 days per year
  • £500 Remote working budget to set up your home office space

Learning & Development:

  • $1,000 Learning & Development budget to use on anything (agreed with your manager) that contributes to your growth and development

Vacation/ Leave:

  • Holidays: 25 days of annual leave + bank holidays
  • An extra day for your birthday
  • Enhanced parental leave: we provide eligible employees, regardless of gender or whether they become a parent by birth or adoption, 16 weeks fully-paid leave and leave.

Benefits:

  • Private Health Insurance - we use Vitality!
  • Full access to Spill Mental Health Support
  • Life Assurance: we hope you will never need this - but our cover is for 4 times your salary to your beneficiaries
  • £100 Crypto for you!
  • Cycle to Work Scheme

Senior Data Reliability Engineer employer: Elliptic Enterprises Limited

At Elliptic, we pride ourselves on being an exceptional employer that fosters a culture of collaboration and innovation. Our hybrid working model allows for flexibility, complemented by generous benefits such as a £500 remote working budget and a $1,000 learning and development allowance to support your professional growth. With a strong emphasis on employee well-being, including private health insurance and enhanced parental leave, we are committed to creating a rewarding and inclusive environment for our team members.

E

Contact Details:

Elliptic Enterprises Limited Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Senior Data Reliability Engineer

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.

Tip Number 2

Prepare for interviews by practising common questions and scenarios related to data reliability. We recommend doing mock interviews with friends or using online platforms to get comfortable with your responses.

Tip Number 3

Showcase your skills through personal projects or contributions to open-source. This not only demonstrates your expertise but also gives you something tangible to discuss during interviews.

Tip Number 4

Apply directly through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team.

We think you need these skills to ace Senior Data Reliability Engineer

Site Reliability Engineering (SRE)
Data Reliability Engineering (DRE)
Incident Management
Monitoring and Alerting Solutions
Post-Mortem Analysis
Service Level Agreements (SLAs)
Service Level Objectives (SLOs)

Some tips for your application 🫡

Show Your Passion for Data Reliability:When writing your application, let us see your enthusiasm for data reliability and site reliability engineering. Share specific examples of how you've improved data quality or reliability in past roles – we love a good story!

Tailor Your Application:Make sure to customise your application to highlight the skills and experiences that align with the Senior Data Reliability Engineer role. Use keywords from the job description to show us you understand what we're looking for.

Be Clear and Concise:Keep your application straightforward and to the point. We appreciate clarity, so avoid jargon and long-winded explanations. Make it easy for us to see why you're the perfect fit for our team!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy – just follow the prompts!

How to prepare for a job interview at Elliptic Enterprises Limited

Know Your Stuff

Make sure you have a solid understanding of Site Reliability Engineering (SRE) and Data Reliability Engineering (DRE). Brush up on the principles of building resilient systems, and be ready to discuss your experience with large datasets and enterprise applications. This will show that you’re not just familiar with the concepts but can also apply them in real-world scenarios.

Showcase Your Leadership Skills

As a Senior Data Reliability Engineer, you'll need to lead by example. Prepare examples of how you've inspired teams or driven change in previous roles. Think about times when you’ve had to hold teams accountable or improve processes, and be ready to share those stories during the interview.

Prepare for Technical Questions

Expect technical questions that dive deep into your knowledge of monitoring, alerting, and incident management. Be ready to discuss specific tools and frameworks you've used, as well as how you've handled incidents in the past. Practising these scenarios can help you articulate your thought process clearly.

Emphasise Your Problem-Solving Skills

The role requires quick decision-making under pressure. Prepare to discuss situations where you’ve had to think on your feet and resolve issues swiftly. Highlight your ability to learn from failures and how you foster a blame-free culture, which is crucial for a high-performing engineering team.