Remote Senior Data Engineer (Data Quality, PySpark, Databricks) in Nottingham

Remote Senior Data Engineer (Data Quality, PySpark, Databricks) in Nottingham

Nottingham Full-Time 65000 - 75000 £ / year (est.) Home office (partial)
PEXA Group

At a Glance

  • Tasks: Ensure data quality and optimise transformation pipelines using PySpark and Databricks.
  • Company: Join Smoove, a leading tech provider in the property sector with a mission to simplify home moving.
  • Benefits: Competitive salary, remote work flexibility, and opportunities for professional growth.
  • Other info: Collaborative team environment with regular meet-ups for meaningful connections.
  • Why this job: Make a real impact on data reliability and help revolutionise the home ownership experience.
  • Qualifications: Experience in data engineering, strong SQL skills, and knowledge of GDPR compliance.

The predicted salary is between 65000 - 75000 £ per year.

Hi, we’re Smoove, part of the PEXA Group. Our vision is to simplify and revolutionise the home moving and ownership experience for everyone. We are on a mission to deliver products and services that remove the pain, frustration, uncertainty, friction and stress that the current process creates. We are a leading provider of tech in the property sector - founded in 2003, our product focus has been our conveyancer two-sided marketplace, connecting consumers with a range of quality conveyancers to choose from at competitive prices via our easy-to-use tech platform. We are now building out our ecosystem so consumers can benefit from our services either via their Estate Agent or their Mortgage Broker, through smarter conveyancing platforms, making the home buying or selling process easier, quicker, safer and more transparent.

Why join Smoove? Great question! We pride ourselves on attracting, developing and retaining a diverse range of people in an equally diverse range of roles and specialisms – who together achieve outstanding results. Our transparent approach and open-door policy make Smoove a great place to work and as our business expands, we are looking for ambitious, talented people to join us.

We are looking for a technically proficient Senior Data Engineer to join our growing Data team. Your primary focus will be on ensuring data quality, stability, and reliability — from the moment data arrives in its rawest form to when it is used in decision-making dashboards and customer-facing reports. You will optimise the transformation pipeline from start to finish, guaranteeing that datasets are robust, tested, secure, and business-ready. Our data platform is built using Databricks, with data pipelines written in PySpark and orchestrated using Airflow. You will be expected to challenge and improve current transformations, ensuring they meet our performance, scalability, and data governance needs. This includes work with complex, nested data structures, ensuring they are reliably parsed and transformed. Experience in managing sensitive data (PII) and implementing GDPR policies is required.

You’ll work closely with analysts, engineers, and business stakeholders to ensure that datasets are not only accurate but also trusted. You will collaborate with product and engineering teams to incorporate data from new products into our core business datasets, ensuring that these new sources meet our data standards and are quickly usable for business intelligence. You’ll help put controls in place — such as access policies, metadata layers, and automated data quality checks — to ensure long-term stability. Experience with a data governance platform like Alation is desirable.

While predominantly remote / home based the team meet up to 20-25 days per year for meaningful collaboration in either Leeds or Thame.

Key Responsibilities

  • Ensure end-to-end data quality, from raw ingested data to business-ready datasets
  • Optimise PySpark-based data transformation logic for performance and reliability
  • Build scalable and maintainable pipelines in Databricks and Airflow
  • Implement and uphold GDPR-compliant processes around PII data
  • Collaborate with stakeholders to define what 'business-ready' means, and confidently sign off datasets as fit for consumption
  • Put testing strategies in place to detect data issues early and often
  • Contribute to access management, metadata management, and wider data governance practices
  • Help shape our approach to reliable data delivery for internal and external customers

Skills

  • Alation experience preferred, but similar tools welcome
  • Strong SQL skills and experience optimising complex queries
  • Strong experience in handling and transforming semi-structured data
  • High competency in programming, with a focus on clean, efficient, and production-quality code
  • Demonstrated ability to work with stakeholders to understand data needs and guide the validation and delivery process
  • Experience implementing and maintaining data quality tests and monitoring solutions
  • Strong verbal and written communication skills
  • Ability to think holistically about data reliability and how it serves business decisions

£65,000 - £75,000 a year. Sound like you? We at Smoove are ready so if this role sounds like you, apply today.

Remote Senior Data Engineer (Data Quality, PySpark, Databricks) in Nottingham employer: PEXA Group

At Smoove, we foster a dynamic and inclusive work culture that prioritises employee development and collaboration. As a remote-first company with regular in-person meet-ups, we offer a unique opportunity to work on cutting-edge technology in the property sector while ensuring data quality and governance. Join us to be part of a mission-driven team that values transparency, innovation, and the growth of its members.

PEXA Group

Contact Details:

PEXA Group Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Remote Senior Data Engineer (Data Quality, PySpark, Databricks) in Nottingham

Tip Number 1

Network like a pro! Reach out to your connections in the industry, especially those who work at Smoove or similar companies. A friendly chat can sometimes lead to insider info about job openings or even a referral.

Tip Number 2

Prepare for the interview by brushing up on your technical skills. Make sure you can confidently discuss your experience with PySpark, Databricks, and data governance. We want to see how you tackle real-world problems!

Tip Number 3

Show us your passion for data quality! Be ready to share examples of how you've ensured data reliability in past roles. We love candidates who can demonstrate their commitment to delivering top-notch datasets.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re serious about joining our mission to revolutionise the home moving experience.

We think you need these skills to ace Remote Senior Data Engineer (Data Quality, PySpark, Databricks) in Nottingham

Data Quality Assurance
PySpark
Databricks
Airflow
GDPR Compliance
SQL
Data Transformation

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Senior Data Engineer role. Highlight your experience with PySpark, Databricks, and data quality management. We want to see how your skills align with our mission at Smoove!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share your passion for data engineering and how you can contribute to simplifying the home moving experience. Let us know why you're excited about joining Smoove!

Showcase Your Projects:If you've worked on relevant projects, don’t hold back! Include examples that demonstrate your ability to handle complex data structures and ensure data quality. We love seeing real-world applications of your skills.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy to do!

How to prepare for a job interview at PEXA Group

Know Your Data Tools

Familiarise yourself with Databricks, PySpark, and Airflow before the interview. Be ready to discuss how you've used these tools in past projects, especially focusing on data quality and transformation processes.

Understand GDPR Compliance

Since handling sensitive data is crucial for this role, brush up on GDPR policies and be prepared to explain how you've implemented these in your previous work. This shows you take data governance seriously.

Prepare for Technical Questions

Expect to tackle some technical challenges during the interview. Practice optimising SQL queries and transforming semi-structured data, as these skills are key for the position. You might even be asked to solve a problem on the spot!

Show Your Collaborative Spirit

Smoove values teamwork, so be ready to share examples of how you've worked with analysts and engineers in the past. Highlight your communication skills and how you ensure datasets meet business needs through collaboration.