At a Glance
- Tasks: Design and implement automated validation frameworks for large-scale data workflows.
- Company: Join a fast-moving startup transforming complex datasets into actionable insights.
- Benefits: Enjoy competitive salary, flexible work hours, and comprehensive health benefits.
- Why this job: Make a real impact on data quality and reliability in a mission-driven environment.
- Qualifications: 5+ years in data engineering or QA with strong Python and PySpark skills.
- Other info: Collaborative team culture with opportunities for professional growth and development.
The predicted salary is between 43200 - 72000 ÂŁ per year.
Atreides helps organizations transform large and complex multiâmodal datasets into informationârich geoâspatial data subscriptions that can be used across a wide spectrum of use cases. Currently, Atreides focuses on providing highâfidelity data solutions to enable customers to derive insights quickly. We are a fastâmoving, highâperformance startup. We value a diverse team and believe inclusion drives better performance. We trust our team with autonomy, believing it leads to better results and job satisfaction. With a missionâdriven mindset and entrepreneurial spirit, we are building something new and helping unlock the power of massiveâscale data to make the world safer, stronger, and more prosperous.
We are a passionate team of technologists, data scientists, and analysts with backgrounds in operational intelligence, law enforcement, large multinationals, and cybersecurity operations. We obsess about designing products that will change the way global companies, governments and nonprofits protect themselves from external threats and global adversaries.
We are seeking a QA Automation Data Engineer to ensure the correctness, performance, and reliability of our data pipelines, data lakes, and enrichment systems. In this role, you will design, implement, and maintain automated validation frameworks for our largeâscale data workflows. You will work closely with data engineers, analysts, and platform engineers to embed test coverage and data quality controls directly into the CI/CD lifecycle of our ETL and geospatial data pipelines.
You should be deeply familiar with test automation in data contexts, including schema evolution validation, edge case generation, null/duplicate detection, statistical drift analysis, and pipeline integration testing. This is not a manual QA role â you will write code, define test frameworks, and help enforce reliability through automation.
Team Principles:
- Remain curious and passionate in all aspects of our work
- Promote clear, direct, and transparent communication
- Embrace the 'measure twice, cut once' philosophy
- Value and encourage diverse ideas and technologies
- Lead with empathy in all interactions
Responsibilities:
- Develop automated test harnesses for validating Spark pipelines, Iceberg table transformations, and Pythonâbased data flows.
- Implement validation suites for data schema enforcement, contract testing, and null/duplication/anomaly checks.
- Design test cases for validating geospatial data processing pipelines (e.g., geometry validation, bounding box edge cases).
- Integrate data pipeline validation with CI/CD tooling.
- Monitor and alert on data quality regressions using metricâdriven validation (e.g., row count deltas, join key sparsity, referential integrity).
- Write and maintain mock data generators and propertyâbased test cases for data edge cases and corner conditions.
- Contribute to team standards for testing strategy, coverage thresholds, and release readiness gates.
- Collaborate with data engineers on pipeline observability and reproducibility strategies.
- Participate in root cause analysis and postâmortems for failed data releases or quality incidents.
- Document infrastructure design, data engineering processes, and maintain comprehensive documentation.
Desired Qualifications:
- 5+ years of experience in data engineering or data QA roles with automation focus.
- Strong proficiency in Python and PySpark, including writing testable, modular data code.
- Experience with Apache Iceberg, Delta Lake, or Hudi, including schema evolution and partitioning.
- Familiarity with data validation libraries (e.g., Great Expectations, Deequ, Soda SQL) or homegrown equivalents.
- Understanding of geospatial formats (e.g., GeoParquet, GeoJSON, Shapefiles) and related edge cases.
- Experience with test automation frameworks such as pytest, hypothesis, unittest, and integration with CI pipelines.
- Familiarity with cloudânative data infrastructure, especially AWS (Glue, S3, Athena, EMR).
- Knowledge of data lineage, data contracts, and observability tools is a plus.
- Strong communication skills and the ability to work crossâfunctionally with engineers and analysts.
Youâll Succeed If You:
- Enjoy catching issues before they hit production and designing coverage to prevent them.
- Believe that data quality is a firstâclass concern, not an afterthought.
- Thrive in environments where automated tests are part of the engineering pipeline, not separate from it.
- Can bridge the gap between engineering practices and analytics/ML testing needs.
- Have experience debugging distributed failures (e.g., skewed partitions, schema mismatches, memory pressure).
Compensation and Benefits:
- Competitive salary
- Comprehensive health, dental, and vision insurance plans
- Flexible hybrid work environment
- Additional benefits like flexible hours, work travel opportunities, competitive vacation time and parental leave
While meeting all of these criteria would be ideal, we understand that some candidates may meet most, but not all. If youâre passionate, curious and ready to "work smart and get things done," weâd love to hear from you.
Senior QA Automation Engineer (UK) in London employer: Atreides LLC.
Contact Detail:
Atreides LLC. Recruiting Team
StudySmarter Expert Advice đ¤Ť
We think this is how you could land Senior QA Automation Engineer (UK) in London
â¨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. A friendly chat can lead to opportunities that arenât even advertised yet.
â¨Tip Number 2
Prepare for interviews by practising common questions and scenarios related to QA automation. We recommend doing mock interviews with friends or using online platforms to get comfortable.
â¨Tip Number 3
Showcase your skills through personal projects or contributions to open-source. This not only boosts your portfolio but also demonstrates your passion and expertise in QA automation.
â¨Tip Number 4
Donât forget to apply through our website! Itâs the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take that extra step!
We think you need these skills to ace Senior QA Automation Engineer (UK) in London
Some tips for your application đŤĄ
Tailor Your CV: Make sure your CV is tailored to the role of Senior QA Automation Data Engineer. Highlight your experience with data pipelines, automation frameworks, and any relevant technologies like Python and PySpark. We want to see how your skills align with our needs!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about data quality and automation. Share specific examples of how you've tackled similar challenges in the past. We love hearing your story!
Showcase Your Technical Skills: Donât forget to showcase your technical skills in your application. Mention your experience with tools like Apache Iceberg or Delta Lake, and any test automation frameworks youâve used. Weâre looking for someone who can hit the ground running!
Apply Through Our Website: We encourage you to apply through our website for a smoother process. It helps us keep track of applications and ensures you get the best experience. Plus, it shows youâre keen on joining our team at Atreides!
How to prepare for a job interview at Atreides LLC.
â¨Know Your Tech Stack
Make sure youâre well-versed in the technologies mentioned in the job description, especially Python, PySpark, and any data validation libraries. Brush up on your knowledge of Apache Iceberg and geospatial formats, as these will likely come up during technical discussions.
â¨Showcase Your Automation Skills
Prepare to discuss your experience with test automation frameworks like pytest and how you've integrated them into CI/CD pipelines. Be ready to share specific examples of how you've designed automated tests for data workflows and the impact they had on data quality.
â¨Emphasise Collaboration
Since this role involves working closely with data engineers and analysts, highlight your teamwork skills. Share examples of past collaborations where you contributed to improving data pipeline observability or resolving quality incidents through effective communication.
â¨Ask Insightful Questions
Prepare thoughtful questions about the companyâs approach to data quality and their testing strategies. This shows your genuine interest in the role and helps you gauge if their values align with yours, especially regarding curiosity and innovation.