At a Glance
- Tasks: Build and optimise data pipelines for AI evaluation, processing massive video data efficiently.
- Company: Wayve is a pioneering company in Embodied AI technology, enhancing automated driving systems.
- Benefits: Enjoy a hybrid working policy, flexible hours, and a culture that values diversity and innovation.
- Why this job: Join a team tackling complex challenges to create safer, smarter autonomous vehicles with real impact.
- Qualifications: Proficiency in Python and SQL, with experience in data processing frameworks and cloud infrastructure.
- Other info: Full-time role based in London; passionate candidates encouraged to apply regardless of meeting all requirements.
The predicted salary is between 36000 - 60000 £ per year.
At Wayve we\’re committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran status, pregnancy or related condition (including breastfeeding) or any other basis as protected by applicable law.
About us
Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.
Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.
In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.
At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.
Make Wayve the experience that defines your career!
Impact expected
Wayve\’s machine learning-first approach relies on high-quality, well-structured data. The Evaluation Workflows and Measurement teams build tools and pipelines that power model evaluation at scale. As we scale our evaluation approaches and tooling, we need to process massive volumes of test data efficiently and reliably.
This Data Engineer will be embedded in the AI Evaluation division to ensure our evaluation and analytics pipelines are robust, performant, and future-proof. Their work will strengthen our data foundations for fast decision-making, accelerate the availability of large-scale image and video analytics, and help us rapidly integrate and leverage data from external partners – enabling faster iteration across both offline and on-road evaluation.
Challenges you will own
- Build scalable and reliable data and analytics pipelines to process and enrich over 1 million hours of driving video data annually and supply mission-critical data to stakeholders across the business.
- Unlock rapid insights by architecting and optimising analytics pipelines that drive company wide development and decision-making.
- Collaborate across functions – including research engineers, simulation experts, robotics engineers, data scientists and safety drivers – to deliver and visualise enriched data.
- Improve pipeline observability, validation, and fault tolerance for production-grade robustness.
- Enable LLM-driven workflows by shaping data to be AI-consumable (e.g. chunking, embeddings, metadata).
- Reduce tech debt and simplify orchestration across Flyte, Databricks, and Azure-based infrastructure.
Example Projects:
- Design and optimise distributed data pipelines to handle large-scale video and image data processing.
- Re-design and optimise existing analytics pipelines.
- Collaborate with the data platform team to integrate pipelines with Databricks for governance and compliance – and unlock massive scale for offline evaluation from third party datasets.
- Shape evaluation data to support future use cases like Retrieval-Augmented Generation (RAG) and natural language analytics.
What we are looking for in our candidate
Essential
- Proficiency in Python and SQL, with experience in frameworks like Pandas, PySpark, and NumPy for large-scale data processing.
- Expertise in debugging and optimising distributed systems with a focus on scalability and reliability.
- Proven ability to design and implement scalable, fault-tolerant ETL pipelines with minimal manual intervention.
- Knowledge of data modelling best practices, including the medallion architecture or comparable frameworks.
- Experience in workflow orchestration using Flyte, dbt, Airflow, or Prefect.
- Strong understanding of unit, integration, and data validation testing using tools like Pytest or Great Expectations.
- Familiarity with cloud infrastructure (preferably Azure) for managing pipelines and storage
- Ability to collaborate closely with stakeholders to understand requirements and shape data pipelines to meet user needs effectively.
Desirable
- 5+ years of experience in a data engineering or similar role
- Experience with Docker, Kubernetes, Databricks
- Familiarity with shaping data for AI/LLM-based systems
This is a full-time role based in our office in London. At Wayve we want the best of all worlds so we operate a hybrid working policy that combines time together in our offices and workshops to fuel innovation, culture, relationships and learning, and time spent working from home. We operate core working hours so you can determine the schedule that works best for you and your team.
#LI-FH1
We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you’re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.
For more information visit Careers at Wayve.
To learn more about what drives us, visit Values at Wayve
DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.
#J-18808-Ljbffr
Staff Data Engineer, AI Evaluation employer: Futureshaper.com
Contact Detail:
Futureshaper.com Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Staff Data Engineer, AI Evaluation
✨Tip Number 1
Familiarise yourself with the specific technologies mentioned in the job description, such as Flyte, Databricks, and Azure. Having hands-on experience or projects that showcase your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Network with current employees or alumni who work at Wayve. Engaging in conversations about their experiences can provide valuable insights into the company culture and expectations, which can help you tailor your approach during interviews.
✨Tip Number 3
Prepare to discuss your previous projects that involved large-scale data processing and pipeline optimisation. Be ready to explain the challenges you faced and how you overcame them, as this will demonstrate your problem-solving abilities and technical expertise.
✨Tip Number 4
Showcase your collaborative skills by preparing examples of how you've worked with cross-functional teams in the past. Highlighting your ability to communicate effectively with stakeholders will be crucial, especially since the role involves collaboration with various experts.
We think you need these skills to ace Staff Data Engineer, AI Evaluation
Some tips for your application 🫡
Understand the Role: Before applying, make sure you fully understand the responsibilities and requirements of the Staff Data Engineer position at Wayve. Tailor your application to highlight how your skills and experiences align with their needs.
Highlight Relevant Experience: In your CV and cover letter, emphasise your proficiency in Python and SQL, as well as your experience with data processing frameworks like Pandas and PySpark. Provide specific examples of projects where you've built scalable data pipelines or optimised distributed systems.
Showcase Collaboration Skills: Wayve values collaboration across functions. In your application, mention instances where you've worked closely with stakeholders or cross-functional teams to deliver impactful data solutions. This will demonstrate your ability to fit into their inclusive culture.
Tailor Your Cover Letter: Craft a compelling cover letter that not only outlines your technical skills but also reflects your passion for AI and self-driving technology. Mention why you want to work at Wayve and how you can contribute to their mission of creating safer automated driving systems.
How to prepare for a job interview at Futureshaper.com
✨Showcase Your Technical Skills
Make sure to highlight your proficiency in Python and SQL during the interview. Be prepared to discuss your experience with frameworks like Pandas, PySpark, and NumPy, as well as any projects where you've designed scalable ETL pipelines.
✨Demonstrate Problem-Solving Abilities
Wayve values candidates who can tackle complex challenges. Prepare examples of how you've debugged and optimised distributed systems, focusing on scalability and reliability. This will show your ability to handle the fast-paced environment they operate in.
✨Emphasise Collaboration Skills
Collaboration is key at Wayve. Be ready to discuss how you've worked with cross-functional teams, such as data scientists and engineers, to deliver and visualise enriched data. Highlight your communication skills and how you understand stakeholder requirements.
✨Familiarity with Cloud Infrastructure
Since Wayve uses Azure for managing pipelines and storage, it’s beneficial to demonstrate your familiarity with cloud infrastructure. Discuss any relevant experience you have with cloud services and how you've utilised them in previous roles.