Data Engineer

Data Engineer

Full-Time 60000 - 75000 € / year (est.) No home office possible
CUBE

At a Glance

  • Tasks: Design and build data pipelines for regulatory intelligence using cutting-edge technology.
  • Company: Join CUBE, a leader in regulatory intelligence with a focus on innovation.
  • Benefits: Competitive salary, inclusive culture, and opportunities for professional growth.
  • Other info: Collaborative environment with opportunities to work on exciting projects.
  • Why this job: Make a real impact by transforming complex data into AI-ready assets.
  • Qualifications: 3+ years in data engineering, strong SQL and Python skills required.

The predicted salary is between 60000 - 75000 € per year.

We’re looking for a Data Engineer to join our Data and AI Engineering team and help build the pipelines, transformations, and infrastructure that power CUBE’s regulatory intelligence platform. This is hands‑on engineering work with real scope. You’ll be designing and building data pipelines that ingest, process, and serve complex regulatory content—turning unstructured source data into clean, governed, AI‑ready assets. The work sits at the intersection of data infrastructure and product capability, directly enabling the analytical and AI workloads that define what CUBE does. You’ll be working in an Azure‑native environment, collaborating closely with data architects, platform engineers, and AI/ML teams. Our post‑acquisition business integrates multiple platforms, offering genuine complexity and opportunity to shape how things get built, from greenfield work alongside legacy reality.

Responsibilities

  • Design and build data pipelines — Build, maintain, and optimise data pipelines that ingest, transform, and deliver structured and unstructured regulatory content across our platform estate.
  • Transform and model data — Apply transformation logic that converts raw source data into clean, reliable, semantically consistent assets ready for analytics and AI consumption.
  • Implement data quality and observability practices — Instrument pipelines with monitoring, alerting, and data quality checks that catch problems early and maintain platform trust.
  • Collaborate with architects and platform engineers — Work closely with the Principal Data Architect and Head of Data Platform to implement patterns that align with our architectural direction.
  • Support integration and migration work — Contribute to source‑to‑target mapping and pipeline development for ongoing platform consolidation.
  • Champion engineering best practices — Write code that others can maintain: version‑controlled, tested, documented, and built for production.
  • Contribute to platform scalability and cost efficiency — Identify and resolve performance bottlenecks, redundancies, and inefficiencies in existing pipeline infrastructure.
  • Build for AI readiness — Understand how downstream AI/ML workloads consume data and design pipelines that support feature engineering, model training, and inference requirements.

Core Qualifications

  • 3+ years of experience in data engineering or a closely related role.
  • Strong SQL and Python skills—you write production‑quality code, not just scripts.
  • Hands‑on experience building and maintaining data pipelines in cloud environments.
  • Familiarity with ETL/ELT patterns, orchestration tools (e.g. Apache Airflow, dbt, Azure Data Factory), and data transformation frameworks.
  • Experience working with both structured and unstructured or semi‑structured data.
  • Understanding of data quality principles—you know what a bad pipeline looks like and how to fix it.
  • Comfort with version control, CI/CD practices, and engineering‑grade delivery.

Preferred Qualifications

  • Experience with Microsoft Azure data services — Azure Data Factory, Synapse Analytics, Data Lake Storage, Fabric.
  • Familiarity with Apache Spark for large‑scale data processing.
  • Exposure to data modelling concepts — normalisation, dimensional design, entity‑relationship patterns.
  • Background in platform integration, data migration, or M&A consolidation work.
  • Experience building pipelines that support AI/ML workloads, including feature stores or model training infrastructure.
  • Knowledge of data governance practices — lineage, cataloguing, access control, compliance.
  • Familiarity with infrastructure‑as‑code tooling (e.g. Terraform).
  • Exposure to regulatory, financial services, or compliance data domains.

Mindset

  • You care about the quality of your output — not just whether the pipeline runs, but whether it’s maintainable, observable, and trustworthy.
  • You’re comfortable working with ambiguity and systems that weren’t built the way you’d have built them.
  • You communicate clearly with both engineers and non‑engineers.
  • You take ownership — when something breaks, you fix it; when something could be better, you say so.

CUBE is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Data Engineer employer: CUBE

CUBE is an exceptional employer for Data Engineers, offering a dynamic work environment where you can directly influence the development of cutting-edge regulatory intelligence solutions. With a strong focus on collaboration and innovation, employees benefit from opportunities for professional growth, hands-on experience with advanced technologies in an Azure-native setting, and a commitment to diversity and inclusion that fosters a supportive workplace culture.

CUBE

Contact Detail:

CUBE Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Data Engineer

Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio showcasing your data pipelines and projects. This is your chance to demonstrate your hands-on experience and make a lasting impression on potential employers.

Tip Number 3

Prepare for technical interviews by brushing up on your SQL and Python skills. Practice coding challenges and be ready to discuss your past projects in detail—this is where you can really shine!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our team.

We think you need these skills to ace Data Engineer

Data Pipeline Development
SQL
Python
ETL/ELT Patterns
Apache Airflow
Azure Data Factory
Data Transformation Frameworks

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Data Engineer role. Highlight your experience with data pipelines, SQL, and Python, and don’t forget to mention any cloud environments you’ve worked in. We want to see how your skills match what we’re looking for!

Showcase Your Projects:Include specific projects where you've built or optimised data pipelines. Talk about the challenges you faced and how you overcame them. This gives us a glimpse into your hands-on experience and problem-solving skills.

Be Clear and Concise:When writing your application, keep it clear and to the point. Use bullet points for easy reading and make sure to highlight your key achievements. We appreciate straightforward communication, so let’s keep it simple!

Apply Through Our Website:Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re keen on joining our team at StudySmarter!

How to prepare for a job interview at CUBE

Know Your Data Engineering Basics

Make sure you brush up on your SQL and Python skills before the interview. Be ready to discuss how you've built and maintained data pipelines in cloud environments, especially with Azure services. They’ll want to see that you can write production-quality code, so be prepared to share examples of your work.

Showcase Your Problem-Solving Skills

Be ready to talk about specific challenges you've faced in data engineering, particularly around data quality and observability. Think of a time when you identified a problem in a pipeline and how you fixed it. This will demonstrate your understanding of what makes a good pipeline and your proactive approach to maintaining quality.

Familiarise Yourself with Their Tech Stack

Since the role involves working with tools like Apache Airflow, dbt, and Azure Data Factory, make sure you know how these tools work. If you have experience with ETL/ELT patterns or data transformation frameworks, be ready to discuss them. Showing that you’re comfortable with their tech stack will give you an edge.

Communicate Clearly and Confidently

During the interview, remember to communicate your thoughts clearly, especially when discussing technical concepts. They value candidates who can explain complex ideas to both engineers and non-engineers. Practice articulating your experiences and how they relate to the job description to ensure you come across as confident and knowledgeable.