Software Engineer - Data, Lakehouse and AI Data Platform Engineer - Vice President - London

Software Engineer - Data, Lakehouse and AI Data Platform Engineer - Vice President - London

London Full-Time 100000 - 120000 € / year (est.) No home office possible
Goldman Sachs Group, Inc.

At a Glance

  • Tasks: Design and build data pipelines for a cutting-edge AI data platform.
  • Company: Join Goldman Sachs, a leader in finance and technology.
  • Benefits: Enjoy competitive pay, top-notch benefits, and a dynamic work environment.
  • Other info: Collaborative culture with opportunities for growth and learning.
  • Why this job: Make a real impact by shaping the future of data and AI.
  • Qualifications: Strong programming skills in Python or Java and SQL knowledge required.

The predicted salary is between 100000 - 120000 € per year.

The Opportunity

Join a team building the data foundations that support the firm’s AI and analytics capabilities. This role sits within the engineering effort to develop a modern Lakehouse and AI data platform that enables reliable, well-governed and high-performing data use across the firm. At Goldman Sachs, engineering teams are positioned at the centre of the business, building scalable systems, solving complex technical problems and turning data into action. In data engineering roles, the emphasis is on designing, building and maintaining large-scale data platforms, delivering production pipelines, improving reliability and quality, and partnering closely with users of the platform.

This is a delivery-focused role for engineers who want to build robust data assets in production, work with modern data technologies, and grow over time within the firm. You will contribute to the data models, pipelines and platform capabilities that underpin analytics, operational decision-making and emerging AI use cases.

Role Summary

As a Data Engineer, Lakehouse and AI Data Platform, you will design, build, test and support data pipelines and curated datasets on the firm’s modern data platform. You will work across ingestion, transformation, modelling, optimisation and data quality, helping to deliver data products that are reliable, scalable and fit for purpose. The role is suited to engineers who are comfortable writing code, working with SQL and distributed data processing, and solving practical delivery problems in a team environment. More experienced candidates may also contribute to technical design, platform standards and the shaping of delivery approaches across a wider set of use cases.

Key Responsibilities

  • Build, enhance and support batch and streaming data pipelines on the Lakehouse and AI data platform.
  • Refactor or modernise existing data flows where needed to improve reliability, performance and maintainability.
  • Ensure data pipelines are production-ready, well tested and operationally supportable.

Data Modelling and Curation

  • Develop raw, refined and curated datasets that support analytics, reporting and AI use cases.
  • Apply sound data modelling principles to represent business entities, relationships and historical change accurately.
  • Work with consumers to shape data products that are usable, well documented and aligned to business needs.

Data Quality and Reconciliation

  • Implement controls to validate completeness, accuracy and consistency of data across pipelines and datasets.
  • Use reconciliation approaches to build confidence in production outputs and investigate breaks where they arise.
  • Contribute to clear standards for testing, monitoring and issue resolution.

Delivery and Partnership

  • Work closely with engineers, platform teams and data consumers to deliver agreed outcomes to time and quality expectations.
  • Communicate clearly on progress, risks, dependencies and design choices.
  • Take a broader role in technical leadership, task breakdown and support for junior engineers.

Skills and Experience

Required

  • Bachelor’s or master’s degree in a relevant discipline, or equivalent practical experience, with evidence of strong quantitative skills or data engineering expertise.
  • Strong hands-on programming experience in Python or Java.
  • Good working knowledge of SQL, including troubleshooting, optimisation and data analysis.
  • Ability to learn new tools, internal platforms and delivery workflows quickly.
  • Familiarity with software engineering fundamentals, including version control, testing, release discipline and CI/CD practices.

Data Engineering Capability

  • Stronger ownership of technical design across multiple datasets or pipeline domains.
  • Experience guiding implementation standards, code quality and engineering practices within a team.
  • Ability to lead delivery for a workstream, manage dependencies and support less experienced engineers.
  • Understanding of temporal data modelling, including the handling of historical state and change over time.
  • Knowledge of schema design, schema evolution and data compatibility considerations.
  • Understanding of partitioning, clustering and other techniques used to improve data performance at scale.
  • Ability to make sensible design choices across normalised and denormalised models, and between natural and surrogate keys.
  • Practical approach to data quality, reconciliation and root-cause analysis.
  • Experience building or supporting production data pipelines in a collaborative engineering environment.
  • Experience working with distributed data processing frameworks such as Apache Spark.
  • Working knowledge of common data formats such as JSON, Avro and Parquet.

Technology Environment

The role will involve working with a modern and evolving data stack. Candidates are not expected to have deep expertise in every tool from day one but should bring relevant experience and the ability to work across comparable technologies. Examples of technologies in scope include:

  • Data processing and logic: ANSI SQL, Apache Spark, Kafka
  • Platforms and storage: Snowflake, Apache Iceberg, Databricks, Hadoop ecosystem technologies, Sybase IQ
  • Engineering and deployment: CI/CD tooling, containerised or Kubernetes-based deployment approaches where relevant

You'll also work with internal data management and platform tooling, so a practical and adaptable engineering mindset is important.

What We Are Looking For

We are looking for engineers who can deliver well-structured, reliable solutions in production and who take ownership of the quality of what they build. The role suits candidates who are technically strong, pragmatic and comfortable working in a fast-paced environment where data platforms support important business outcomes.

Stronger candidates will typically demonstrate:

  • Sound judgement in technical trade-offs
  • Attention to detail in data correctness and testing
  • A clear and structured approach to problem solving
  • Willingness to work closely with stakeholders and partner teams
  • An interest in developing long-term expertise within the firm

Software Engineer - Data, Lakehouse and AI Data Platform Engineer - Vice President - London employer: Goldman Sachs Group, Inc.

Goldman Sachs is an exceptional employer, offering a dynamic work environment in London where engineering teams are at the forefront of innovation. With a strong emphasis on employee growth, you will have the opportunity to develop your skills in cutting-edge data technologies while contributing to impactful projects that drive the firm's AI and analytics capabilities. The collaborative culture fosters teamwork and technical leadership, ensuring that every engineer can thrive and make meaningful contributions to the business.

Goldman Sachs Group, Inc.

Contact Detail:

Goldman Sachs Group, Inc. Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Software Engineer - Data, Lakehouse and AI Data Platform Engineer - Vice President - London

Tip Number 1

Network like a pro! Reach out to current employees at Goldman Sachs or similar firms on LinkedIn. Ask them about their experiences and any tips they might have for landing a role like the one you're after.

Tip Number 2

Prepare for technical interviews by brushing up on your coding skills, especially in Python or Java. Practice solving data engineering problems and be ready to discuss your past projects and how you tackled challenges.

Tip Number 3

Showcase your passion for data! During interviews, share your thoughts on emerging AI technologies and how they can impact data platforms. This will demonstrate your enthusiasm and forward-thinking mindset.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets noticed. Plus, it shows you’re serious about joining the team and ready to contribute to building robust data assets.

We think you need these skills to ace Software Engineer - Data, Lakehouse and AI Data Platform Engineer - Vice President - London

Python
Java
SQL
Data Modelling
Data Quality Assurance
Data Pipeline Development
Apache Spark

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that match the job description. Highlight your programming experience in Python or Java, and any relevant data engineering projects you've worked on. We want to see how you can contribute to our Lakehouse and AI data platform!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about data engineering and how your background aligns with our needs. Be sure to mention your experience with SQL and distributed data processing, as these are key for us.

Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled complex technical problems in the past. We love candidates who can demonstrate sound judgement in technical trade-offs and a structured approach to problem-solving. This will set you apart!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team at Goldman Sachs!

How to prepare for a job interview at Goldman Sachs Group, Inc.

Know Your Tech Stack

Familiarise yourself with the technologies mentioned in the job description, like SQL, Apache Spark, and Kafka. Be ready to discuss your experience with these tools and how you've used them in past projects.

Showcase Your Problem-Solving Skills

Prepare examples of complex technical problems you've solved in previous roles. Highlight your thought process and the steps you took to arrive at a solution, especially in data engineering contexts.

Understand Data Quality Principles

Brush up on data quality and reconciliation techniques. Be prepared to discuss how you've implemented controls to ensure data accuracy and consistency in your previous work.

Communicate Clearly

Practice articulating your thoughts clearly and concisely. During the interview, make sure to communicate your progress on projects, any risks you've encountered, and how you've collaborated with teams to achieve outcomes.