Staff Data Engineer, Multimodal AI Platform

Staff Data Engineer, Multimodal AI Platform

Full-Time 80000 - 100000 £ / year (est.) No working from home possible
Dormont Manufacturing Co

At a Glance

  • Tasks: Design and implement scalable data pipelines for our multimodal AI platform.
  • Company: Join Fox Corporation, a leader in media and entertainment innovation.
  • Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
  • Other info: Collaborative environment with mentorship opportunities and a focus on cutting-edge tech.
  • Why this job: Be at the forefront of AI technology and shape the future of content intelligence.
  • Qualifications: Extensive experience in data engineering and proficiency in Python and SQL.

The predicted salary is between 80000 - 100000 £ per year.

We are seeking a Staff Data Engineer to join our Emerging Tech team and define the data architecture powering our multimodal AI platform. You will set the technical vision and drive the implementation of scalable data pipelines, lakehouse infrastructure, and data platform capabilities that enable content intelligence, semantic search, recommendation, and personalization at scale — while raising the engineering bar across the data team.

Snapshot of Your Responsibilities

  • Act as the technical anchor for your product pillar — collaborate closely with product leaders, ML engineers, backend engineers, editorial, and merchandising teams to translate ambiguous goals into clear technical designs, and communicate decisions effectively to both technical and non‑technical stakeholders.
  • Design and hands‑on implement high‑throughput batch and streaming data pipelines for multimodal content — including media segments, metadata, transcripts, and engagement signals — and architect the data models and ML feature stores that support them.
  • Build scalable data ingestion frameworks across heterogeneous sources including media processing systems, AI inference services, and user engagement events; partner with ML engineers to define feature‑ready data contracts for model training and inference, including embedding generation and vector storage.
  • Own data governance, lineage tracking, and quality frameworks; design observability and alerting to ensure data integrity and SLA compliance at scale.
  • Drive pipeline performance optimization and cloud cost management; lead adoption of CI/CD and infrastructure‑as‑code practices across the team.
  • Mentor data engineers at all levels, conduct design and code reviews, and evaluate emerging technologies to ensure the team’s technical decisions align with platform strategy, security, and compliance requirements.

What You Will Need

  • Extensive data engineering experience operating production systems at scale in global engineering organizations.
  • Expert‑level proficiency in Python and SQL for large‑scale data processing and transformation.
  • Deep experience with distributed data processing frameworks (Apache Spark, Apache Flink, or equivalent) and streaming architectures (Kafka, Spark Structured Streaming) for both batch and real‑time workloads at terabyte scale.
  • Proven experience building data infrastructure for LLM and generative AI workloads — including training data preparation, embedding generation, and vector storage.
  • Proven ability to provide technical clarity in ambiguous environments — translating loosely defined product goals into actionable architecture decisions and driving alignment across engineering, ML, and product stakeholders.
  • Strong cloud platform experience on GCP, AWS, or Azure with hands‑on infrastructure‑as‑code (Terraform or CDK) and DevOps practices.
  • Deep understanding of data warehousing, data mesh principles, and open table format standards (Apache Iceberg, Delta Lake, or Apache Hudi).
  • Ownership mindset with end‑to‑end accountability for architecture, implementation, and production operations.

Nice to Have, But Not a Dealbreaker

  • Experience with managed lakehouse platforms (Databricks or equivalent) and their ecosystem tooling.
  • Knowledge of media data formats, content metadata standards, or media processing pipelines.
  • Experience with observability and monitoring systems (Datadog, Grafana, or OpenTelemetry).
  • Experience leading data platform migrations or large‑scale data infrastructure initiatives.
  • Contributions to open‑source data engineering projects or active participation in the data engineering community.
  • Curiosity and enthusiasm for multimodal AI, generative AI, and LLM‑powered applications.

Staff Data Engineer, Multimodal AI Platform employer: Dormont Manufacturing Co

At Fox Corporation, we pride ourselves on being an exceptional employer that fosters a culture of creativity and innovation. As a Staff Data Engineer in our Emerging Tech team, you will have the opportunity to work with cutting-edge technologies while collaborating with diverse teams to shape the future of multimodal AI. We offer a supportive environment that encourages professional growth, mentorship, and the chance to make a significant impact within a globally recognised organisation.

Dormont Manufacturing Co

Contact Details:

Dormont Manufacturing Co Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Staff Data Engineer, Multimodal AI Platform

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Fox or similar companies. A friendly chat can open doors and give you insights that job descriptions just can't.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repo showcasing your data engineering projects. This is your chance to demonstrate your expertise in Python, SQL, and any cool frameworks you've worked with.

Tip Number 3

Prepare for interviews by brushing up on your technical knowledge and soft skills. Be ready to discuss how you've tackled ambiguous problems and collaborated with cross-functional teams—just like the role requires!

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in joining the team at Fox.

We think you need these skills to ace Staff Data Engineer, Multimodal AI Platform

Data Engineering
Python
SQL
Apache Spark
Apache Flink
Kafka
Cloud Platforms (GCP, AWS, Azure)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Staff Data Engineer role. Highlight your experience with data pipelines, cloud platforms, and any relevant projects that showcase your skills in Python and SQL. We want to see how your background aligns with our needs!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about multimodal AI and how your expertise can contribute to our team. Keep it concise but impactful, and don’t forget to mention specific projects or achievements.

Showcase Your Technical Skills:In your application, be sure to highlight your technical skills, especially around distributed data processing frameworks and cloud platforms. We love seeing concrete examples of how you've tackled challenges in previous roles, so don’t hold back!

Apply Through Our Website:We encourage you to apply through our website for the best chance of getting noticed. It’s super easy, and you’ll be able to keep track of your application status. Plus, we love seeing candidates who take the initiative to connect directly with us!

How to prepare for a job interview at Dormont Manufacturing Co

Know Your Data Engineering Stuff

Make sure you brush up on your data engineering skills, especially in Python and SQL. Be ready to discuss your experience with distributed data processing frameworks like Apache Spark or Kafka, as well as any hands-on work you've done with cloud platforms like GCP or AWS.

Showcase Your Problem-Solving Skills

Prepare to share examples of how you've tackled ambiguous goals in the past. Think about specific projects where you translated vague requirements into clear technical designs, and be ready to explain your thought process to both technical and non-technical folks.

Get Familiar with Multimodal AI

Since this role focuses on multimodal AI, it’s a good idea to understand the basics of generative AI and LLMs. Be prepared to discuss how you've built data infrastructure for these workloads and any relevant experiences you have with embedding generation or vector storage.

Ask Smart Questions

Interviews are a two-way street! Prepare thoughtful questions about the team’s current challenges, their data governance practices, or how they approach pipeline performance optimisation. This shows your genuine interest in the role and helps you gauge if it's the right fit for you.