Data Engineer (Remote, Python)

Data Engineer (Remote, Python)

Full-Time 55000 - 70000 £ / year (est.) No working from home possible
The Independent

At a Glance

  • Tasks: Build and optimise data pipelines while driving AI and Machine Learning initiatives.
  • Company: Join a dynamic team at a leading media company focused on data-driven insights.
  • Benefits: Remote work, competitive salary, and opportunities for professional growth.
  • Other info: Collaborative environment with a commitment to diversity and continuous learning.
  • Why this job: Be at the forefront of AI and ML, making a real impact in the media industry.
  • Qualifications: Experience in Python, SQL, and cloud platforms; strong problem-solving skills.

The predicted salary is between 55000 - 70000 £ per year.

Reports to: Director of Data Science & Engineering

We are looking for a Data Engineer with Machine Learning experience to join our team. They will be responsible for maintaining and supporting the existing data infrastructure used to underpin our data analytics and reporting. The Data Engineer will own the company's drive into Machine Learning and AI adoption. They will be accountable for building as well as owning new Machine Learning and AI solutions across the team and the wider business. These must complement our current, scalable data architecture.

The Data Engineer will support the customer analytics function by developing and deploying machine learning models and techniques to deliver value around our consumer (B2C) data. They will be expected to optimize the architecture of our data pipelines and ensure that data flows support various cross-functional teams across the business. The ideal candidate will have a self-directed, innovative mindset who is comfortable supporting the data needs of multiple teams. The right candidate will be proactive in identifying and implementing improvements for our systems contributing constructively to the current data ecosystem.

As the business continues to invest in cloud solutions, particularly Google Cloud Platform, you will be excited by the prospect of owning new projects, propelling our data initiatives and capabilities. Lead Data Engineer will be exposed to best practice methods with the current framework.

The Data & Marketing Department is the intelligence powerhouse of the business, representing the customer. Our goal is to inspire, engage and grow a loyal readership by harnessing data-driven insights and innovative marketing to deliver personalised, impactful content experiences that build lasting trust, diverse revenue streams, and a thriving future for quality journalism.

An accomplished ML/Data Engineer with proven experience deploying and managing ML and AI based services in production. Strong SQL skills and hands-on experience with both relational and non-relational databases, supporting data needs in fast-paced, content-driven environments. Strong expertise in designing and maintaining scalable data pipelines and architectures—integrating data from web analytics, content management systems (CMS), subscription platforms, ad tech, and social media.

Proven ability to automate and optimise data workflows, using modern ETL/ELT tools (e.g., Airflow, dbt, Apache Spark) to ensure timely and reliable delivery of data. Experience building robust data models and reporting layers to support performance dashboards, user engagement analytics, ad revenue tracking, and A/B testing frameworks. Skilled in cloud-based data platforms and infrastructure (e.g., AWS, GCP), ensuring scalability and security for large volumes of streaming and batch data. Additionally, experience with data warehouses such as BigQuery or Snowflake.

Adept in Python and/or Java for developing data services and integrating APIs to bring in diverse sources of media data. Exposure to utilising AI and Agentic workflows to build and deploy models agents and coding pipelines. Ideally having experience in Gemini, Claude and Vertex AI. Exposure to ML Frameworks such as PyTorch or TensorFlow. Understanding of ML Ops such as automated model testing, monitoring and tuning.

Excellent interpersonal and communication skills, enabling effective collaboration with analytical and commercial teams to turn data into actionable insights. Proactive and self-driven, capable of managing multiple data projects in a high tempo setting while meeting tight deadlines. A continuous learner with a diligent approach to data engineering including data privacy.

Diversity, Equity and Inclusion: We champion diversity in our teams and in our reporting. As a growing and global brand, we must have a workforce that’s more representative of our readers, viewers, clients and partners, and a workplace that creates a sense of belonging for everyone.

Design and Maintain Data Pipelines: Develop and maintain robust, scalable, and efficient data pipeline architecture to support current and future business needs. Identify, design, and implement improvements to automate manual processes, enhance data delivery performance, and re-architect infrastructure for improved scalability and resilience.

AI Agentic Workflows: Build on the current usage of AI across the business rolling out new processes and models.

Machine Learning: Take the lead in building out our capability in this area with new models and recommender systems.

ETL Development and Infrastructure Building: Build and manage the infrastructure necessary for optimal ETL or ELT of data using Python, SQL, and Google Cloud Platform (GCP) big data technologies, such as BigQuery, Dataflow, Dataproc and Cloud Storage.

Business Intelligence Enablement: Prepare and transform pipeline data to support downstream analytics and feed BI tools (DOMO), enabling data-driven decision-making across the organization. Partner with internal stakeholders—ranging from Data, Commercial, and Editorial teams to executive leadership—to address data-related technical challenges and support their infrastructure needs.

Enhance Data System Functionality: Collaborate with the Data Team to continuously improve the functionality, flexibility, and performance of data systems and platforms.

Data Governance and Compliance: Ensure all data is handled responsibly, securely, and in full compliance with the Data Protection Act, GDPR regulations, and the Company’s Code of Conduct.

SQL and Database Expertise: Strong working knowledge of SQL with hands-on experience querying and managing relational databases, alongside familiarity with a variety of database technologies (e.g., PostgreSQL, MySQL, BigQuery).

Big Data Engineering: Proven experience designing, building, and optimizing ‘big data’ pipelines, architectures, and datasets, enabling efficient data processing at scale. Skilled in performing root cause analysis on complex internal and external data sources and business processes to resolve issues and uncover opportunities for operational or strategic improvements.

Unstructured Data Handling: Strong analytical capability for working with unstructured and semi-structured datasets, transforming raw information into actionable insights.

Data Workflow Development: Expertise in developing and maintaining data transformation processes, managing data structures, metadata, workload dependencies, and orchestration frameworks.

Large-scale Data Processing: Excellent project management and organizational skills, with experience supporting and collaborating with cross-functional teams in dynamic and evolving settings. Holds a graduate degree in Computer Science, STEM related quantitative field, with 2+ years of hands-on experience in a data engineering role.

Databases: Proficient in SQL databases.

Programming Languages: Skilled in one or more of the following languages, i.e.: Python, Java, Go.

Inclusive: We champion diversity in our teams and in our reporting. We take risks and are always looking to try new ideas in pursuit of excellence.

Data Engineer (Remote, Python) employer: The Independent

Join a forward-thinking company that values innovation and diversity, where as a Data Engineer, you will play a pivotal role in shaping the future of Machine Learning and AI within a collaborative and dynamic environment. With a strong emphasis on employee growth, you will have access to cutting-edge technologies and the opportunity to work alongside talented professionals, all while enjoying the flexibility of remote work. Our commitment to fostering a culture of inclusivity and continuous learning ensures that you will thrive both personally and professionally.

The Independent

Contact Details:

The Independent Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Data Engineer (Remote, Python)

Tip Number 1

Network like a pro! Reach out to people in the industry, attend meetups, and connect with potential colleagues on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

Tip Number 2

Show off your skills! Create a portfolio showcasing your data projects, especially those involving Machine Learning and AI. This will give you an edge and demonstrate your hands-on experience to potential employers.

Tip Number 3

Prepare for interviews by brushing up on common technical questions related to data engineering and ML. Practice coding challenges and be ready to discuss your past projects in detail—this is your chance to shine!

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our team.

We think you need these skills to ace Data Engineer (Remote, Python)

Machine Learning
Data Infrastructure Management
Data Pipeline Architecture
SQL
Relational Databases
Non-Relational Databases
ETL/ELT Tools

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Data Engineer role. Highlight your experience with Python, SQL, and any machine learning projects you've worked on. We want to see how your skills align with our needs!

Showcase Your Projects:Include specific examples of data pipelines or machine learning models you've built. We love seeing real-world applications of your skills, so don’t hold back on the details!

Be Clear and Concise:When writing your cover letter, keep it clear and to the point. Explain why you’re excited about the role and how you can contribute to our data initiatives. We appreciate straightforward communication!

Apply Through Our Website:Make sure to apply through our website for the best chance of getting noticed. It’s the easiest way for us to track your application and get back to you quickly!

How to prepare for a job interview at The Independent

Know Your Tech Stack

Make sure you’re well-versed in the technologies mentioned in the job description, especially Python, SQL, and Google Cloud Platform. Brush up on your experience with ETL tools like Airflow and dbt, as well as machine learning frameworks like PyTorch or TensorFlow. Being able to discuss these confidently will show that you're ready to hit the ground running.

Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've optimised data pipelines or solved complex data issues. Use the STAR method (Situation, Task, Action, Result) to structure your answers. This will help interviewers see how you approach challenges and contribute to improving data systems.

Understand the Business Context

Familiarise yourself with the company’s goals and how the Data Engineer role fits into their broader strategy. Be ready to discuss how your work can support customer analytics and enhance data-driven decision-making. This shows that you’re not just a techie but also understand the impact of your work on the business.

Ask Insightful Questions

Prepare thoughtful questions about the team dynamics, current projects, and future data initiatives. This not only demonstrates your interest in the role but also gives you a chance to assess if the company culture aligns with your values, especially regarding diversity and innovation.