At a Glance
- Tasks: Design and maintain scalable data pipelines using Python and Azure technologies.
- Company: Join Vantage Data Centers, a leader in innovative data solutions.
- Benefits: Enjoy competitive pay, health perks, flexible work, and growth opportunities.
- Other info: Collaborative culture focused on innovation and personal development.
- Why this job: Make an impact in a fast-paced environment with cutting-edge technology.
- Qualifications: 3-5 years in data engineering, strong Python and SQL skills required.
The predicted salary is between 60000 - 80000 £ per year.
Vantage Data Centers powers, cools, protects and connects the technology of the world’s well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in flexible environments that can scale as quickly as the market demands.
This position will be based at our office in London in alignment with our flexible work policy. (3 days on site, 2 days from home). Vantage Data Centers is seeking a Mid‐Level Data Engineer to help build, operate, and scale our enterprise data platform. This role is designed for an engineer who can operate independently, execute reliably in a fast‐paced environment, and take ownership of data pipelines and datasets with minimal ramp‐up.
As part of the Data Engineering & Business Intelligence team, you will be responsible for delivering production‐ready data solutions that support analytics, reporting, and emerging AI‐enabled use cases. You will work closely with senior data engineers and business partners, but this role assumes a self‐starter mindset with the ability to move from requirements to implementation without constant oversight. Success in this position requires comfort with ambiguity, strong execution discipline, and accountability for results.
Essential Job Functions- Design, build, and maintain reliable, scalable data pipelines using Python and PySpark on the Microsoft Azure data platform.
- Develop and operate batch and incremental data pipelines leveraging Azure Data Factory for orchestration and Azure Data Lake Storage Gen2 as the primary data store.
- Independently implement SQL- and Spark‐based transformations to produce curated datasets that support enterprise reporting, analytics, and downstream consumption.
- Take ownership of assigned data pipelines and datasets, including monitoring, troubleshooting, and performance optimization in production environments.
- Work with Azure Synapse (dedicated or serverless where applicable) to support analytical workloads and data consumption patterns.
- Collaborate with business analysts and cross‐functional stakeholders to translate data requirements into practical, working data solutions.
- Prepare and structure data to support advanced analytics and AI‐enabled use cases by ensuring data quality, consistency, and documentation.
- Apply established data governance, security, and engineering standards to ensure compliant, maintainable, and scalable solutions.
- Participate in code reviews, technical discussions, and platform improvement initiatives as an active contributor.
- Proactively identify data quality issues, pipeline risks, and improvement opportunities, and communicate them clearly in a fast‐paced environment.
- Develop and maintain PySpark notebooks and jobs to ingest, transform, and curate data within the enterprise data platform.
- Build and modify Azure Data Factory pipelines for batch and incremental data ingestion.
- Implement Spark‐based transformations that write curated datasets to Azure Data Lake Storage Gen2 using established folder structures and naming conventions.
- Create and maintain SQL views and tables in Azure Synapse to support analytics and reporting use cases.
- Respond to pipeline failures, data validation issues, and operational alerts.
- Perform basic performance tuning of Spark jobs (e.g., partitioning, filtering, incremental logic) within established architectural patterns and standards.
- Validate data outputs with business partners and address data defects or discrepancies.
- Commit code using Git, follow branching standards, and participate in pull request reviews.
- Update documentation for pipelines, datasets, and operational runbooks as changes are made.
- Execute assigned backlog items within sprint timelines and raise risks or blockers early.
- Additional duties as assigned by management.
- Bachelor’s degree in Engineering, Computer Science, Data Analytics, or a related field, or equivalent experience.
- Minimum of 3–5 years of experience in data engineering or analytics engineering.
- Proficiency in Python for building and maintaining data pipelines, automation, and data processing workflows, including use of PySpark.
- Proficiency in SQL for querying, transformation, and analytical data processing.
- Solid understanding of ETL/ELT pipelines, data transformation patterns, and data integration concepts.
- Experience analyzing enterprise data sources to identify data relationships, transformations, and business rules.
- Experience building solutions on the Microsoft Azure platform with exposure to services such as Azure Data Factory, Azure Synapse, Azure Data Lake Storage Gen2, and related analytics services.
- Experience working with source control and CI/CD workflows using tools such as GitHub or Azure DevOps.
- Working knowledge of data modeling fundamentals, including fact and dimension tables.
- Strong communication and interpersonal skills with the ability to collaborate across teams in a fast‐paced environment.
- Experience working in Agile development environments.
- Experience using collaboration and project tracking tools such as Jira or similar tools.
- Travel required is expected to be up to 10% but may increase over time as the business evolves.
- Experience working with distributed data processing frameworks, including Apache Spark.
- Exposure to advanced analytics or AI‐adjacent data use cases, including preparing data for machine learning or intelligent applications.
- Familiarity with additional Azure services such as Azure Functions or Logic Apps in support of data workflows.
- Experience supporting data platform enhancement, refactoring, or modernization initiatives.
- Familiarity with data quality, reliability, and operational best practices in production environments.
- Experience working in a scaling or fast‐paced organization where priorities evolve quickly.
The Technology & Systems department drives technological innovation for the company and advances the technology strategy to support global growth. This includes IT, Software Development, OT / Automation Systems, and business process improvement. At Vantage, we are very hands on. In most cases, we specify, purchase, configure and maintain all networking and server hardware. We also work closely with partner VARs to learn about the latest technology changes so we can make informed purchase decisions. The Technology & Systems department participates in designing each of our new data center building’s networking infrastructure, including but not limited to diverse pathways connecting to carriers, meet-me room (MMR) hosting (rack, cabs, ladders, cross connect panels...), wireless AP coverage, etc.
We rely on extreme collaboration with Sales, Construction, Operations and Corporate Functions to drive simplification, establish global standards with continuous improvement and speed to value. We are working to build a reliable, scalable, sustainable, and secure technology roadmap/landscape that enables us to better serve our customers and employees and be the partner of choice. Our focus is on speed to value with financial and execution discipline. We strive to double the good, halve the bad, in half the time.
We operate with No Ego and No Arrogance. We work to build each other up and support one another, appreciating each other’s strengths and respecting each other’s weaknesses. We find joy in our work and each other, actively seeking opportunities to inject fun into what we do. Our hard and efficient work is rewarded with an above market total compensation package. We offer a comprehensive suite of health and welfare, retirement, and paid leave benefits exceeding local expectations.
Throughout the year, the advantage of being part of the Vantage team is evident with an array of benefits, recognition, training and development, and the knowledge that your contribution adds value to the company and our community.
Don't meet all the requirements? Please still apply if you think you are the right person for the position. We are always keen to speak to people who connect with our mission and values.
Data Engineer (Mid‐Level ), Global employer: Vantage Data Centers
Vantage Data Centers is an exceptional employer that fosters a collaborative and innovative work culture, where employees are encouraged to grow and develop their skills in a supportive environment. With a flexible work policy allowing for a hybrid model, competitive compensation, and a comprehensive benefits package, team members can thrive both personally and professionally while contributing to cutting-edge data solutions in the heart of London. The company values teamwork, creativity, and continuous improvement, making it an ideal place for those seeking meaningful and rewarding employment.
StudySmarter Expert Advice🤫
We think this is how you could land Data Engineer (Mid‐Level ), Global
✨Get Involved in Data Science Meetups
Tap into local data science meetups or workshops to connect with fellow enthusiasts and professionals. These events are goldmines for networking, and sometimes even lead directly to job openings at companies like Vantage Data Centers!
✨Show Off Your Projects
Start building a public portfolio showcasing your data science projects on platforms like GitHub or personal websites. Highlight unique analyses or models you've developed. This not only demonstrates your skills but also gets your name out there for roles like Data Engineer (Mid‐Level ), Global at Vantage Data Centers.
✨Leverage Professional Networks
Join professional bodies related to data science, like the Data Science Society or similar organisations. Getting involved can lead to mentorship opportunities and insider knowledge about full-time positions at companies like Vantage Data Centers.
✨Apply Directly through Our Website
When you find a suitable opening like Data Engineer (Mid‐Level ), Global at Vantage Data Centers, make sure to apply directly through our website. It gives you an edge and shows you're keen to join our team. Plus, who doesn’t love a direct application? It’s easier than navigating through job boards!
We think you need these skills to ace Data Engineer (Mid‐Level ), Global
Some tips for your application 🫡
Show Off Your Projects:In the world of data science, your projects can speak volumes about your skills. Make sure to showcase a few key projects in your CV or portfolio, especially those that highlight your ability to work with data sets, build models, or use relevant tools like Python, R, or SQL. Don’t forget to include links to any GitHub repositories if applicable!
Quantify Your Achievements:Employers love numbers! When drafting your CV, highlight your achievements with quantifiable results. For instance, mention how your data analysis led to a certain percentage increase in efficiency or revenue at a previous job or project. These details can really make your application pop!
Craft a Tailored Cover Letter:For a full-time role at Vantage Data Centers, your cover letter should reflect your passion for data science and your excitement about the specific projects or values of the company. Dive into why you’re a good fit, how your skills align with their needs, and any unique perspectives you can bring to the team.
Stand Out with Relevant Courses and Certifications:Although experience talks, relevant courses or certifications can be your ticket to impressing hiring managers at Vantage Data Centers. Mention any standout courses you've completed that equipped you with essential skills, such as machine learning certifications or data visualisation courses. This shows your commitment to continuously developing your skills in the field!
How to prepare for a job interview at Vantage Data Centers
✨Brush Up on Your Statistics
For a data science role, we need to seriously sharpen our statistics skills. Get ready to tackle technical questions on probability distributions, hypothesis testing, and regression analysis. These are often the bread and butter of data science interviews, so don't just skim over them!
✨Showcase Your Projects
Prepare a killer portfolio showcasing your data science projects. We should include details about the datasets used, the tools and techniques applied, and the impact of your findings. If we can walk them through a particularly challenging project or a cool visualisation that had real-world implications, it’ll really make us stand out!
✨Get Comfortable with Python and R
Most data science positions require us to be proficient in programming languages like Python and R. We should practice common libraries like pandas, NumPy, and scikit-learn, and be ready for live coding exercises or algorithm questions. Showing off our coding chops can really impress the interviewers at Vantage Data Centers!
✨Prepare for Case Studies
Expect to encounter real-world case studies during the interview. We might be asked how we’d approach a data problem or analyse a dataset to extract insights. It's essential to think out loud and demonstrate our problem-solving process so that the interviewer can see our logical thinking in action.