At a Glance
- Tasks: Orchestrate cutting-edge simulations and build robust data pipelines for advanced engineering.
- Company: PhysicsX, a deep-tech innovator transforming industries with AI-driven simulation software.
- Benefits: Equity options, flexible working, free lunches, and comprehensive healthcare.
- Other info: Diverse and inclusive workplace with excellent growth opportunities.
- Why this job: Join a team pushing the boundaries of engineering and make a real impact.
- Qualifications: 5+ years in data or HPC engineering, strong Python skills, and experience with orchestration systems.
The predicted salary is between 70000 - 90000 £ per year.
PhysicsX is a deep-tech company with roots in numerical physics and Formula One, dedicated to accelerating hardware innovation at the speed of software. We are building an AI-driven simulation software stack for engineering and manufacturing across advanced industries. By enabling high-fidelity, multi-physics simulation through AI inference across the entire engineering lifecycle, PhysicsX unlocks new levels of optimization and automation in design, manufacturing, and operations — empowering engineers to push the boundaries of possibility. Our customers include leading innovators in Aerospace & Defense, Materials, Energy, Semiconductors, and Automotive.
The Senior Simulation Data Engineer will extend and operate the infrastructure that powers our research Data Factory. You will be responsible for the end-to-end pipeline: from geometry preparation and simulation orchestration through validation, post-processing, and delivery to downstream ML training systems, using PhysicsX platform orchestration services where synergies exist. This role sits at the intersection of HPC engineering and data engineering. You will orchestrate long-running CFD simulations at scale, build robust data pipelines, and ensure that every simulation we produce meets rigorous quality standards.
In this role, you will be vertically embedded in Research, working daily with:
- Research Scientists who define data requirements and quality standards
- ML Engineers who consume Data Factory outputs for model training
- ML Infrastructure Engineers who are accountable for downstream training infrastructure
You will have end-to-end responsibilities over the Data Factory, with the autonomy to make architectural decisions and the responsibility to keep data flowing reliably. Horizontally, you will be part of an infrastructure engineering group responsible for infrastructure across the company.
What you will do
- Simulation Orchestration
- Extend and operate the Data Factory infrastructure that orchestrates thousands of CFD simulations per day on cloud compute
- Design and operate job scheduling systems that maximize throughput while handling failures gracefully
- Build monitoring and alerting to detect simulation failures, convergence issues, and resource bottlenecks early
- Build high-performance data pipelines that move simulation outputs from solver results to ML-ready training data
- Implement geometry preprocessing workflows (mesh preparation, morphing, watertightness validation)
- Design and operate post-processing pipelines: surface decimation, field interpolation, format conversion
- Optimize I/O performance for large mesh datasets
- Data Quality and Validation
- Implement comprehensive validation checks at every pipeline stage: solver convergence, physical field bounds, post-processing fidelity
- Build systems that capture and quarantine bad data before they reach training pipelines
- Track and report data quality metrics across the entire Data Factory
- Work towards full provenance: training samples should be traceable back to their source geometry and simulation configuration
- Integration and Delivery
- Deliver validated datasets to downstream ML training infrastructure in formats optimized for efficient data loading
- Design data versioning and cataloging systems that support reproducible training runs
- Work closely with ML Infrastructure Engineers to ensure smooth handoff between data production and model training
- Support multi-dataset training workflows
What you bring to the table
- Ability to scope and effectively deliver projects, prioritising activity as needed.
- Problem‑solving skills and the ability to analyse issues, identify causes, and recommend solutions quickly.
- Excellent collaboration and communication skills, especially in a research setting.
- 5+ years of experience in data engineering, HPC engineering, or simulation infrastructure.
- Strong experience with orchestration systems: SLURM, Kubernetes, Temporal
- Production data pipeline experience: you’ve built and operated pipelines that process large volumes of data reliably
- Proficiency in Python for pipeline development and automation
- Systems engineering fundamentals: Linux, networking, storage systems, performance debugging
- Experience with cloud infrastructure; ideally CoreWeave or similar GPU/HPC‑focused clouds
- Background in HPC for simulation engineering: experience with CFD, FEA, or similar computational workflows (StarCCM+, OpenFOAM, ANSYS, etc.)
- Experience with geometry processing: mesh manipulation, CAD formats, PyVista
- Familiarity with scientific data formats: HDF5, VTK, NetCDF, Zarr
- Data quality engineering experience: validation frameworks, anomaly detection, data observability
Ideally
- Understanding of CFD fundamentals, enough to interpret solver outputs and validation metrics
- Experience with 3D geometry pipelines (mesh decimation, field interpolation)
- Familiarity with ML data loading patterns and how training systems consume data
What we offer
- Equity options – share in our success and growth.
- 10% employer pension contribution – invest in your future.
- Free office lunches – great food to fuel your workdays.
- Flexible working – balance your work and life in a way that works for you.
- Hybrid setup – enjoy our new Shoreditch office while keeping remote flexibility.
- Enhanced parental leave – support for life’s biggest milestones.
- Private healthcare – comprehensive coverage
- Personal development – access learning and training to help you grow.
- Work from anywhere – extend your remote setup to enjoy the sun or reconnect with loved ones.
We value diversity and are committed to equal employment opportunity regardless of sex, race, religion, ethnicity, nationality, disability, age, sexual orientation or gender identity. We strongly encourage individuals from groups traditionally underrepresented in tech to apply.
Senior Simulation Data Engineer London, United Kingdom employer: PhysicsX Ltd
PhysicsX is an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration at the forefront of AI-driven simulation technology. With a strong commitment to employee growth, we provide extensive personal development opportunities, flexible working arrangements, and a supportive environment that values diversity and inclusion. Located in the vibrant Shoreditch area of London, our team enjoys not only competitive benefits like equity options and enhanced parental leave but also the chance to contribute to groundbreaking advancements across multiple advanced industries.
StudySmarter Expert Advice🤫
We think this is how you could land Senior Simulation Data Engineer London, United Kingdom
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, especially those at PhysicsX or similar companies. Attend meetups, webinars, or even just grab a coffee with someone who’s already in the field. You never know where a casual chat might lead!
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to data engineering or HPC. If you’ve worked on simulations or built data pipelines, make sure to highlight that. A strong portfolio can really set you apart from the crowd.
✨Tip Number 3
Prepare for the interview like it’s the championship match! Research PhysicsX, understand their tech stack, and be ready to discuss how your experience aligns with their needs. Practice common interview questions and think about how you can demonstrate your problem-solving skills.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the team at PhysicsX. So, get that application in and let’s make some magic happen!
We think you need these skills to ace Senior Simulation Data Engineer London, United Kingdom
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter to highlight your experience in data engineering and HPC. We want to see how your skills align with the role of Senior Simulation Data Engineer, so don’t hold back on showcasing relevant projects!
Showcase Your Problem-Solving Skills:In your application, give examples of how you've tackled challenges in previous roles. We love candidates who can analyse issues and come up with quick solutions, especially in a research setting like ours.
Highlight Collaboration Experience:Since you'll be working closely with Research Scientists and ML Engineers, it’s important to demonstrate your collaboration skills. Share instances where you’ve successfully worked in a team to achieve a common goal.
Apply Through Our Website:We encourage you to apply directly through our website for a smoother process. It helps us keep track of applications and ensures you’re considered for the right role that matches your skills and career goals!
How to prepare for a job interview at PhysicsX Ltd
✨Know Your Stuff
Make sure you brush up on your knowledge of HPC engineering and data pipelines. Be ready to discuss your experience with orchestration systems like SLURM or Kubernetes, and how you've tackled challenges in simulation infrastructure. This role is all about the nitty-gritty, so showing you understand the technical details will impress the interviewers.
✨Showcase Your Problem-Solving Skills
Prepare to share specific examples of how you've solved complex issues in past projects. Think about times when you had to analyse problems quickly and recommend effective solutions. This will demonstrate your ability to think on your feet and adapt to challenges, which is crucial for a Senior Simulation Data Engineer.
✨Communicate Clearly
Since you'll be working closely with Research Scientists and ML Engineers, it's vital to communicate your ideas clearly. Practice explaining technical concepts in simple terms, especially how you can bridge the gap between infrastructure and model training. Good communication can set you apart from other candidates.
✨Understand the Bigger Picture
Familiarise yourself with the industries PhysicsX operates in, such as Aerospace, Automotive, and Energy. Understanding their specific needs and how your role contributes to their success will show that you're not just focused on the technical side but also on how it impacts the business. This insight can make a big difference in your interview.