At a Glance
- Tasks: Build and scale cutting-edge data infrastructure for AI at Mistral.
- Company: Join a pioneering AI company transforming society with innovative technology.
- Benefits: Competitive salary, equity, health insurance, and generous parental leave.
- Other info: Flexible hybrid or remote work options with excellent career growth opportunities.
- Why this job: Be part of a dynamic team shaping the future of AI and data infrastructure.
- Qualifications: 4+ years in Data Infrastructure or MLOps, proficient in Python and Kubernetes.
The predicted salary is between €60,000 and €80,000 per year.
About Mistral
At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting‑edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise as well as personal needs. Our offerings include Le Chat, La Plateforme, Mistral Code and Mistral Compute, a suite that brings frontier intelligence to end‑users. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, the USA, the UK, Germany and Singapore. We are creative, low‑ego and team‑spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact.
Role Summary
The Data Infrastructure team at Mistral AI is architecting the backbone of our frontier model training and fine‑tuning ecosystem. We are building the specialized compute and data fabrics required to power the development of world‑class AI. Our vision is to operate some of the largest compute fleets in production and build data lakes and metadata systems with a roadmap toward exabyte‑scale architecture. We are currently building a high‑performance training platform designed for massive scale across both on‑premise and cloud‑native Kubernetes environments. We are leading a strategic transition from legacy scheduling to modern orchestration. With numerous clusters distributed across various regions, we are focussed on implementing sophisticated multi‑cluster orchestration and cloud‑bursting capabilities to better utilize our global resources and ensure our researchers have seamless access to compute wherever it resides. Our mission is to evolve our current systems into a platform that is as durable as it is flexible.
Location: Paris / London (hybrid) or remote EU/UK with one hub day per month.
About the Role
This role focuses on building and operating the next generation of data infrastructure at Mistral AI. You will be a core contributor to our evolution, helping us design and scale massive compute fleets and storage systems built for high performance. You will help us move toward a future of decoupled control and data planes, scaling big data compute and storage platforms while ensuring secure and governed data access for MLOps and research. You will take full lifecycle ownership: from architecting the migration away from legacy orchestrators to implementing production‑grade pipelines and participating in on‑call rotations for critical training jobs.
In this role, you will:
- Build & Scale: Help us reach our goal of operating massive distributed compute and storage systems.
- Global Orchestration: Architect and maintain multi‑cluster orchestration layers to optimize workload placement across diverse hardware and regions.
- Design Future‑Proof Storage: Architect our transition to modern storage formats to handle fine‑tuning datasets at a scale that anticipates exabyte growth.
- Platform Engineering: Contribute to the development of our internal training platform, ensuring seamless model training and fine‑tuning capabilities across Kubernetes- and SLURM-based environments.
- Metadata & Lineage: Implement and manage systems to provide clear visibility and lineage as our data and model pipelines grow in complexity.
- Operational Excellence: Use modern deployment workflows to manage cloud‑native deployments, ensuring our data platform can scale by orders of magnitude while remaining reliable and efficient.
You might thrive in this role if you:
- Have 4+ years of experience in Data Infrastructure, MLOps, or Infrastructure Engineering.
- Have experience or a strong interest in supporting foundational compute and storage platforms.
- Are proficient in Python and enjoy solving the "brittle data lake" problem with modern, columnar storage standards.
- Are well‑versed in Kubernetes‑native tooling and excited to debug large‑scale distributed systems across multi‑cluster environments.
- Take pride in building and operating scalable, reliable, and secure systems from the ground up.
- Are comfortable with ambiguity and the challenges of building high‑scale infrastructure in a rapid‑growth AI environment.
Benefits
France
- Competitive cash salary and equity
- Food: Daily lunch vouchers
- Sport: Monthly contribution to a Gympass subscription
- Transportation: Monthly contribution to a mobility pass
- Health: Full health insurance for you and your family
- Parental: Generous parental leave policy
UK
- Competitive cash salary and equity
- Insurance
- Transportation: Reimbursement of office parking charges, or £90 per month for public transport
- Sport: £90 per month reimbursement for gym membership
- Meal voucher: £200 monthly allowance for meals
- Pension plan: SmartPension (contributions: 5% employee and 3% employer)
Research Engineer (Data Infrastructure) in London employer: Mistral AI
Mistral AI is an exceptional employer that fosters a dynamic and collaborative work culture, where innovation thrives and employees are empowered to make a meaningful impact in the AI landscape. With competitive salaries, generous benefits including health insurance, gym memberships, and parental leave, as well as opportunities for professional growth in a pioneering environment, Mistral AI is committed to supporting its diverse workforce in achieving their career aspirations. Join us in Paris, London, or remotely within the EU/UK to be part of a team that is shaping the future of AI technology.
StudySmarter Expert Advice🤫
We think this is how you could land Research Engineer (Data Infrastructure) in London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with Mistral AI employees on LinkedIn. A friendly chat can open doors that applications alone can't.
✨Tip Number 2
Show off your skills! If you’ve got a portfolio or projects that highlight your expertise in data infrastructure or MLOps, make sure to share them during interviews. It’s all about demonstrating what you can bring to the table.
✨Tip Number 3
Prepare for technical interviews by brushing up on your Python and Kubernetes knowledge. Practice common problems and scenarios you might face in the role. We want to see how you think and solve issues!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in being part of our team at Mistral AI.
Some tips for your application 🫡
Show Your Passion for AI: When writing your application, let your enthusiasm for AI shine through! We love seeing candidates who are genuinely excited about the potential of AI to transform society. Share any personal projects or experiences that highlight your passion.
Tailor Your Application: Make sure to customise your application to fit the role of Research Engineer in Data Infrastructure. Highlight relevant experience and skills that align with our mission at Mistral AI. This shows us you’ve done your homework and are serious about joining our team.
Be Clear and Concise: Keep your application clear and to the point. We appreciate well-structured applications that are easy to read. Use bullet points where necessary and avoid jargon unless it’s relevant to the role. Remember, clarity is key!
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets to us without any hiccups. Plus, you’ll find all the details you need about the role and our culture there!
How to prepare for a job interview at Mistral AI
✨Know Your Stuff
Make sure you brush up on your knowledge of data infrastructure and MLOps. Familiarise yourself with the latest trends in Kubernetes and cloud-native environments, as well as the specific technologies Mistral AI uses. Being able to discuss these topics confidently will show that you're genuinely interested and prepared.
✨Showcase Your Experience
Prepare to share specific examples from your past work that demonstrate your ability to build and scale data systems. Think about challenges you've faced and how you overcame them, especially in high-performance environments. This will help the interviewers see how you can contribute to their team.
✨Ask Smart Questions
Come armed with thoughtful questions about Mistral AI's projects and future goals. Inquire about their approach to multi-cluster orchestration or how they handle data governance. This not only shows your interest but also helps you gauge if the company aligns with your career aspirations.
✨Emphasise Team Spirit
Mistral AI values collaboration and a low-ego environment. Be ready to discuss how you've worked effectively in teams, especially in competitive settings. Highlight your ability to communicate and collaborate with diverse groups, as this will resonate well with their culture.