At a Glance
- Tasks: Train and deploy reinforcement learning agents for real-time cooling control in data centres.
- Company: Innovative tech firm tackling energy efficiency with cutting-edge AI solutions.
- Benefits: Competitive salary, equity options, hybrid work, and direct access to leadership.
- Other info: Opportunity for significant impact on energy savings and career growth.
- Why this job: Solve real-world problems while working at the intersection of research and engineering.
- Qualifications: 3-5 years in deep RL, Python, and a background in physical systems.
The predicted salary is between 110000 - 150000 £ per year.
London (hybrid, 1 day/week in Kings Cross) - Solve Data Centres Cooling issues. Cooling is one of the largest items on a data centre's energy bill, and most sites run it conservatively because getting it wrong puts the hardware at risk. Our client trains reinforcement learning agents to control cooling systems on live sites, cutting cooling energy without breaching the temperature and humidity limits operators are contractually bound to. They're hiring an ML Engineer - Reinforcement Learning to build those agents and get them running on real data centres. You'll report to the CTO / Head of AI and work across the line between research and deployment.
The System: The agents don't learn on the live plant. They train against a digital twin of each site, then move to production once they're safe. Reward and constraint design is shaped by ASHRAE standards and customer SLAs - air temperature, humidity, and rate-of-change limits on cooling air and chilled water setpoints. Training is federated across multiple sites. Agents share learned control strategies without any site's operational data leaving the building, which delivers significantly more savings than a single-site approach. Models are deployed on-prem at the edge, then monitored and retrained in place.
What You'll Own:
- Reinforcement Learning: Train and deploy deep RL agents for live cooling control. Design reward functions and constraints that hold up against physical limits and SLAs, not just in a notebook. Move between research-style exploration and the engineering work to make something stable on a real site.
- Simulation and Digital Twins: Build and improve the physics-based simulators, surrogate models, and digital twins the agents train against. Close the gap between what works in simulation and what holds on real hardware.
- Production and Deployment: Federated and distributed training across sites. Edge deployment, monitoring, and retraining of agents already running in production.
What We're Looking For:
- Essential: 3-5 years training and deploying deep RL agents in Python, PyTorch or JAX, and RL libraries such as Gymnasium. A background in physical systems - engineering (mechanical, electrical, structural, biomedical), physics, robotics, autonomous driving, or control systems - and the instinct to reason about what's physically possible, not only what's mathematically possible. Comfortable iterating between research exploration and the engineering needed to run on a live site. A degree in engineering, CS, or physics.
- Useful: Control systems (classical control, MPC), HVAC, thermodynamics, power systems, or data centre operations. Federated learning, distributed training, or edge ML deployment. Simulation experience - building or using physics-based simulators, digital twins, surrogate models, or large physics models. Published research or open-source contributions.
Who You Are: You want both halves of this job. You'll run experiments and read papers, but you also want your work controlling real equipment, with the constraints that come with that. RL experience limited to advertising or multi-armed bandits won't carry over here - the physical world doesn't behave like a recommendation system. A pure maths or CS background with no feel for physical systems will struggle, and so will anyone after a pure research seat or a pure production one. This sits in the middle.
What's On Offer: £110K-£150K, plus competitive equity. A genuine technical problem: RL on physical systems, under real constraints, deployed on live infrastructure. Direct access to the CTO and founding team. Hybrid working, one day a week in the Kings Cross office. Visa sponsorship available on a case-by-case basis. Get in touch for a confidential conversation.
Machine Learning Engineer- Reinforcement Learning employer: Wave Recruitment
Join a forward-thinking company that is at the forefront of applying machine learning to real-world challenges in data centre cooling. With a hybrid working model and a competitive salary package, you will have the opportunity to work closely with the CTO and founding team, fostering a culture of innovation and collaboration. The company prioritises employee growth through hands-on experience in both research and deployment, making it an ideal environment for those looking to make a meaningful impact in the field of reinforcement learning.
StudySmarter Expert Advice🤫
We think this is how you could land Machine Learning Engineer- Reinforcement Learning
✨Tip Number 1
Network like a pro! Reach out to people in the industry, attend meetups, and connect with professionals on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to reinforcement learning and physical systems. This will give potential employers a taste of what you can do and set you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on both technical and practical aspects of ML. Be ready to discuss how you’d approach real-world problems, like cooling systems in data centres, and demonstrate your understanding of constraints and SLAs.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who are proactive about their job search!
We think you need these skills to ace Machine Learning Engineer- Reinforcement Learning
Some tips for your application 🫡
Tailor Your CV:Make sure your CV highlights your experience with reinforcement learning and any relevant projects you've worked on. We want to see how your skills align with the role, so don’t be shy about showcasing your Python and PyTorch expertise!
Craft a Compelling Cover Letter:Your cover letter is your chance to tell us why you're the perfect fit for this role. Share your passion for solving real-world problems with ML and how your background in physical systems makes you a strong candidate. Keep it engaging and personal!
Showcase Relevant Projects:If you've worked on any projects involving digital twins or federated learning, make sure to mention them! We love seeing practical applications of your skills, so include links to your GitHub or any published research if you have them.
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows us you’re keen to join the StudySmarter team!
How to prepare for a job interview at Wave Recruitment
✨Know Your Reinforcement Learning
Make sure you brush up on your reinforcement learning concepts, especially how they apply to physical systems. Be ready to discuss your experience with training and deploying deep RL agents, and how you've tackled real-world constraints in past projects.
✨Showcase Your Technical Skills
Prepare to demonstrate your proficiency in Python, PyTorch or JAX, and relevant RL libraries like Gymnasium. Bring examples of your work that highlight your ability to build and improve physics-based simulators or digital twins, as this will be crucial for the role.
✨Understand the Business Context
Familiarise yourself with the challenges data centres face regarding cooling and energy efficiency. Being able to articulate how your work can directly impact these issues will show that you understand the bigger picture and are invested in solving real problems.
✨Prepare for a Hybrid Discussion
Since the role involves both research and deployment, be ready to discuss how you balance exploration with engineering. Think of examples where you've successfully transitioned from theoretical work to practical applications, and how you’ve iterated on solutions in a live environment.