At a Glance
- Tasks: Research and develop cutting-edge machine learning systems and optimisation infrastructure.
- Company: Join Huawei, a global leader in ICT with a focus on innovation.
- Benefits: Enjoy 33 days of annual leave, private medical insurance, and a pension scheme.
- Other info: Collaborate in a dynamic environment with opportunities for personal and professional growth.
- Why this job: Make a real impact in AI and machine learning while advancing your career.
- Qualifications: Master's or PhD in Machine Learning or related field; proficiency in PyTorch required.
The predicted salary is between 60000 - 80000 £ per year.
About Huawei Research and Development UK Limited. Founded in 1987, Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We have 207,000 employees and operate in over 170 countries and regions, serving more than three billion people around the world.
Our vision and mission is to bring digital to every person, home and organization for a fully connected, intelligent world. To this end, we will drive ubiquitous connectivity and promote equal access to networks; bring cloud and artificial intelligence to all four corners of the earth to provide superior computing power where you need it, when you need it; build digital platforms to help all industries and organizations become more agile, efficient, and dynamic; redefine user experience with AI, making it more personalized for people in all aspects of their life, whether they’re at home, in the office, or on the go.
This spirit of innovation has led Huawei to work in close partnership with leading academic institutions in the UK to develop and refine the latest technologies. With a shared commitment to innovation and progress, both parties have worked together to achieve common goals and establish a strong partnership. The partnership between UK and Huawei helps to develop the technologies of the future that will transform the way we all communicate, work and live.
For the past 30 years we have maintained an unwavering focus, rejecting shortcuts and easy opportunities that don't align with our core business. With a practical approach to everything we do, we concentrate our efforts and invest patiently to drive technological breakthroughs. This strategic focus is a reflection of our core values:
- Staying customer-centric
- Inspiring dedication
- Persevering
- Growing by reflection
Huawei’s vision is a fully connected, intelligent world. To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centers around the globe. In the UK, we already have design centers in Cambridge, London, Edinburgh and Ipswich. We continue to explore and define new research directions and new services. We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies. We invite you to join us on this exciting journey and drive your career forward.
Job Summary: Research and develop large-scale machine learning systems, alignment workflows, and optimization infrastructure to advance LLM reasoning and post-training capabilities. Design and execute scaled reinforcement learning pipelines (e.g., PPO, GRPO) utilizing distributed training frameworks (verl, trl, DeepSpeed, FSDP) integrated with high-performance inference engines (vLLM). Optimize low-level training throughput, kernel performance, and memory utilization across heterogeneous hardware clusters using expressive hardware DSLs (e.g., TileLang, Triton). Advance the LLM orchestration loop and leverage Bayesian optimization to automate the search, generation, and continuous improvement of high-performance NPU kernels.
Key Responsibilities:
- Design and execute scaled RL finetuning workflows (e.g., PPO, GRPO) to enhance LLM reasoning, instruction-following, and alignment.
- Architect and manage large-scale distributed training experiments across multi-node GPU, optimizing for maximum throughput and hardware utilization.
- Develop and maintain training infrastructure using advanced parallelization frameworks (verl, trl, DeepSpeed, FSDP) to support rapidly evolving research needs.
- Integrate high-performance inference engines like vLLM directly into RL generation loops to reduce rollout latency and accelerate training cycles.
- Implement robust profiling and debugging pipelines to diagnose bottlenecks in GPU memory, compute, and inter-node communication.
- Collaborate with data and evaluation teams to design dense reward functions and synthetic data generation pipelines.
- Design, benchmark, and deploy highly optimized custom tensor operators (e.g., FlashAttention, GEMM) across heterogeneous hardware architectures using modern Domain-Specific Languages (DSLs) and AI compilers.
This job description is only an outline of the tasks, responsibilities and outcomes required of the role. The jobholder will carry out any other duties as may be reasonably required by his/her line manager. The job description and personal specification may be reviewed on an ongoing basis in accordance with the changing needs of Huawei Research and Development UK Limited.
Person Specification:
Required:
- Master's or PhD (or equivalent industry research experience) in Machine Learning, Computer Science, Data Science, or a highly quantitative field with a heavy focus on Machine Learning.
- Deep proficiency in PyTorch and experience writing custom training loops and data pipelines.
- Hands-on experience with RLHF/RLVF methods (e.g., PPO, GRPO) and an understanding of policy optimization dynamics.
- Technical familiarity with at least two of the following: DeepSpeed, FSDP, verl or trl.
- Production or heavy research experience utilizing vLLM or similar high-throughput inference serving engines for generation.
- Ability to thrive in a fast-paced, iterative environment where research and production infrastructure deeply intersect.
Desired:
- Proven track record of running scaled GPU experiments across multi-node clusters.
- Experience implementing next-gen alignment and reasoning paradigms, such as GRPO or Monte Carlo Tree Search (MCTS).
- Deep understanding of GPU architectures, kernels, FlashAttention, and profiling tools.
- Familiarity with cluster environments and schedulers like Slurm or Kubernetes.
- Hands-on experience developing within NPU stack or ecosystem integrations.
- Practical knowledge of hardware-expressive Domain-Specific Languages (e.g., TileLang, Triton) to optimize low-level memory placement, layout propagation, and thread-block scheduling.
- Research publications at top-tier AI/ML conferences (NeurIPS, ICLR, ICML) or a strong open-source GitHub footprint in LLM training/infrastructure.
What we offer:
- 33 days annual leave entitlement per year (including UK public holidays)
- Group Personal Pension
- Life insurance
- Private medical insurance
- Medical expense claim scheme
- Employee Assistance Program
- Cycle to work scheme
- Company sports club and social events
- Additional time off for learning and development
Research Engineer/Scientist - Machine Learning RL & Optimisation (Contractor) in London employer: Huawei Technologies Research & Development (UK) Ltd
Huawei Research and Development UK Limited is an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. With a strong commitment to employee growth, the company provides extensive benefits including 33 days of annual leave, private medical insurance, and opportunities for continuous learning and development. Located in the UK, employees have the unique advantage of working at the forefront of technology alongside leading academic institutions, driving meaningful advancements in machine learning and artificial intelligence.
Contact Details:
Huawei Technologies Research & Development (UK) Ltd Recruitment Team
StudySmarter Expert Advice🤫
We think this is how you could land Research Engineer/Scientist - Machine Learning RL & Optimisation (Contractor) in London
✨Showcase Your Skills with a Public Portfolio
As a freelancer in data science, having a killer portfolio is essential. Showcase your projects on platforms like GitHub or create a personal website that details your work and techniques. This gives potential clients a clear picture of what you can do and helps you stand out from the competition.
✨Get Involved in Data Science Communities
Tap into online forums like Kaggle or Stack Overflow. Not only can you showcase your expertise, but you can also connect with other data scientists and potential clients. Plus, participating in competitions and discussions can elevate your profile in the field.
✨Leverage Local Networking Opportunities
Keep an eye out for local data science meetups or tech events in your area. These are golden opportunities to meet potential clients and collaborators face-to-face. Plus, who doesn't love a bit of networking over pizza and drinks?
✨Pitch Your Services Directly to Companies
Don't just wait for freelancing platforms to bring clients to you—be proactive! Research companies that could benefit from data science services and craft tailored pitches. Mention specific pain points you can address for them. Let’s get that freelance hustle going!
We think you need these skills to ace Research Engineer/Scientist - Machine Learning RL & Optimisation (Contractor) in London
Some tips for your application 🫡
Showcase Your Projects:When applying for a freelance data science role like Research Engineer/Scientist - Machine Learning RL & Optimisation (Contractor) at Huawei Technologies Research & Development (UK) Ltd, it’s crucial to highlight your projects. Include a portfolio that features at least two or three projects involving data analysis, machine learning, or visualisation. Make sure to describe the tools and methodologies you used, so we can see your skills in action!
Quantify Your Achievements:Freelance gigs, especially in data science, often ask for proven results. In your CV, include any relevant metrics or outcomes from your previous work. Did your analysis help reduce costs by a certain percentage? Or did your predictive model improve performance? Numbers speak volumes!
Introduce Your Style:Since freelancing is all about your individual style and approach, use your cover letter to share how you tackle data problems. This is your chance to let us know how you think, your creative problem-solving methods, and how you would approach a project at Huawei Technologies Research & Development (UK) Ltd.
Be Real About Your Rates:When you send in your application, don’t forget to mention your freelance rates and availability. We appreciate clarity up front, and it helps us gauge if you fit within our budget and timeline. Being transparent in this aspect shows professionalism and readiness!
How to prepare for a job interview at Huawei Technologies Research & Development (UK) Ltd
✨Show Off Your Data Wizardry
As a freelancer in data science, you'll want to present a portfolio that showcases your best projects. We should pull together examples where you tackled real problems with data analytics, machine learning models, or visualisations. It's all about demonstrating your skills in action!
✨Be Ready to Dive Deep into Technical Questions
Expect to encounter some technical grilling during the interview. Prepare to discuss statistical methods, algorithms, or maybe even tackle a live coding challenge. We should brush up on tools like Python, R, or SQL—those are key players in the data science field. Don't just know them; be ready to explain your thought process!
✨Help Them Understand Your Work Style
Freelance gigs often mean you'll be working independently, so we need to convey our self-motivation and time management skills. Be prepared to talk about how you’ve handled multiple projects or met tight deadlines before. Sharing your approach to client communication can also give them confidence in your ability to deliver remotely.
✨Pitch Your Value Proposition
When freelancing, it’s crucial to clearly articulate what makes you unique. We should highlight not just technical skills but also the business impact of our projects. Think of a couple of stories where your data insights drove decision-making—this can be a game changer in showing why they should choose you for their freelance needs!