At a Glance
- Tasks: Design and implement cutting-edge research in reinforcement learning and post-training at scale.
- Company: Join DeepL, a global leader in Language AI, breaking down language barriers.
- Benefits: Enjoy 30 days annual leave, hybrid work, and competitive benefits including virtual shares.
- Why this job: Shape the future of AI while collaborating with a diverse, innovative team.
- Qualifications: Strong background in reinforcement learning, Python, and large-scale model alignment.
- Other info: Be part of a vibrant community with over 90 nationalities and flexible working hours.
The predicted salary is between 36000 - 60000 ÂŁ per year.
Meet DeepL. DeepL is a global communications platform powered by Language AI. Since 2017, we’ve been on a mission to break down language barriers. Our human‑sounding translations and intelligent writing suggestions are designed with enterprise security in mind. Today, they enable over 100,000 businesses to transform communications, reach new markets, and improve productivity. Our goal is to become the global leader in Language AI, building products that drive better communication, foster connections, and make a real‑life impact.
Meet the team behind this journey: Foundation Model Task Adaptation. We are the team behind DeepL’s post‑training stack for large language models. We focus on developing algorithms and systems that align pre‑trained models with tasks and performance goals through techniques like reinforcement learning. As a research‑driven team, we stay up to date with current literature to integrate cutting‑edge ideas into our core stack. As part of this team, you will shape the future of how our models learn beyond pre‑training: enabling new capabilities, better controllability, and safer, more effective user experiences.
Your Responsibilities
- Design, implement, and deploy cutting‑edge research in reinforcement learning and post‑training at scale, driving innovations that make it into production.
- Build and deploy state‑of‑the‑art reinforcement learning pipelines at scale.
- Post‑train large (multi‑modal) models to align them with human intent and enable general capabilities such as reasoning, pushing the boundaries of model performance, safety, and efficiency.
- Always keep the entire lifecycle of research and production in mind: from idea conception, theoretical modeling, prototyping, ablation studies, all the way to production deployment.
- Build and foster external collaborations with academic and industrial partners.
- Follow scientific and technical standards for experimentation, reproducibility, and model evaluation.
- Collaborate deeply with Engineering, ML Platform, and HPC teams to deliver robust and reliable model updates to users.
Qualities we look for
- A scientist with a deep technical background, strong leadership skills, and a proven track record of driving research in reinforcement learning or large‑scale model alignment to production.
- Researchers with a strong practical background, a creative mindset, and a passion for solving hard problems with real‑world impact.
- A solid mathematical background and enjoy solving challenging problems, evidenced by a master’s degree, diploma, PhD, or equivalent industry experience in mathematics, physics, computer science, or a related field.
- Deep practical experience in Python and at least one modern machine learning framework such as PyTorch, TensorFlow, or JAX, and experience working with large compute clusters and ML infrastructure is a plus.
- A track record of leading self‑directed research projects that go well beyond academic exercises and deliver tangible results.
- Expertise in deep reinforcement learning (RLHF/RLAIF/RLVR) is a plus.
- Hands‑on experience scaling and deploying LLMs or other foundation models in real‑world systems is a plus.
What we offer
- Diverse and internationally distributed team: joining our team means becoming part of a large, global community with people of more than 90 nationalities.
- Open communication, regular feedback: we value the importance of clear, honest communication.
- Hybrid work, flexible hours: we offer a hybrid work schedule, with team members coming into the office twice a week.
- Monthly full‑day hacking sessions: every month, we have Hack Fridays, where you can spend your time diving into a project you’re passionate about.
- 30 days of annual leave: we value your peace of mind.
- Competitive benefits: we’ve crafted it to reflect the diversity of our team and tailored it to align with your unique location.
- Virtual Shares: An ownership mindset in every role.
If this role and our mission resonate with you, but you’re hesitant because you don’t check all the boxes, don’t let that hold you back. At DeepL, it’s all about the value you bring and the growth we can foster together. Go ahead, apply let’s discover your potential together. We can’t wait to meet you.
We are an equal opportunity employer. You are welcome at DeepL for who you are. We appreciate authenticity here. Our product is for everyone, and so is our workplace. The more voices we have represented and amplified in our business, the more we will all succeed, contribute, and think forward!
Senior Research Scientist - FMTA employer: Paul Ekman Group
Contact Detail:
Paul Ekman Group Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Senior Research Scientist - FMTA
✨Tip Number 1
Network like a pro! Reach out to current employees at DeepL on LinkedIn or other platforms. Ask them about their experiences and any tips they might have for your application process. Personal connections can make a huge difference!
✨Tip Number 2
Prepare for the interview by diving deep into DeepL’s products and mission. Understand how your skills in reinforcement learning and model alignment can contribute to their goals. Show them you’re not just a fit, but the perfect fit!
✨Tip Number 3
Practice your problem-solving skills! Be ready to tackle real-world scenarios related to large language models and reinforcement learning during interviews. Think of it as a fun challenge rather than a test!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the DeepL family!
We think you need these skills to ace Senior Research Scientist - FMTA
Some tips for your application 🫡
Tailor Your Application: Make sure to customise your CV and cover letter for the Senior Research Scientist role. Highlight your experience in reinforcement learning and large-scale model alignment, as well as any relevant projects that showcase your skills.
Showcase Your Passion: Let us see your enthusiasm for AI and language technology! Share any personal projects or research that demonstrate your commitment to innovation and problem-solving in this field.
Be Clear and Concise: When writing your application, keep it straightforward and to the point. Use clear language to explain your achievements and how they relate to the responsibilities of the role. We appreciate clarity!
Apply Through Our Website: Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the position. We can’t wait to see what you bring to the table!
How to prepare for a job interview at Paul Ekman Group
✨Know Your Reinforcement Learning
Make sure you brush up on your reinforcement learning concepts and techniques. Be ready to discuss how you've applied these in past projects, especially in relation to large-scale models. This will show your depth of knowledge and practical experience.
✨Showcase Your Collaboration Skills
DeepL values teamwork, so be prepared to share examples of how you've successfully collaborated with cross-functional teams. Highlight any partnerships with academic or industrial partners that have led to tangible results in your research.
✨Prepare for Technical Questions
Expect technical questions related to Python and machine learning frameworks like PyTorch or TensorFlow. Brush up on your coding skills and be ready to solve problems on the spot, as this will demonstrate your hands-on experience and problem-solving abilities.
✨Align with DeepL's Mission
Familiarise yourself with DeepL's mission to break down language barriers. Be ready to articulate how your work aligns with this goal and how you can contribute to their vision of improving communication through AI. This will show your passion for the role and the company.