Applied Scientist - Post-training

Applied Scientist - Post-training

Full-Time 160000 - 200000 € / year (est.) No home office possible
techire ai

At a Glance

  • Tasks: Develop innovative methods to enhance AI alignment and reasoning in large language models.
  • Company: Join a cutting-edge research team backed by $400 million in funding.
  • Benefits: Competitive salary, equity, bonuses, and a collaborative work environment.
  • Other info: Work alongside experts from top tech companies and universities.
  • Why this job: Make a real impact on AI technology that shapes human interaction.
  • Qualifications: Ideal for researchers or recent PhDs with experience in LLM post-training.

The predicted salary is between 160000 - 200000 € per year.

How do you make a large language model genuinely human-centred, capable of reasoning, empathy, and nuance rather than just pattern-matching? This team is built to answer that question. They’re a small, focused group of researchers and engineers working on the post-training challenges that matter most: RLHF, RLAIF, continual learning, multilingual behaviour, and evaluation frameworks designed for natural, reliable interaction. You’ll work alongside a team from NVIDIA, Meta, Microsoft, Apple, and Stanford, in an environment that combines academic rigour with production-level delivery. Backed by over $400 million in funding, they have the freedom, compute, and scale to run experiments that push beyond the limits of standard alignment research. This is a role where your work moves directly into deployed products. The team’s models are live, meaning every insight you develop, every method you refine, and every experiment you run has immediate, measurable impact on how large-scale conversational systems behave.

What you’ll work on:

  • Developing post-training methods that improve alignment, reasoning, and reliability
  • Advancing instruction-tuning, RLHF/RLAIF, and preference-learning pipelines for deployed systems
  • Designing evaluation frameworks that measure human-centred behaviour, not just accuracy
  • Exploring continual learning and multilingual generalisation for long-lived models
  • Publishing and collaborating on research that informs real-world deployment

Who this role suits:

  • Researchers or recent PhDs with experience in LLM post-training, alignment, or optimisation
  • A track record of rigorous work — published papers, open-source projects, or deployed research
  • Curiosity about how large models learn and behave over time, and how to steer that behaviour safely
  • Someone who values autonomy, clarity of purpose, and research that turns into impact

You’ll find a culture driven by technical depth rather than hype — where thoughtful research is backed by meaningful compute and where the best ideas scale fast.

Location: South Bay (on-site, collaborative setup)

Compensation: $200,000 – $250,000 base + equity + bonus

If you’re ready to work on post-training research that shapes how large language models behave, we’d love to hear from you. All applicants will receive a response.

Applied Scientist - Post-training employer: techire ai

Join a pioneering team in South Bay that thrives on collaboration and innovation, where your contributions directly influence the future of large language models. With a strong emphasis on research-driven impact, competitive compensation, and a culture that values autonomy and technical depth, this company offers exceptional growth opportunities for those passionate about advancing AI technology. Experience a unique environment that combines academic excellence with practical application, ensuring your work is both meaningful and rewarding.

techire ai

Contact Detail:

techire ai Recruiting Team

StudySmarter Expert Advice🤫

We think this is how you could land Applied Scientist - Post-training

Tip Number 1

Network like a pro! Reach out to people in the industry, especially those who work at companies like NVIDIA or Meta. A friendly chat can open doors and give you insights that a job description just can't.

Tip Number 2

Show off your skills! If you've got projects or research that align with post-training methods or RLHF, make sure to highlight them in conversations. Bring your portfolio to life and let your passion shine through!

Tip Number 3

Prepare for interviews by diving deep into the latest trends in LLMs and human-centred AI. Be ready to discuss how your experience can contribute to improving alignment and reliability in deployed systems.

Tip Number 4

Don't forget to apply through our website! It’s the best way to ensure your application gets the attention it deserves. Plus, we love seeing candidates who take that extra step to connect with us directly.

We think you need these skills to ace Applied Scientist - Post-training

Post-training Methods Development
Alignment Techniques
Reasoning Skills
Reliability Improvement
Instruction-Tuning
Reinforcement Learning from Human Feedback (RLHF)
Reinforcement Learning from AI Feedback (RLAIF)

Some tips for your application 🫡

Tailor Your CV:Make sure your CV reflects the skills and experiences that align with the role of Applied Scientist. Highlight any relevant research, publications, or projects that showcase your expertise in LLM post-training and alignment.

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about human-centred AI and how your background makes you a perfect fit for our team. Be genuine and let your enthusiasm show!

Showcase Your Research Impact:When detailing your past work, focus on the impact it had. Did your research lead to real-world applications? Did it influence how models behave? We want to see how your contributions can translate into meaningful outcomes.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it’s super easy – just follow the prompts!

How to prepare for a job interview at techire ai

Know Your Stuff

Make sure you brush up on the latest trends in large language models and post-training techniques. Familiarise yourself with concepts like RLHF, RLAIF, and continual learning. Being able to discuss these topics confidently will show that you're genuinely interested and knowledgeable about the field.

Showcase Your Work

Prepare to talk about your previous research or projects, especially those that relate to alignment or optimisation. If you've published papers or contributed to open-source projects, bring them up! This is your chance to demonstrate your rigorous work and how it aligns with the team's goals.

Ask Thoughtful Questions

Interviews are a two-way street, so come prepared with insightful questions about the team’s current challenges and future projects. This not only shows your curiosity but also helps you gauge if the role and environment are the right fit for you.

Emphasise Collaboration

Since this role involves working alongside experts from top companies, highlight your experience in collaborative settings. Share examples of how you've successfully worked in teams to solve complex problems, as this will resonate well with the team’s culture of collaboration and technical depth.