Member of Technical Staff (Post Training)

Member of Technical Staff (Post Training)

Full-Time 60000 - 80000 £ / year (est.) No working from home possible
I

At a Glance

  • Tasks: Lead post-training AI research and develop state-of-the-art algorithms for scientific exploration.
  • Company: Inherent, a cutting-edge AI lab in London focused on recursive self-improvement.
  • Benefits: Competitive salary, collaborative culture, and a vibrant office environment with great food.
  • Other info: Join a diverse team committed to societal benefit and creative experimentation.
  • Why this job: Shape the future of AI research and work on groundbreaking projects that matter.
  • Qualifications: 3+ years in deep learning and software engineering, with a passion for innovation.

The predicted salary is between 60000 - 80000 £ per year.

At Inherent, we are on a mission to build AI that recursively self-improves to discover new knowledge. Scientific advances are the backbone of our economic, technological and societal prosperity, but ideas are getting harder to find and breakthroughs are becoming more expensive. We are building a new frontier lab dedicated to developing AI that explores “unknown unknowns” to uncover paradigm-shifting research contributions. Science is a social endeavour, and so our mission is inextricably a human-machine teaming problem. We’re starting by reinventing the AI research factory so that our own agents accelerate their own creation. Inherent is a well-funded, fast-growing neo-lab backed by Tier 1 VCs who believe in our ethical stance. We are a team of operators with backgrounds at frontier labs who have done foundational work in recursive self-improvement, AI Scientists, world modelling, meta-RL and human-machine cooperation. Working in-person every day at our high-intensity London headquarters, we believe that Europe will lead the way in the coming paradigm of AI-enabled science, unlocking human potential across the globe.

About the role

We’re looking for Members of Technical Staff to lead work on post-training state-of-the-art foundation models for open-ended agentic capabilities in scientific research. You’ll be involved at every level of the post-training pipeline: sourcing and creating data, building autocurricula, devising and implementing SFT and RL algorithms, constructing tools and harnesses for foundation model self‑improvement, analysing research results, and using information gained to devise future hypotheses. You will work closely with an experienced technical team of humans, and increasingly alongside the AI scientist collaborators we dogfood.

What you'd do

  • Design, implement, and tune SFT and RL algorithms to post-train models that autonomously perform state-of-the‑art research.
  • Build the autocurricula, judges, harnesses and eval pipelines that turn open-ended research tasks into reliable reward signal.
  • Run large-scale experiments on state-of-the‑art hardware and analyse experiments to determine the next hypotheses to test, in collaboration with our AI agents.
  • Close recursive loops so that AI agents drive their own post-training research.
  • Work closely with colleagues in the Infrastructure and AI for Science teams to optimise hardware and deliver remarkable performance in real scientific domains.

What we're looking for

  • 3+ years of deep learning research experience.
  • Experience post-training large language, vision, video or multi-modal models.
  • Demonstrated track record of success in deep learning research, whether papers, model releases, open-source contributions, or other artifacts.
  • 5+ years of software engineering experience, including deep familiarity with Python and at least one deep learning framework (e.g., PyTorch, JAX).
  • Experience using the latest coding agents, and opinions about optimal workflow.
  • Enthusiasm for experimental organizational design.
  • AI-pilled: adopting agents, keen to build a company where agents are front and centre.

Strong candidates may also have

  • PhD in mathematics, computer science or hard science discipline.
  • Hands‑on experience training LLMs with RL at scale (GRPO/PPO, DPO, distillation, and variants).
  • Familiarity with distributed and long-context training infrastructure.
  • A background in autocurricula, open‑endedness, meta‑learning, or recursive self‑improvement.
  • Experience post‑training frontier models at an industry lab (scale, infra, and iteration speed).

Why this is interesting

  • You’ll shape the core research of a frontier AI lab from the beginning.
  • You’ll work on genuine recursive self‑improvement — training AI scientists that improve the very pipeline that trains them — not incremental benchmark‑chasing.
  • You’ll dogfood your own work: the agents you post‑train accelerate the research that creates them.
  • Small team, high trust, no bureaucracy, and a genuinely technical culture.

Culture

We only select people with low ego, spiky skill profiles, commitment to societal benefit, unusual viewpoints, and a passion for "living in the experiment". We'll win because we're willing to try things that no incumbent would even think to do, let alone action. We have really good lunch and dinner. Seriously. You've got to try it. We're based in King's Cross, London and believe in the pace and energy of working in person. We’re committed to having the most tasteful, and the weirdest, office of any AI lab: the environment shapes the agents within it. If you believe in our mission and culture, and are qualified and motivated, we encourage you to apply, even if you don’t meet every one of the criteria above. We know that many of the most creative and talented people have had unusual career paths and backgrounds. Building a team with a diversity of thought is mission‑critical, for plurality spurs curiosity, invention and collective experimentation.

Member of Technical Staff (Post Training) employer: Inherentlabs

At Inherent, we pride ourselves on being an exceptional employer, fostering a high-intensity work culture that thrives on collaboration and innovation. Our London headquarters offers a unique environment where employees are empowered to shape the future of AI research, with ample opportunities for personal and professional growth. We value diversity of thought and encourage unconventional career paths, ensuring that every team member contributes to our mission of unlocking human potential through cutting-edge technology.

I

Contact Details:

Inherentlabs Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Member of Technical Staff (Post Training)

Tip Number 1

Network like a pro! Reach out to folks in the AI and tech community, especially those connected to Inherent. Attend meetups, webinars, or even just grab a coffee with someone in the field. You never know who might have the inside scoop on job openings!

Tip Number 2

Show off your skills! Create a portfolio showcasing your deep learning projects, research papers, or any cool contributions you've made. This is your chance to demonstrate your expertise and passion for AI, so make it shine!

Tip Number 3

Prepare for interviews by diving deep into the latest trends in AI and recursive self-improvement. Be ready to discuss your past experiences and how they align with Inherent's mission. Practice makes perfect, so do mock interviews with friends or mentors!

Tip Number 4

Don't forget to apply through our website! It's the best way to ensure your application gets seen. Plus, it shows you're genuinely interested in being part of our team at Inherent. Let's make this happen together!

We think you need these skills to ace Member of Technical Staff (Post Training)

Deep Learning Research
Post-Training Algorithms
SFT and RL Algorithms
Python Programming
PyTorch
JAX
Large-Scale Experimentation

Some tips for your application 🫡

Show Your Passion for AI:When writing your application, let your enthusiasm for AI and its potential shine through. We want to see how you connect with our mission of recursive self-improvement and how you envision contributing to groundbreaking research.

Tailor Your Experience:Make sure to highlight your relevant experience in deep learning and software engineering. We’re looking for specific examples of your work with SFT and RL algorithms, so don’t hold back on the details that showcase your skills!

Be Authentic:We value unique perspectives and low ego. Don’t be afraid to share your unconventional career path or any unusual viewpoints you have. This is a chance for us to get to know the real you, so let your personality come through!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for this exciting opportunity at Inherent. We can’t wait to hear from you!

How to prepare for a job interview at Inherentlabs

Know Your Stuff

Make sure you brush up on your deep learning research experience and be ready to discuss specific projects you've worked on. Highlight any papers, model releases, or open-source contributions that showcase your expertise in post-training models.

Show Your Passion for AI

Inherent is all about pushing the boundaries of AI, so demonstrate your enthusiasm for recursive self-improvement and human-machine collaboration. Share your thoughts on how AI can revolutionise scientific research and why you're excited to be part of that journey.

Prepare for Technical Questions

Expect to dive deep into technical discussions about SFT and RL algorithms. Be prepared to explain your approach to building autocurricula and how you would optimise hardware for performance. Practising coding challenges in Python or your preferred deep learning framework can give you an edge.

Cultural Fit Matters

Inherent values low ego and diverse viewpoints, so be yourself! Share your unique experiences and how they shape your perspective on teamwork and innovation. Don't forget to mention your excitement about working in a high-trust, low-bureaucracy environment where experimentation is encouraged.