Full-stack Software Engineer (Research team) in London
Full-stack Software Engineer (Research team)

Full-stack Software Engineer (Research team) in London

London Full-Time 100000 - 200000 ÂŁ / year (est.) No home office possible
COL Limited

At a Glance

  • Tasks: Build innovative tools for AGI safety research and enhance evaluation processes.
  • Company: Join a cutting-edge tech company focused on AI safety and research.
  • Benefits: Enjoy a competitive salary, unlimited vacation, and professional development budget.
  • Other info: Dynamic team culture with opportunities for growth and collaboration.
  • Why this job: Make a real impact in AI safety while working with top researchers and engineers.
  • Qualifications: Experience in Python and React, with a passion for software engineering.

The predicted salary is between 100000 - 200000 ÂŁ per year.

We’re looking for Full-stack Software Engineers who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.

Representative Projects

  • Your main objective is to develop tooling for analyzing model evaluation results. Here is a list of features that you might build and ship in your first 6 months:
  • LLM-powered search that finds interesting fragments in evaluation transcripts
  • Comparison views that show how conversations and scores differ between two evaluation runs
  • Ability to view and analyse conversations with coding agents (Cursor, Claude Code, etc.) in addition to evaluation transcripts
  • Results streaming for evaluations that are currently being run
  • Collaborative editing of evaluation logs that automatically updates metrics and other derived data. Think of this as developing an “IDE for evaluations”.
  • Besides this, here are example auxiliary projects which you might do:
    • Automated evaluation pipelines to minimize the time from getting access to a new model for pre-deployment testing to analyzing the most important results and sharing them.
    • LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent major failures
    • Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability
    • Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.

    Key Responsibilities

    • Balance between moving quickly and creating robust and performant software
    • Lead the development of major features from ideation to implementation
    • Support the entire user journey from running the evaluation to finding interesting results to analysing the results to producing reports and papers
    • Make the software configurable and extensible, so that users can adapt it for their needs
    • Collaboratively define and shape the software roadmap and priorities
    • Establish and advocate for good software design practices, codebase health, and coding agent practices
    • Work closely with researchers to understand what challenges they face
    • Work closely with the product team to create solutions that satisfy both our researchers and external customers

    Key Requirements

    • You must have experience writing production-quality Python and React code
    • 5+ years of professional software engineering experience
    • We value candidates from diverse backgrounds and recognise that candidates may demonstrate their skills in different ways. For example, we might be impressed if you have:
    • Led the development of a successful software tool or product over an extended period (e.g. 1 year or more)
    • Started and built the tech stack for a company, e.g in a start-up
    • Worked your way up in a large organisation, repeatedly gaining more responsibility and influencing a large part of the codebase
    • Authored and/or maintained a popular open-source tool or library
    • Placed in a prestigious programming competition (IOI, ICPC, etc.)
  • The following would be a bonus:
    • Experience designing rich and intuitive UIs, especially for power users
    • Direct work with researchers or customers
    • Experience working with LLM agents or LLM evaluations
    • Interest in AI Safety

    We want to emphasize that people who feel they don’t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.

    Logistics

    • Start Date: Target of 2-3 months after the first interview
    • Time Allocation: Full-time
    • Location: The office is in London, and right next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis.
    • Work Visas: We can sponsor UK visas

    Benefits

    • Salary: 100k - 200k GBP (~135k - 270k USD)
    • Flexible work hours and schedule
    • Unlimited vacation
    • Unlimited sick leave
    • Lunch, dinner, and snacks are provided for all employees on workdays
    • Paid work trips, including staff retreats, business trips, and relevant conferences
    • A yearly $1,000 (USD) professional development budget

    About the Team

    The SWE team currently consists of Rusheb Shah, Andrei Matveiakin, Alex Kedrik, and Glen Rodgers. Beyond the SWE team, you will closely interact with the research scientists and engineers as the primary user group of your tools.

    About the Apollo Research

    The rapid rise in AI capabilities offers tremendous opportunities, but also presents significant risks. At Apollo Research, we’re primarily concerned with risks from Loss of Control, i.e. risks coming from the model itself rather than e.g. humans misusing the AI. We’re particularly concerned with deceptive alignment/scheming, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight. We work on the detection of scheming (e.g. building evaluations), the science of scheming (e.g. model organisms), and scheming mitigations (e.g. anti-scheming, and control). We closely work with multiple frontier AI companies, e.g. to test their models before deployment or collaborate on scheming mitigations. At Apollo, we aim for a culture that emphasizes truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful.

    Equality Statement: Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.

    Interview Process

    Please complete the application form with your CV. The provision of a cover letter is optional but not necessary. Please also feel free to share links to relevant work samples.

    About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest working on hands-on LLM evals projects (e.g. as suggested in our starter guide), such as building LM agent evaluations in Inspect.

    Applications deadline: We are reviewing applications on a rolling basis. It might take a few weeks until you hear from us.

    Full-stack Software Engineer (Research team) in London employer: COL Limited

    At Apollo Research, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration. Our London office provides a vibrant environment where Full-stack Software Engineers can thrive, with benefits such as unlimited vacation, flexible working hours, and a generous professional development budget. We are committed to employee growth and well-being, ensuring that our team members are supported in their pursuit of meaningful work in the rapidly evolving field of AI safety.
    COL Limited

    Contact Detail:

    COL Limited Recruiting Team

    StudySmarter Expert Advice 🤫

    We think this is how you could land Full-stack Software Engineer (Research team) in London

    ✨Tip Number 1

    Get your networking game on! Connect with folks in the industry, especially those already working at Apollo Research. A friendly chat can go a long way in getting your foot in the door.

    ✨Tip Number 2

    Prepare for those interviews like a pro! Brush up on your Python and React skills, and don’t forget to dive into LLM evals projects. Show us you’re ready to tackle real-world challenges.

    ✨Tip Number 3

    Be yourself during the interview process. We value diverse backgrounds and unique experiences, so don’t hesitate to share your journey and how it shapes your approach to software engineering.

    ✨Tip Number 4

    Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take the initiative to reach out directly.

    We think you need these skills to ace Full-stack Software Engineer (Research team) in London

    Python
    React
    Software Development
    Tooling Development
    Model Evaluation
    Data Analysis
    User Interface Design
    Collaboration
    Project Management
    Problem-Solving
    Agile Methodologies
    Telemetry API
    Open-source Contributions
    AI Safety Awareness

    Some tips for your application 🫡

    Show Your Passion: When you're writing your application, let your enthusiasm for AI safety and software engineering shine through. We want to see that you’re genuinely excited about the work we do at StudySmarter and how you can contribute to our mission.

    Tailor Your CV: Make sure your CV highlights relevant experience, especially with Python and React. We love seeing projects that demonstrate your skills, so don’t hold back on showcasing any tools or libraries you've built or contributed to!

    Be Clear and Concise: Keep your application straightforward and to the point. We appreciate clarity, so make it easy for us to see your qualifications and how they align with the role. Avoid jargon unless it’s necessary to explain your expertise.

    Apply Early!: Since we review applications on a rolling basis, getting your application in early can give you an edge. Don’t wait until the deadline; hop onto our website and submit your application as soon as you can!

    How to prepare for a job interview at COL Limited

    ✨Know Your Tech Stack

    Make sure you’re well-versed in Python and React, as these are crucial for the role. Brush up on your coding skills and be ready to discuss your past projects that involved these technologies. Highlight any experience you have with building tools or libraries, especially those related to LLM evaluations.

    ✨Understand the Research Context

    Familiarise yourself with the concepts of AGI safety and the specific challenges researchers face. This will help you engage in meaningful conversations during the interview. Show that you’re not just a coder but someone who understands the impact of your work on AI safety.

    ✨Prepare for Technical Interviews

    Since the technical interviews will focus on real tasks, practice hands-on projects related to LLM evals. Work on building evaluation tools or similar projects to demonstrate your practical skills. Be ready to explain your thought process and how you approach problem-solving.

    ✨Showcase Collaboration Skills

    This role involves working closely with researchers and product teams, so be prepared to discuss your experience in collaborative environments. Share examples of how you’ve successfully worked with others to define software roadmaps or solve complex problems together.

    Full-stack Software Engineer (Research team) in London
    COL Limited
    Location: London

    Land your dream job quicker with Premium

    You’re marked as a top applicant with our partner companies
    Individual CV and cover letter feedback including tailoring to specific job roles
    Be among the first applications for new jobs with our AI application
    1:1 support and career advice from our career coaches
    Go Premium

    Money-back if you don't land a job in 6-months

    >