Full-stack Software Engineer in London

London | Full-Time | £100,000 - £200,000 per year (est.) | No home office possible

At a Glance

Tasks: Build innovative tools for AGI safety research and develop cutting-edge software features.
Company: Join a forward-thinking tech company focused on AI safety and collaboration.
Benefits: Enjoy a competitive salary, flexible hours, unlimited vacation, and a professional development budget.
Why this job: Make a real impact in AI safety while working with top researchers and engineers.
Qualifications: Experience in Python and React, with a passion for software engineering and AI.
Other info: Dynamic team environment with opportunities for growth and learning.

The predicted salary is between £100,000 and £200,000 per year.

We accept submissions until 15 January 2026. We review applications on a rolling basis and encourage early submissions.

We are looking for Full-stack Software Engineers who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.

Representative Projects

Your main objective is to develop tooling for analyzing model evaluation results. Here is a list of features that you might build and ship in your first 6 months:

LLM-powered search that finds interesting fragments in evaluation transcripts
Comparison views that show how conversations and scores differ between two evaluation runs
Ability to view and analyse conversations with coding agents (Cursor, Claude Code, etc.) in addition to evaluation transcripts
Results streaming for evaluations that are currently being run
Collaborative editing of evaluation logs that automatically updates metrics and other derived data. Think of this as developing an "IDE for evaluations".

Besides this, here are example auxiliary projects which you might do:

Automated evaluation pipelines to minimize the time from getting access to a new model for pre-deployment testing to analyzing the most important results and sharing them.
LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent major failures
Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability
Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.

Key Responsibilities

Balance moving quickly with creating robust and performant software
Lead the development of major features from ideation to implementation
Support the entire user journey: running evaluations, finding interesting results, analysing them, and producing reports and papers
Make the software configurable and extensible, so that users can adapt it for their needs
Collaboratively define and shape the software roadmap and priorities
Establish and advocate for good software design practices, codebase health, and coding agent practices
Work closely with researchers to understand what challenges they face
Work closely with the product team to create solutions that satisfy both our researchers and external customers

Key Requirements

You must have experience writing production-quality Python and React code
5+ years of professional software engineering experience
We value candidates from diverse backgrounds and recognise that candidates may demonstrate their skills in different ways. For example, we might be impressed if you have:

Led the development of a successful software tool or product over an extended period (e.g. 1 year or more)
Started and built the tech stack for a company, e.g. in a start-up
Worked your way up in a large organisation, repeatedly gaining more responsibility and influencing a large part of the codebase
Authored and/or maintained a popular open-source tool or library
Placed in a prestigious programming competition (IOI, ICPC, etc.)

The following would be a bonus:

Experience designing rich and intuitive UIs, especially for power users
Direct work with researchers or customers
Experience working with LLM agents or LLM evaluations
Interest in AI Safety

We want to emphasise that people who feel they don’t fulfil all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.

Logistics

Start Date: Target of 2-3 months after the first interview
Time Allocation: Full-time
Location: The office is in London, right next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis.
Work Visas: We can sponsor UK visas

Benefits

Salary: 100k - 200k GBP (~135k - 270k USD)
Flexible work hours and schedule
Unlimited vacation
Unlimited sick leave
Lunch, dinner, and snacks are provided for all employees on workdays
Paid work trips, including staff retreats, business trips, and relevant conferences
A yearly $1,000 (USD) professional development budget

About the Team

The SWE team currently consists of Rusheb Shah, Andrei Matveiakin, Alex Kedrik, and Glen Rodgers. Beyond the SWE team, you will closely interact with the research scientists and engineers as the primary user group of your tools.

About Apollo Research

The rapid rise in AI capabilities offers tremendous opportunities, but also presents significant risks. At Apollo Research, we are primarily concerned with risks from Loss of Control, i.e. risks coming from the model itself rather than e.g. humans misusing the AI. We are particularly concerned with deceptive alignment/scheming, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight. We work on the detection of scheming (e.g. building evaluations), the science of scheming (e.g. model organisms), and scheming mitigations (e.g. anti-scheming and control). We work closely with multiple frontier AI companies, e.g. to test their models before deployment or to collaborate on scheming mitigations. At Apollo, we aim for a culture that emphasises truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful.

Equality Statement: Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.

Interview Process

Please complete the application form with your CV. A cover letter is optional. Please also feel free to share links to relevant work samples.

Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest working on hands-on LLM evals projects (e.g. as suggested in our starter guide), such as building LM agent evaluations in Inspect.

We are reviewing applications on a rolling basis. It might take a few weeks until you hear from us.

Full-stack Software Engineer in London employer: COL Limited

At Apollo Research, we pride ourselves on being an exceptional employer, offering a dynamic work culture that fosters innovation and collaboration in the field of AI safety. Our London office provides a vibrant environment with flexible work hours, unlimited vacation, and a generous professional development budget, ensuring that our Full-stack Software Engineers have the resources and support they need to thrive and grow in their careers. Join us to work alongside passionate researchers and engineers, contributing to meaningful projects that address critical challenges in AI technology.

Contact Details:

COL Limited Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land the Full-stack Software Engineer role in London

✨Tip Number 1

Get your networking game on! Reach out to folks in the industry, especially those who work at Apollo or similar companies. A friendly chat can sometimes lead to opportunities that aren’t even advertised.

✨Tip Number 2

Show off your skills! If you’ve got a portfolio of projects, especially those related to LLMs or software tools, make sure to highlight them. We love seeing what you can do in action!

✨Tip Number 3

Prepare for the interview process by diving into hands-on projects. Work on LLM evals or similar tasks to get a feel for what we do. It’ll not only help you stand out but also give you great talking points during interviews.

✨Tip Number 4

Don’t hesitate to apply through our website! Even if you don’t tick every box, we’re keen on diverse backgrounds and experiences. If you think you’d be a good fit, we want to hear from you!

We think you need these skills to ace the Full-stack Software Engineer role in London

Python

React

Software Development

Tooling Development

Model Evaluation

Data Analysis

User Interface Design

Collaboration

Project Management

Problem-Solving

Agile Methodologies

Telemetry API

Open-source Contributions

AI Safety Awareness

Some tips for your application 🫡

Get Your CV Spot On: Make sure your CV is tailored to highlight your experience with Python and React. We want to see how you've led projects or built tools, so don’t hold back on showcasing your achievements!

Show Off Your Passion: In your application, let us know why you're excited about AI safety and the work we do at Apollo Research. A genuine interest can really make you stand out from the crowd!

Include Relevant Work Samples: If you have any projects or tools you've worked on that relate to LLM evaluations or software engineering, share them! Links to your GitHub or other portfolios can give us a better idea of your skills.

Apply Early: We review applications on a rolling basis, so don’t wait until the last minute. Get your application in early through our website to increase your chances of getting noticed!

How to prepare for a job interview at COL Limited

✨Know Your Tech Stack

Make sure you’re well-versed in Python and React, as these are crucial for the role. Brush up on your coding skills and be ready to discuss your past projects that involved these technologies.

✨Understand the Research Context

Familiarise yourself with AGI safety research and the specific challenges faced by researchers. This will help you demonstrate how your skills can directly contribute to their work and show that you’re genuinely interested in the field.

✨Prepare for Technical Interviews

Since the technical interviews will focus on real tasks, practice hands-on LLM evals projects. Work on building LM agent evaluations or similar projects to showcase your practical experience and problem-solving skills.

✨Showcase Collaboration Skills

Highlight your ability to work closely with both researchers and product teams. Be prepared to discuss examples of how you’ve successfully collaborated in the past, as this role requires balancing technical development with user needs.
