At a Glance
- Tasks: Build innovative tools for AGI safety research and enhance internal software capabilities.
- Company: Join Apollo Research, a leader in AI safety with a collaborative culture.
- Benefits: Enjoy a competitive salary, unlimited vacation, flexible hours, and professional development budget.
- Other info: Dynamic team environment with opportunities for growth and collaboration.
- Why this job: Make a real impact in AI safety while working with cutting-edge technology.
- Qualifications: Experience in production-quality Python coding and a passion for AI safety.
The predicted salary is between 100000 - 200000 £ per year.
Applications deadline: We accept submissions until 15 January 2026. We review applications on a rolling basis and encourage early submissions.
ABOUT THE OPPORTUNITY
We’re looking for Backend Software Engineers who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.
REPRESENTATIVE PROJECTS
- Internal tooling for efficiently running and analyzing evaluations. For example, a tool that quickly investigates thousands of agentic eval runs in parallel and surfaces interesting information automatically.
- Automated evaluation pipelines to minimize the time from getting access to a new model for pre-deployment testing to analyzing the most important results and sharing them.
- Orchestration tools that allow researchers to run thousands of agentic evaluations in parallel on remote machines with high security and reliability.
- LLM proxy service that enables us to monitor all of our coding agent traffic in real time and identify undesired behavior automatically (in the spirit of Control).
- LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent major failures.
- CI pipeline optimisations to reduce execution time and eliminate flaky tests.
- Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability.
- Data warehousing pipeline and service to store thousands of eval transcripts which researchers can study and build datasets from.
- Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.
KEY RESPONSIBILITIES
- Rapidly prototype and iterate on internal tools and libraries for building and running frontier language model evaluations.
- Lead the development of major features from ideation to implementation.
- Collaboratively define and shape the software roadmap and priorities.
- Establish and advocate for good software design practices, codebase health, and coding agent practices.
- Work closely with researchers to understand what challenges they face.
- Assist researchers with implementation and debugging of research code.
- Communicate clearly about technical decisions and tradeoffs.
KEY REQUIREMENTS
- You must have experience writing production-quality python code.
- We value candidates from diverse backgrounds and recognise that candidates may demonstrate their skills in different ways. For example, we might be impressed if you have:
- Led the development of a successful software tool or product over an extended period (e.g. 1 year or more).
- Started and built the tech stack for a company, e.g. in a start-up.
- Worked your way up in a large organisation, repeatedly gaining more responsibility and influencing a large part of the codebase.
- Authored and/or maintained a popular open-source tool or library.
- Placed in a prestigious programming competition (IOI, ICPC, etc.).
Bonus
- Experience working with LLM agents or LLM evaluations.
- Infosecurity / cybersecurity experience.
- Experience working with AWS.
- Interest in AI Safety.
We want to emphasize that people who feel they don’t fulfill all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply. We believe that excellent candidates can come from a variety of backgrounds and are excited to give you opportunities to shine.
LOGISTICS
- Start Date: Target of 2-3 months after the first interview.
- Time Allocation: Full-time.
- Location: The office is in London, and right next to the London Initiative for Safe AI (LISA) offices. This is an in-person role. In rare situations, we may consider partially remote arrangements on a case-by-case basis.
- Work Visas: We can sponsor UK visas.
BENEFITS
- Salary: 100k - 200k GBP (~135k - 270k USD).
- Flexible work hours and schedule.
- Unlimited vacation.
- Unlimited sick leave.
- Lunch, dinner, and snacks are provided for all employees on workdays.
- Paid work trips, including staff retreats, business trips, and relevant conferences.
- A yearly $1,000 (USD) professional development budget.
ABOUT APOLLO RESEARCH
The rapid rise in AI capabilities offers tremendous opportunities, but also presents significant risks. At Apollo Research, we’re primarily concerned with risks from Loss of Control, i.e. risks coming from the model itself rather than e.g. humans misusing the AI. We’re particularly concerned with deceptive alignment / scheming, a phenomenon where a model appears to be aligned but is, in fact, misaligned and capable of evading human oversight. We work on the detection of scheming (e.g. building evaluations), the science of scheming (e.g. model organisms), and scheming mitigations (e.g. anti-scheming, and control). We closely work with multiple frontier AI companies, e.g. to test their models before deployment or collaborate on scheming mitigations. At Apollo, we aim for a culture that emphasizes truth-seeking, being goal-oriented, giving and receiving constructive feedback, and being friendly and helpful.
ABOUT THE TEAM
The SWE team currently consists of Rusheb Shah, Andrei Matveiakin, Alex Kedrik, and Glen Rodgers. Beyond the SWE team, you will closely interact with the research scientists and engineers as the primary user group of your tools.
Equality Statement
Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.
INTERVIEW PROCESS
Please complete the application form with your CV. The provision of a cover letter is optional but not necessary. Please also feel free to share links to relevant work samples. About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews. If you want to prepare for the interviews, we suggest working on hands‑on LLM evals projects (e.g. as suggested in our starter guide), such as building LM agent evaluations in Inspect.
Applications deadline: We are reviewing applications on a rolling basis. It might take a few weeks until you hear from us.
Backend Software Engineer (Research team) in London employer: COL Limited
Contact Detail:
COL Limited Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Backend Software Engineer (Research team) in London
✨Tip Number 1
Get your networking game on! Connect with folks in the AI and software engineering space, especially those who work at Apollo Research or similar companies. Attend meetups, webinars, or even just reach out on LinkedIn. You never know who might give you a heads-up about job openings or refer you directly!
✨Tip Number 2
Show off your skills! If you've worked on any relevant projects, make sure to have them ready to share during interviews. Whether it's a GitHub repo or a personal website, having tangible examples of your work can really set you apart from the crowd.
✨Tip Number 3
Prepare for those technical interviews by diving deep into LLM evaluations. Brush up on your Python skills and get hands-on with projects that align with what Apollo Research is doing. The more familiar you are with their work, the better you'll perform in interviews!
✨Tip Number 4
Don't forget to apply through our website! It’s the best way to ensure your application gets seen. Plus, we love seeing candidates who take the initiative to engage with us directly. So, hit that apply button and let’s get the conversation started!
We think you need these skills to ace Backend Software Engineer (Research team) in London
Some tips for your application 🫡
Show Your Passion: When writing your application, let your enthusiasm for AGI safety research shine through! We want to see that you're genuinely excited about the projects we’re working on and how you can contribute to them.
Tailor Your CV: Make sure your CV highlights relevant experience, especially in Python and software engineering. We love seeing how your past projects align with our needs, so don’t hold back on showcasing your achievements!
Be Clear and Concise: In your written application, clarity is key. Use straightforward language to explain your skills and experiences. We appreciate a well-structured application that makes it easy for us to see why you’d be a great fit.
Apply Early!: Since we review applications on a rolling basis, don’t wait until the deadline! Applying early gives you a better chance of standing out, so head over to our website and submit your application as soon as you can.
How to prepare for a job interview at COL Limited
✨Know Your Python Inside Out
Since the role requires experience in writing production-quality Python code, make sure you brush up on your Python skills. Be ready to discuss your past projects and how you've tackled challenges using Python. Practising coding problems related to backend development can also help you feel more confident.
✨Familiarise Yourself with LLMs
Given the focus on LLM evaluations, it’s crucial to understand how they work. Dive into any relevant projects or tools that involve LLMs, and be prepared to discuss your insights during the interview. Showing genuine interest and knowledge about LLM safety and evaluation will set you apart.
✨Prepare for Technical Discussions
The interview process includes multiple technical interviews, so be ready to engage in deep discussions about software design practices and trade-offs. Think about how you would approach building internal tools or libraries, and be prepared to share your thought process clearly and confidently.
✨Showcase Your Collaborative Spirit
This role involves working closely with researchers, so highlight your teamwork skills. Prepare examples of how you've collaborated on projects in the past, especially in a tech environment. Emphasising your ability to communicate effectively and advocate for good practices will resonate well with the interviewers.