At a Glance
- Tasks: Build innovative tools for AGI safety research and enhance language model evaluations.
- Company: Join Apollo Research, a leader in AI safety with a collaborative culture.
- Benefits: Enjoy a competitive salary, unlimited vacation, flexible hours, and professional development budget.
- Why this job: Make a real impact in AI safety while working with cutting-edge technology.
- Qualifications: Experience in Python and a passion for software engineering; diverse backgrounds welcome.
- Other info: Dynamic team environment with opportunities for growth and collaboration.
The predicted salary is between £80,000 and £120,000 per year.
Application deadline: 15 January 2026. We review applications on a rolling basis and encourage early submissions.
ABOUT THE OPPORTUNITY
We’re looking for Backend Software Engineers who are excited to build tools for frontier AGI safety research, e.g. building and maintaining evals libraries and tools for monitoring and controlling our own LLM traffic.
REPRESENTATIVE PROJECTS
- Internal tooling for efficiently running and analyzing evaluations.
- Automated evaluation pipelines to minimise the time from getting access to a new model for pre-deployment testing to analysing the most important results and sharing them.
- Orchestration tools that allow researchers to run thousands of agentic evaluations in parallel on remote machines with high security and reliability.
- LLM proxy service that enables us to monitor all of our coding agent traffic in real time and identify undesired behaviour automatically.
- LLM agents and MCP tools to automate internal software engineering and research tasks, with sandboxes to prevent major failures.
- CI pipeline optimisations to reduce execution time and eliminate flaky tests.
- Telemetry API and instrumentation of our existing tools, allowing us to monitor usage and improve reliability.
- Data warehousing pipeline and service to store thousands of eval transcripts which researchers can study and build datasets from.
- Upstream improvements to the Inspect framework and ecosystem, e.g. support for evaluating modern agentic scaffolds.
KEY RESPONSIBILITIES
- Rapidly prototype and iterate on internal tools and libraries for building and running frontier language model evaluations.
- Lead the development of major features from ideation to implementation.
- Collaboratively define and shape the software roadmap and priorities.
- Establish and advocate for good software design practices, codebase health, and coding agent practices.
- Work closely with researchers to understand what challenges they face.
- Assist researchers with implementation and debugging of research code.
- Communicate clearly about technical decisions and tradeoffs.
KEY REQUIREMENTS
- You must have experience writing production-quality Python code.
- 5+ years of professional software engineering experience.
- Bonus: Experience working with LLM agents or LLM evaluations.
- Infosecurity / cybersecurity experience.
- Experience working with AWS.
- Interest in AI Safety.
We want to emphasise that people who feel they don’t fulfil all of these characteristics but think they would be a good fit for the position nonetheless are strongly encouraged to apply.
LOGISTICS
- Start Date: Target of 2-3 months after the first interview.
- Time Allocation: Full-time.
- Location: The office is in London, next to the London Initiative for Safe AI (LISA) offices. This is an in-person role.
- Work Visas: We can sponsor UK visas.
BENEFITS
- Salary: 100k - 200k GBP (~135k - 270k USD).
- Flexible work hours and schedule.
- Unlimited vacation.
- Unlimited sick leave.
- Lunch, dinner, and snacks are provided for all employees on workdays.
- Paid work trips, including staff retreats, business trips, and relevant conferences.
- A yearly $1,000 (USD) professional development budget.
ABOUT APOLLO RESEARCH
The rapid rise in AI capabilities offers tremendous opportunities, but also presents significant risks. At Apollo Research, we’re primarily concerned with risks from Loss of Control, i.e. risks coming from the model itself rather than humans misusing the AI.
ABOUT THE TEAM
The SWE team currently consists of Rusheb Shah, Andrei Matveiakin, Alex Kedrik, and Glen Rodgers. Beyond the SWE team, you will closely interact with the research scientists and engineers as the primary user group of your tools.
EQUALITY STATEMENT
Apollo Research is an Equal Opportunity Employer. We value diversity and are committed to providing equal opportunities to all, regardless of age, disability, gender reassignment, marriage and civil partnership, pregnancy and maternity, race, religion or belief, sex, or sexual orientation.
INTERVIEW PROCESS
Please complete the application form and attach your CV. A cover letter is optional. Please also feel free to share links to relevant work samples.
About the interview process: Our multi-stage process includes a screening interview, a take-home test (approx. 2 hours), 3 technical interviews, and a final interview with Marius (CEO). The technical interviews will be closely related to tasks the candidate would do on the job. There are no leetcode-style general coding interviews.
Backend Software Engineer (Research team) employer: COL Limited
Contact Detail:
COL Limited Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Backend Software Engineer (Research team) role
✨Tip Number 1
Get your networking game on! Reach out to folks in the industry, especially those who work at Apollo Research or similar companies. A friendly chat can sometimes lead to opportunities that aren’t even advertised yet.
✨Tip Number 2
Show off your skills! If you’ve got any projects or tools you’ve built, make sure to have them ready to share during interviews. It’s a great way to demonstrate your experience and passion for backend development.
✨Tip Number 3
Prepare for those technical interviews by diving into hands-on LLM evals projects. Familiarise yourself with the tools and libraries mentioned in the job description, as this will help you stand out as a candidate who’s ready to hit the ground running.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who take the initiative to connect directly with us.
Some tips for your application 🫡
Show Off Your Skills: When you're writing your application, make sure to highlight your experience with production-quality Python code. We want to see what you've built and how you've contributed to projects, so don’t hold back!
Tailor Your Application: Take a moment to customise your CV and any work samples you share. Align them with the key responsibilities and projects mentioned in the job description. This shows us that you understand what we're looking for and how you can fit into our team.
Be Yourself: We value diverse backgrounds and experiences, so don’t worry if you don’t tick every box. If you think you'd be a good fit, go ahead and apply! Let your personality shine through in your application.
Apply Early: Since we review applications on a rolling basis, it’s a good idea to submit your application sooner rather than later. Head over to our website and get your application in before the deadline!
How to prepare for a job interview at COL Limited
✨Know Your Tech Stack
Make sure you’re well-versed in Python and any other technologies mentioned in the job description. Brush up on your experience with LLM evaluations and internal tooling, as these will likely come up during the technical interviews.
✨Showcase Your Projects
Prepare to discuss specific projects you've worked on that relate to the role. Highlight any tools or libraries you've built, especially those that demonstrate your ability to lead development from ideation to implementation.
✨Understand the Research Context
Familiarise yourself with the challenges researchers face in AGI safety. Being able to articulate how your skills can help solve these problems will show that you’re not just a coder, but someone who understands the bigger picture.
✨Practice Clear Communication
Since the role involves collaborating with researchers, practice explaining your technical decisions and trade-offs clearly. This will be crucial during the interviews, especially when discussing your past experiences and how they relate to the responsibilities of the position.