At a Glance
- Tasks: Develop and manage synthetic data pipelines for training cutting-edge AI models.
- Company: Join Cosine, a YC-backed tech company revolutionising software engineering with AI.
- Benefits: Enjoy competitive salary, equity options, 30 days holiday, and a dog-friendly office.
- Why this job: Make a real impact on the future of AI-driven software engineering.
- Qualifications: 3-5 years in software engineering, experience with Python, Docker, and cloud platforms.
- Other info: Collaborative environment focused on work-life balance and professional growth.
The predicted salary is between 80000 - 110000 ÂŁ per year.
Location: London; full in-office working as default
Start date: ASAP
Compensation: ÂŁ80,000 - ÂŁ110,000 Base Salary & ÂŁ80,000 - ÂŁ110,000 Share options.
At Cosine, we’re building autonomous AI engineers that plan, write, and ship code inside real development workflows. Cosine is designed for on-premise and virtual private cloud (VPC) deployments, including fully air-gapped environments. We build our agent tooling entirely in-house and post-train open-source models to deliver reliable, enterprise-grade coding performance in security-critical settings. In 2024, Cosine achieved a 72% score on OpenAI’s SWE-Lancer benchmark, placing us among the strongest real-world software-engineering AI systems evaluated. YC-backed and well-funded, Cosine was founded by experienced operators focused on building dependable, production-grade AI. This role is based in our Hoxton office, five days a week, because close collaboration, fast feedback, and shared context matter for the problems we’re solving.
We’re looking for an ML Systems Engineer to collaborate in training our Lumen models – our open‑source–based software engineering LLMs. This is a unique, and truly interdisciplinary role that involves developing and deploying our reinforcement learning (RL) training environments, working on synthetic data pipelines at massive scale and running fine‑tuning jobs to train the next generation of SWE models that will be used in both our self‑serve and enterprise products. We want to make sure that the models we train are the best SWEs in the world - this doesn’t just mean training them to get the right answer, it means training them so that they write readable, maintainable code, that fits with the architectural patterns already present in the codebase. We believe we’re now in the anti‑slop era of coding agents, where data, RL environments and opinionated reward functions will shape the future standards of SWE models. If this sounds exciting, then this could be the role for you.
In this role you will:
- Develop and manage synthetic data generation pipelines to curate datasets that will underpin future RL fine‑tunes.
- Design, build and deploy containerized services using Docker and platforms like Kubernetes to enable our RL infrastructure.
- Build and iterate on large‑scale RL loops where models write code, run tests or tools, and get rewarded (or penalized) accordingly.
- Work hands‑on across the stack: custom PyTorch dataloaders, RL objectives, and evaluation on real‑world repos and tasks.
You’ll collaborate closely with infra, product, and research to decide what to train next, how to train it, and how to measure whether it’s actually better for engineers.
What you’ll do:
- Participate in end‑to‑end training of models:
- Supervised fine‑tuning on curated code and conversation datasets.
- RL on top of those models to align them with software‑engineering objectives.
- Help maintain/extend an evaluation suite for code models (unit tests, benchmark suites, repo‑level tasks).
- Analyze failure modes and feed them back into data and training plans.
What we’re looking for (essential):
- Strong software engineering or computer science background: Typically 3-5 years of experience. You can read, debug, and write non‑trivial production code (you’ll mainly be working across Python and Go).
- Experience with tools like Docker and container management/orchestration platforms, like Kubernetes.
- Experience with at least one major cloud‑computing platform like GCP, AWS or Azure.
- You care about code quality, correctness, and maintainability as much as model metrics.
- Knowledge of PyTorch/Tensorflow/JAX: Comfortable implementing custom training loops, losses, and dataloaders.
- Data engineering instincts: Comfortable working with large‑scale datasets, object storage, dataset sharding, and filtering. Know that data quality and sampling strategies matter as much as architecture.
- Clear communication and ownership: Can take a vague modelling goal (“make Lumen better at X”) and turn it into a concrete plan of experiments. Comfortable documenting decisions and walking others through tradeoffs.
Nice to have:
- Experience with synthetic data generation pipelines.
- Experience with data tooling like SQL, Apache Iceberg and duckDB.
- Experience training LLMs in distributed environments.
- Safety, robustness, and reward shaping: Experience with LLM‑as‑a‑judge, reward hacking detection, or robustness evaluation.
- Open‑source contributions or research: Contributions to open‑source LLM tooling, RL libraries, etc.
Why join Cosine:
- Direct impact: Your work directly shapes the next generations of Lumen Enterprise SWE models that engineers use every day.
- Real scale: You’ll work with large, modern open‑source models, long context lengths, and multi‑node training runs.
- Full‑stack ML engineering: From custom PyTorch code and distributed systems to data curation, RL infrastructure design and MLOps.
If this sounds like a fit, this is a role where you can meaningfully push the frontier of open-source–based software engineering models.
Cosine is an equal opportunity employer. We value diverse backgrounds, perspectives, and ways of thinking, and we’re committed to creating an inclusive and respectful workplace. We encourage applications from anyone who meets the role requirements, even if you don’t meet every single qualification. If you need reasonable adjustments at any stage of the hiring process, we’re happy to discuss them.
We’re an in‑office team, five days a week, by design. We believe the work we’re doing benefits from being together, collaborating closely, and building shared context. What you can expect:
- Competitive salary, benchmarked to the market.
- Equity / share options, so you share in the upside you help create.
- 30 days’ holiday + bank holidays.
- Genuine 9–5 working hours — we don’t expect late nights or weekend work.
- Work hard in the office, collaborate closely, and switch off properly.
- Dog‑friendly office — bring your dog to work.
- Daily lunch provided.
- Monthly team breakfasts.
- Monthly socials.
- Pension.
- High-quality equipment to do your best work.
We care about focus, sustainability, and doing great work — not performative overwork. We value people who show up, contribute thoughtfully, collaborate well with their colleagues, and then go home. This role won’t suit everyone. But if you want structure, clarity, strong collaboration, and a team that takes both the work and work‑life balance seriously, it’s a great place to be.
ML Systems Engineer - Model Training and Infrastructure (SWE-focused LLMs) employer: Cosine
Contact Detail:
Cosine Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land ML Systems Engineer - Model Training and Infrastructure (SWE-focused LLMs)
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to ML systems or software engineering. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by practising common technical questions and coding challenges. Use platforms like LeetCode or HackerRank to sharpen your skills. Remember, confidence is key!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen. Plus, it shows you’re genuinely interested in joining our team at Cosine. Don’t miss out!
We think you need these skills to ace ML Systems Engineer - Model Training and Infrastructure (SWE-focused LLMs)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the ML Systems Engineer role. Highlight your experience with Python, Docker, and any relevant cloud platforms. We want to see how your skills align with what we're looking for!
Showcase Your Projects: Include any projects that demonstrate your experience with synthetic data generation or reinforcement learning. We love seeing real-world applications of your skills, so don’t hold back on the details!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Explain why you're excited about working at Cosine and how you can contribute to our mission. We appreciate clear communication, so keep it concise and engaging.
Apply Through Our Website: Remember to apply through our website! It’s the best way for us to track your application and ensure it gets the attention it deserves. Plus, it makes the whole process smoother for everyone involved.
How to prepare for a job interview at Cosine
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, especially Python, Go, Docker, and Kubernetes. Brush up on your knowledge of cloud platforms like GCP, AWS, or Azure, as these will likely come up during technical discussions.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific examples where you've tackled complex problems, particularly in ML systems or software engineering. Think about how you’ve approached challenges in training models or managing data pipelines, and be ready to explain your thought process.
✨Communicate Clearly
Since this role involves collaboration across teams, practice articulating your ideas clearly. Be prepared to explain technical concepts in a way that’s understandable to non-technical team members. This will demonstrate your ability to communicate effectively within a diverse team.
✨Ask Insightful Questions
Prepare thoughtful questions about the company’s projects, culture, and future goals. This shows your genuine interest in the role and helps you assess if Cosine is the right fit for you. Consider asking about their approach to model evaluation or how they envision the future of SWE models.