At a Glance
- Tasks: Design and maintain robust test infrastructure for complex ML software.
- Company: Join Graphcore, a leader in AI compute technology.
- Benefits: Flexible working, generous leave, private medical insurance, and healthy snacks.
- Other info: Inclusive culture with excellent career growth opportunities.
- Why this job: Make an impact in AI while working with cutting-edge technologies.
- Qualifications: Experience in software engineering, strong Python skills, and CI/CD knowledge.
The predicted salary is between 60000 - 80000 £ per year.
About Graphcore
At Graphcore, we’re building the future of AI compute. We’re a team of semiconductor, software and AI experts, with deep experience in creating the complete AI compute stack - from silicon and software to infrastructure at datacentre scale. As part of the SoftBank Group, backed by significant long-term investment, we are delivering key technology into the fast-growing SoftBank AI ecosystem. To meet the vast and exciting AI opportunity, Graphcore is expanding its teams around the world. We are bringing together the brightest minds to solve the toughest problems, in a place where everyone has the opportunity to make an impact on the company, our products and the future of artificial intelligence.
Job Summary
Applicants for this role should have strong experience designing, developing, and maintaining high-quality software systems. The role focuses on testing and validating a complex machine learning software stack, with particular emphasis on software architecture, automation, and engineering best practices. The ideal candidate is an experienced software engineer who values code quality, testability, and long-term maintainability, and enjoys building systems that other engineers rely on. This person will be comfortable working across large codebases, contributing to CI/CD infrastructure, and shaping technical direction through thoughtful design and mentoring in a technically demanding environment spanning ML frameworks, infrastructure, and AI accelerator hardware.
The Team
The ML QA team is composed of highly skilled software engineers with a strong focus on automation, software quality, and data-driven validation. The team works closely with industry-standard machine learning frameworks and models, contributing to upstream open-source projects and collaborating across the wider software organization. Operating in a fast-paced environment, the team plays a critical role in ensuring reliability, performance, and maintainability across the ML software stack, helping to deliver robust and high-quality products to customers.
Responsibilities and Duties
- Design, implement, and maintain robust test infrastructure and automation for a complex ML software stack.
- Architect and evolve test frameworks and tooling with a focus on scalability, maintainability, and developer experience.
- Build and maintain CI/CD pipelines targeting simulators, emulators (e.g. QEMU), and physical hardware.
- Create representative ML workloads and gain insights from their execution (Numerical accuracy, performance analysis and benchmarking).
- Work closely with all Software development teams, supporting a culture of quality, security and maintainability.
- Review code and designs, setting a high bar for software engineering best practices.
- Mentor and support junior engineers, helping raise the overall technical capability of the team.
- Evaluate existing test strategies and infrastructure, identifying gaps and driving improvements aligned with team and organizational goals.
Candidate Profile
Essential
- Experience in production-quality software engineering roles.
- Strong software design and architecture skills, with experience working on large or complex systems.
- Strong proficiency in Python, including experience building and maintaining production codebases.
- Solid experience with CI/CD systems and automated testing (preferably GitHub-based workflows).
- Experience working in Linux environments.
- Familiarity with C or C++, with the ability to read, debug, and reason about low-level code when needed.
- Proven ability to mentor junior engineers and influence engineering practices within a team.
- Strong problem-solving skills and a proactive, self-directed approach to work.
- Bachelor/Master's/PhD or equivalent experience in Computer Science, Maths, Machine Learning, Data Science, or related field.
Desirable
- Exposure to machine learning frameworks such as PyTorch, JAX, Triton, TensorFlow.
- Experience with distributed workload management systems such as Kubernetes, VLLM, Keras or MLOps pipelines.
- Experience working with hardware simulators or emulators (e.g. QEMU).
- Experience developing for or working with FPGA-based systems.
- Experience with people management or mentoring.
Benefits
In addition to a competitive salary, Graphcore offers flexible working, a generous annual leave policy, private medical insurance and health cash plan, a dental plan, pension (matched up to 5%), life assurance and income protection. We have a generous parental leave policy and an employee assistance programme (which includes health, mental wellbeing, and bereavement support). We offer a range of healthy food and snacks at our central Bristol office and have our own barista bar! We welcome people of different backgrounds and experiences; we’re committed to building an inclusive work environment that makes Graphcore a great home for everyone. We offer an equal opportunity process and understand that there are visible and invisible differences in all of us. We can provide a flexible approach to interview and encourage you to chat to us if you require any reasonable adjustments. Applicants for this position must hold the right to work in the UK. Unfortunately at this time, we are unable to provide visa sponsorship or support for visa applications.
Staff Software Engineer (ML QA) employer: graphcore
Graphcore is an exceptional employer, offering a dynamic work environment in the heart of Bristol where innovation thrives. With a strong commitment to employee growth, we provide ample opportunities for mentorship and skill development, alongside competitive benefits such as flexible working arrangements, generous leave policies, and comprehensive health plans. Our inclusive culture fosters collaboration among talented individuals, making it a rewarding place to contribute to the future of AI technology.
StudySmarter Expert Advice🤫
We think this is how you could land Staff Software Engineer (ML QA)
✨Tip Number 1
Network like a pro! Reach out to current employees at Graphcore on LinkedIn or other platforms. A friendly chat can give you insider info and might even lead to a referral, which is always a bonus!
✨Tip Number 2
Prepare for the technical interview by brushing up on your Python skills and understanding CI/CD processes. Practice coding challenges and be ready to discuss your past projects in detail, especially those related to ML frameworks.
✨Tip Number 3
Show off your passion for AI and software quality during interviews. Share examples of how you've contributed to improving code quality or mentoring others. This will highlight your fit for the team culture at Graphcore.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Graphcore family.
We think you need these skills to ace Staff Software Engineer (ML QA)
Some tips for your application 🫡
Tailor Your Application:Make sure to customise your CV and cover letter for the Staff Software Engineer (ML QA) role. Highlight your experience with software design, automation, and CI/CD systems, as these are key aspects of what we're looking for.
Showcase Your Skills:Don’t just list your skills; demonstrate them! Include specific examples of projects where you've implemented robust test infrastructure or worked with machine learning frameworks. We love seeing how you’ve made an impact in previous roles.
Keep It Clear and Concise:While we appreciate detail, clarity is crucial. Make sure your application is easy to read and straight to the point. Use bullet points where possible to break up text and highlight your achievements.
Apply Through Our Website:We encourage you to apply directly through our website. This way, your application will be processed more efficiently, and you'll have access to all the latest updates about the role and our company!
How to prepare for a job interview at graphcore
✨Know Your Stuff
Make sure you brush up on your Python skills and understand the intricacies of CI/CD systems. Familiarise yourself with machine learning frameworks like PyTorch or TensorFlow, as well as any relevant experience with hardware simulators. This will show that you're not just a candidate, but someone who can hit the ground running.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific challenges you've faced in previous roles and how you tackled them. Graphcore values strong problem-solving abilities, so be ready to share examples that highlight your proactive approach and technical expertise.
✨Emphasise Team Collaboration
Since the role involves working closely with other software development teams, be prepared to talk about your experience in mentoring junior engineers and collaborating on projects. Highlight instances where you contributed to a culture of quality and maintainability within a team.
✨Ask Insightful Questions
Interviews are a two-way street! Prepare thoughtful questions about Graphcore's ML QA processes, team dynamics, and future projects. This not only shows your genuine interest in the role but also helps you assess if the company is the right fit for you.