At a Glance
- Tasks: Design and build data pipelines for cutting-edge AI research and development.
- Company: Join Canva, a leader in innovative design technology with a vibrant culture.
- Benefits: Equity packages, flexible leave, and a supportive parental leave policy.
- Other info: Remote work options and a focus on personal wellbeing.
- Why this job: Make a real impact in AI while enjoying a fun and dynamic work environment.
- Qualifications: Strong Python skills and experience with ML data workflows.
The predicted salary is between 80000 - 100000 £ per year.
Join the team redefining how the world experiences design. Our flagship campus is in Sydney, Australia but Austria is home to part of our European operations. You have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.
At Canva, our mission is to empower the world to design. We’re building AI that feels magical and lands real impact for millions of people - helping anyone create with confidence. We're looking for a Machine Learning Engineer to own the data foundations that power our multimodal agent research—building the pipelines, datasets, and tooling that turn ambitious research ideas into trainable reality.
About the team: We explore multimodal agentic architectures, build scalable training and evaluation loops, and partner closely with product and platform teams to turn breakthroughs into delightful product features. We are a cutting-edge post-training team, developing new multimodal agentic systems.
About the role: You'll be responsible for the data lifecycle that fuels our agent research: from collection and curation through to preprocessing, quality assurance, and delivery into training pipelines. You'll work closely with research scientists to understand what data is needed, then design and build the systems to make it happen—reliably and at scale.
What you'll do:
- Design and build data pipelines for agent training: collection, filtering, deduplication, formatting, and versioning across text, image, and multimodal sources.
- Build and maintain infrastructure for efficient data loading, storage, and retrieval at scale (S3, distributed systems, streaming pipelines).
- Collaborate with research scientists to translate research requirements into concrete data specifications, and iterate as experiments reveal new needs.
- Create evaluation datasets and benchmarks in collaboration with researchers—curating task distributions that surface real failure modes.
- Develop tooling for dataset construction—including human annotation workflows, synthetic data generation, and preference data collection for RLHF/DPO-style training.
- Own data quality: build validation frameworks, monitor for drift and contamination, and establish standards that make datasets trustworthy and reproducible.
- Document datasets thoroughly: provenance, known limitations, intended use cases, and versioning history.
- Implement comprehensive test coverage for data pipelines and ML workflows, ensuring reliability and catching regressions early.
- Elevate codebase quality through code reviews, refactoring, and establishing engineering best practices that help research velocity scale sustainably.
- Contribute to team roadmaps by identifying data bottlenecks and proposing solutions that unblock research velocity.
You're likely a match if you have:
- Strong software engineering skills in Python, with experience building production-grade data pipelines and ML DevOps.
- Practical experience with prompt engineering—designing, testing, and refining prompts for reliable LLM/VLM outputs.
- Experience with ML data workflows: large-scale data processing and loading (Ray, or similar), data versioning, and format considerations for training (tokenization, batching, sharding).
- Hands-on experience working with data pipelines for large-scale distributed ML training runs.
- Familiarity with annotation tooling and human-in-the-loop data collection (Label Studio or internal systems).
- Understanding of ML training requirements—you know what 'good data' looks like for LLM/VLM fine-tuning and can anticipate downstream issues.
- Experience loading and writing large datasets to/from cloud infrastructure (AWS) and distributed storage systems.
- Strong communication skills: you can work with researchers to scope ambiguous problems and translate needs into actionable plans.
- A collaborative approach, comfortable taking ownership and iterating quickly.
Nice to have:
- Experience with preference data collection for RLHF or reward modelling.
- Familiarity with multimodal data (image-text pairs, video, design assets).
- Experience building synthetic data generation pipelines using LLMs.
- Background in data quality metrics and monitoring systems.
- Contributions to dataset releases or benchmarks in the ML community.
What's in it for you? Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work.
We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process. We celebrate all types of skills and backgrounds at Canva so even if you don’t feel like your skills quite match what’s listed above - we still want to hear from you! Please note that interviews are conducted virtually.
Remote Senior Machine Learning Engineer - Multimodal Data in Newtownabbey employer: Canva
At Canva, we pride ourselves on fostering a vibrant and inclusive work culture that empowers our employees to thrive both personally and professionally. With flexible working arrangements, generous benefits like equity packages and an inclusive parental leave policy, we ensure that our Canvanauts have the support they need to achieve their goals while enjoying a fulfilling work-life balance. Join us in our mission to redefine design through innovative AI solutions, all from the comfort of your chosen workspace in beautiful Austria or beyond.
StudySmarter Expert Advice🤫
We think this is how you could land Remote Senior Machine Learning Engineer - Multimodal Data in Newtownabbey
✨Get Involved in Data Science Meetups
Tap into local data science meetups or workshops to connect with fellow enthusiasts and professionals. These events are goldmines for networking, and sometimes even lead directly to job openings at companies like Canva!
✨Show Off Your Projects
Start building a public portfolio showcasing your data science projects on platforms like GitHub or personal websites. Highlight unique analyses or models you've developed. This not only demonstrates your skills but also gets your name out there for roles like Remote Senior Machine Learning Engineer - Multimodal Data at Canva.
✨Leverage Professional Networks
Join professional bodies related to data science, like the Data Science Society or similar organisations. Getting involved can lead to mentorship opportunities and insider knowledge about full-time positions at companies like Canva.
✨Apply Directly through Our Website
When you find a suitable opening like Remote Senior Machine Learning Engineer - Multimodal Data at Canva, make sure to apply directly through our website. It gives you an edge and shows you're keen to join our team. Plus, who doesn’t love a direct application? It’s easier than navigating through job boards!
We think you need these skills to ace Remote Senior Machine Learning Engineer - Multimodal Data in Newtownabbey
Some tips for your application 🫡
Show Off Your Projects:In the world of data science, your projects can speak volumes about your skills. Make sure to showcase a few key projects in your CV or portfolio, especially those that highlight your ability to work with data sets, build models, or use relevant tools like Python, R, or SQL. Don’t forget to include links to any GitHub repositories if applicable!
Quantify Your Achievements:Employers love numbers! When drafting your CV, highlight your achievements with quantifiable results. For instance, mention how your data analysis led to a certain percentage increase in efficiency or revenue at a previous job or project. These details can really make your application pop!
Craft a Tailored Cover Letter:For a full-time role at Canva, your cover letter should reflect your passion for data science and your excitement about the specific projects or values of the company. Dive into why you’re a good fit, how your skills align with their needs, and any unique perspectives you can bring to the team.
Stand Out with Relevant Courses and Certifications:Although experience talks, relevant courses or certifications can be your ticket to impressing hiring managers at Canva. Mention any standout courses you've completed that equipped you with essential skills, such as machine learning certifications or data visualisation courses. This shows your commitment to continuously developing your skills in the field!
How to prepare for a job interview at Canva
✨Brush Up on Your Statistics
For a data science role, we need to seriously sharpen our statistics skills. Get ready to tackle technical questions on probability distributions, hypothesis testing, and regression analysis. These are often the bread and butter of data science interviews, so don't just skim over them!
✨Showcase Your Projects
Prepare a killer portfolio showcasing your data science projects. We should include details about the datasets used, the tools and techniques applied, and the impact of your findings. If we can walk them through a particularly challenging project or a cool visualisation that had real-world implications, it’ll really make us stand out!
✨Get Comfortable with Python and R
Most data science positions require us to be proficient in programming languages like Python and R. We should practice common libraries like pandas, NumPy, and scikit-learn, and be ready for live coding exercises or algorithm questions. Showing off our coding chops can really impress the interviewers at Canva!
✨Prepare for Case Studies
Expect to encounter real-world case studies during the interview. We might be asked how we’d approach a data problem or analyse a dataset to extract insights. It's essential to think out loud and demonstrate our problem-solving process so that the interviewer can see our logical thinking in action.