At a Glance
- Tasks: Design and build cutting-edge computer vision systems for real-world applications.
- Company: Join Autodesk, a leader in innovative software solutions.
- Benefits: Competitive salary, bonuses, stock options, and comprehensive benefits.
- Why this job: Make an impact with advanced technology in a collaborative environment.
- Qualifications: Experience in computer vision and deep learning using Python.
- Other info: Flexible work options and a culture of diversity and belonging.
The predicted salary is between 36000 - 60000 £ per year.
We are hiring a Senior Software Engineer focused on Computer Vision and Multimodal AI to build robust perception and understanding systems used across multiple teams and product areas. You will develop end-to-end pipelines that transform images and video into structured, reliable observations by combining modern vision models with multimodal reasoning and contextual signals (for example: domain metadata, documents, and sensor inputs).
This role blends applied research with strong software engineering: rapid iteration, rigorous evaluation, and production-minded implementation for cloud-scale batch processing and interactive workflows.
Key Responsibilities- Design, build, and improve multi-stage computer vision pipelines that may include segmentation, detection, tracking, and VLM-based analysis, producing structured outputs (entities, attributes, actions/events, confidence, provenance).
- Build systems that handle real-world variability in visual inputs (for example: low resolution, poor lighting, motion blur, cluttered scenes, inconsistent capture devices).
- Work with diverse media types such as photos, video, timelapse, 360 video, and RGB-D when available.
- Fuse visual evidence with contextual inputs such as metadata, documents, and sensor streams to improve recognition quality and reduce ambiguity.
- Evaluate and integrate state-of-the-art vision and vision-language foundation models, including open-vocabulary recognition, grounded perception, segmentation, and multimodal reasoning.
- Apply fine-tuning or adaptation approaches when needed; partner with ML teams on training, data strategy, and infrastructure best practices.
- Define measurable acceptance criteria and benchmarking for accuracy, robustness, latency/cost, and reliability across datasets and domains.
- Build scalable cloud workflows for batch processing and integrate outputs with APIs and downstream consumers.
- Improve operational performance and cost via batching, caching, model selection, and pipeline observability.
- Write maintainable code, contribute to design docs, code reviews, shared libraries, and cross-team technical decisions.
- Bachelor's degree in Computer Science, Electrical Engineering, Robotics, or related field (or equivalent practical experience).
- 4+ years of experience building computer vision systems using Python.
- Strong experience with deep learning for computer vision (detection, segmentation, and/or video understanding) using modern frameworks such as PyTorch.
- Experience taking ML prototypes into reliable pipelines, including evaluation, monitoring, and failure analysis.
- Experience building or integrating ML systems into cloud or backend workflows (batch processing and/or services).
- Strong collaboration and communication skills; ability to work across teams and stakeholders.
- Experience with vision-language models (VLMs) and multimodal systems (for example: grounded vision, open-vocabulary recognition, retrieval-augmented multimodal reasoning).
- Experience with multimodal fusion (combining imagery/video with metadata, documents, and sensor signals).
- Experience with video pipelines (tracking, temporal aggregation, long-video processing).
- Experience with real-world datasets, including data curation, labelling strategy, augmentation, and quality control under limited data constraints.
- Experience developing reusable platform components adopted across multiple teams.
- Delivered an end-to-end system that ingests real-world image/video inputs and outputs a structured, queryable set of observations (objects plus activities/events), with clear accuracy and reliability metrics.
- Demonstrated robustness to common visual failure modes (lighting, occlusion, clutter, camera variation) and measurable improvements when contextual signals are available.
- Built a modular pipeline architecture (segmentation/detection/VLM reasoning components) that can be reused and extended across domains and teams.
- Maintained strong engineering quality: reproducible experiments, documented decisions, maintainable code, and dependable integrations.
- Computer Vision
- Deep Learning
- PyTorch
- Object Detection
- Segmentation
- Tracking
- Video Understanding
- Vision-Language Models (VLM)
- Multimodal AI
- Open-Vocabulary
- Grounding
- Sensor Fusion
- Data Curation
- Model Evaluation
- Benchmarking
- Cloud ML Pipelines
- Batch Processing
- MLOps
- Observability
Sr Research Engineer, Computer Vision in London employer: Autodesk, Inc.
Contact Detail:
Autodesk, Inc. Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Sr Research Engineer, Computer Vision in London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to computer vision and multimodal AI. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on common technical questions and coding challenges. Practice explaining your thought process clearly, as communication is key when working across teams.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team.
We think you need these skills to ace Sr Research Engineer, Computer Vision in London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the role of Sr Research Engineer in Computer Vision. Highlight your experience with computer vision systems, deep learning, and any relevant projects that showcase your skills. We want to see how your background aligns with what we're looking for!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about computer vision and how your experience makes you a great fit for our team. Be sure to mention specific projects or technologies you've worked with that relate to the job description.
Showcase Your Projects: If you've worked on any relevant projects, whether personal or professional, make sure to include them in your application. We love seeing practical examples of your work, especially those that demonstrate your ability to build and improve computer vision pipelines.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you're serious about joining our team at StudySmarter!
How to prepare for a job interview at Autodesk, Inc.
✨Know Your Tech Inside Out
Make sure you’re well-versed in the latest computer vision technologies and frameworks, especially PyTorch. Brush up on your understanding of deep learning concepts, segmentation, and detection techniques. Being able to discuss these topics confidently will show that you're not just familiar with the theory but can also apply it practically.
✨Showcase Your Projects
Prepare to talk about specific projects where you've built or improved computer vision systems. Highlight your role, the challenges you faced, and how you overcame them. If possible, bring along examples of your work or even a demo to illustrate your skills and experience.
✨Understand the Company’s Vision
Research Autodesk and its products, especially those related to computer vision and multimodal AI. Understanding their goals and how your skills can contribute to their success will help you tailor your responses and demonstrate your genuine interest in the role.
✨Prepare for Technical Questions
Expect to face technical questions that assess your problem-solving skills and knowledge of computer vision pipelines. Practice explaining your thought process clearly and concisely. You might be asked to solve a problem on the spot, so being comfortable with coding challenges is key!