Sr Research Engineer, Computer Vision
Sr Research Engineer, Computer Vision

Sr Research Engineer, Computer Vision

Full-Time 43200 - 72000 £ / year (est.) No home office possible
Autodesk, Inc.

At a Glance

  • Tasks: Design and build cutting-edge computer vision systems for real-world applications.
  • Company: Join Autodesk, a leader in innovative software solutions.
  • Benefits: Competitive salary, bonuses, stock options, and comprehensive benefits.
  • Why this job: Make an impact with advanced technology in a collaborative environment.
  • Qualifications: Experience in computer vision and deep learning using Python.
  • Other info: Flexible work options and a culture of diversity and belonging.

The predicted salary is between 43200 - 72000 £ per year.

We are hiring a Senior Software Engineer focused on Computer Vision and Multimodal AI to build robust perception and understanding systems used across multiple teams and product areas. You will develop end-to-end pipelines that transform images and video into structured, reliable observations by combining modern vision models with multimodal reasoning and contextual signals (for example: domain metadata, documents, and sensor inputs).

This role blends applied research with strong software engineering: rapid iteration, rigorous evaluation, and production-minded implementation for cloud-scale batch processing and interactive workflows.

Key Responsibilities
  • Design, build, and improve multi-stage computer vision pipelines that may include segmentation, detection, tracking, and VLM-based analysis, producing structured outputs (entities, attributes, actions/events, confidence, provenance).
  • Build systems that handle real-world variability in visual inputs (for example: low resolution, poor lighting, motion blur, cluttered scenes, inconsistent capture devices).
  • Work with diverse media types such as photos, video, timelapse, 360 video, and RGB-D when available.
  • Fuse visual evidence with contextual inputs such as metadata, documents, and sensor streams to improve recognition quality and reduce ambiguity.
  • Evaluate and integrate state-of-the-art vision and vision-language foundation models, including open-vocabulary recognition, grounded perception, segmentation, and multimodal reasoning.
  • Apply fine-tuning or adaptation approaches when needed; partner with ML teams on training, data strategy, and infrastructure best practices.
  • Define measurable acceptance criteria and benchmarking for accuracy, robustness, latency/cost, and reliability across datasets and domains.
  • Build scalable cloud workflows for batch processing and integrate outputs with APIs and downstream consumers.
  • Improve operational performance and cost via batching, caching, model selection, and pipeline observability.
  • Write maintainable code, contribute to design docs, code reviews, shared libraries, and cross-team technical decisions.
Minimum Qualifications
  • Bachelor’s degree in Computer Science, Electrical Engineering, Robotics, or related field (or equivalent practical experience).
  • 4+ years of experience building computer vision systems using Python.
  • Strong experience with deep learning for computer vision (detection, segmentation, and/or video understanding) using modern frameworks such as PyTorch.
  • Experience taking ML prototypes into reliable pipelines, including evaluation, monitoring, and failure analysis.
  • Experience building or integrating ML systems into cloud or backend workflows (batch processing and/or services).
  • Strong collaboration and communication skills; ability to work across teams and stakeholders.
Preferred Qualifications
  • Experience with vision-language models (VLMs) and multimodal systems (for example: grounded vision, open-vocabulary recognition, retrieval-augmented multimodal reasoning).
  • Experience with multimodal fusion (combining imagery/video with metadata, documents, and sensor signals).
  • Experience with video pipelines (tracking, temporal aggregation, long-video processing).
  • Experience with real-world datasets, including data curation, labelling strategy, augmentation, and quality control under limited data constraints.
  • Experience developing reusable platform components adopted across multiple teams.
What Success Looks Like
  • Delivered an end-to-end system that ingests real-world image/video inputs and outputs a structured, queryable set of observations (objects plus activities/events), with clear accuracy and reliability metrics.
  • Demonstrated robustness to common visual failure modes (lighting, occlusion, clutter, camera variation) and measurable improvements when contextual signals are available.
  • Built a modular pipeline architecture (segmentation/detection/VLM reasoning components) that can be reused and extended across domains and teams.
  • Maintained strong engineering quality: reproducible experiments, documented decisions, maintainable code, and dependable integrations.
Keywords (for candidate matching)
  • Computer Vision
  • Deep Learning
  • PyTorch
  • Object Detection
  • Segmentation
  • Tracking
  • Video Understanding
  • Vision-Language Models (VLM)
  • Multimodal AI
  • Open-Vocabulary
  • Grounding
  • Sensor Fusion
  • Data Curation
  • Model Evaluation
  • Benchmarking
  • Cloud ML Pipelines
  • Batch Processing
  • MLOps
  • Observability

Sr Research Engineer, Computer Vision employer: Autodesk, Inc.

At Autodesk, we are committed to fostering a vibrant work culture that prioritises innovation and collaboration, making us an exceptional employer for a Senior Research Engineer in Computer Vision. Our flexible work arrangements, comprehensive benefits package, and emphasis on diversity and belonging create an environment where employees can thrive and grow professionally. With opportunities to engage in cutting-edge projects and contribute to meaningful advancements in technology, our London location offers a dynamic setting for those looking to make a significant impact in their field.
Autodesk, Inc.

Contact Detail:

Autodesk, Inc. Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land Sr Research Engineer, Computer Vision

✨Tip Number 1

Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can refer you directly.

✨Tip Number 2

Show off your skills! Create a portfolio showcasing your projects, especially those related to computer vision and multimodal AI. This gives potential employers a taste of what you can do and sets you apart from the crowd.

✨Tip Number 3

Prepare for interviews by brushing up on common technical questions and coding challenges. Practice explaining your thought process clearly, as communication is key when working across teams.

✨Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team.

We think you need these skills to ace Sr Research Engineer, Computer Vision

Computer Vision
Deep Learning
Python
PyTorch
Object Detection
Segmentation
Tracking
Video Understanding
Vision-Language Models (VLM)
Multimodal AI
Sensor Fusion
Data Curation
Model Evaluation
Cloud ML Pipelines
Batch Processing

Some tips for your application 🫡

Tailor Your CV: Make sure your CV is tailored to the role of Sr Research Engineer in Computer Vision. Highlight your experience with computer vision systems, deep learning, and any relevant projects that showcase your skills.

Craft a Compelling Cover Letter: Your cover letter should tell us why you're passionate about computer vision and how your background makes you a great fit for our team. Be sure to mention specific experiences that align with the job description.

Showcase Your Projects: If you've worked on any relevant projects, whether personal or professional, make sure to include them. We love seeing practical applications of your skills, especially in building robust pipelines and handling real-world variability.

Apply Through Our Website: Don't forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it shows us you’re keen on joining our team!

How to prepare for a job interview at Autodesk, Inc.

✨Know Your Tech Inside Out

Make sure you’re well-versed in the latest computer vision technologies and frameworks, especially PyTorch. Brush up on your understanding of deep learning concepts, segmentation, and detection techniques. Being able to discuss these topics confidently will show that you're not just familiar with the tools, but you can also apply them effectively.

✨Showcase Your Projects

Prepare to talk about specific projects where you've built or integrated computer vision systems. Highlight the challenges you faced, how you overcame them, and the impact of your work. This will demonstrate your hands-on experience and problem-solving skills, which are crucial for this role.

✨Understand the Company’s Vision

Research Autodesk and its approach to computer vision and multimodal AI. Familiarise yourself with their products and how they leverage technology. This knowledge will help you align your answers with their goals and show that you’re genuinely interested in contributing to their mission.

✨Prepare for Technical Questions

Expect technical questions that assess your understanding of building scalable cloud workflows and handling real-world variability in visual inputs. Practice explaining your thought process clearly and concisely, as communication is key when discussing complex technical topics.

Sr Research Engineer, Computer Vision
Autodesk, Inc.

Land your dream job quicker with Premium

You’re marked as a top applicant with our partner companies
Individual CV and cover letter feedback including tailoring to specific job roles
Be among the first applications for new jobs with our AI application
1:1 support and career advice from our career coaches
Go Premium

Money-back if you don't land a job in 6-months

>