At a Glance
- Tasks: Design and develop advanced computer vision systems for the construction industry.
- Company: Join Depixen, a leading tech company revolutionising construction with AI.
- Benefits: Competitive salary, flexible work options, and opportunities for professional growth.
- Other info: Collaborative environment with exciting projects and career advancement opportunities.
- Why this job: Make a real impact by connecting visual AI with verified data in a complex industry.
- Qualifications: Bachelor's degree in a relevant field and 3-6 years of experience in computer vision.
The predicted salary is between 60000 - 80000 € per year.
About the Role
This role is based at Depixen’s London Office. Depixen is a London-based technology company building the digital decision infrastructure of the construction industry. As a corporate member of the World Wide Web Consortium (W3C), Depixen develops W3C-compliant Linked Data architectures, domain-specific ontologies, taxonomy models, RDF-based data structures, and knowledge graph infrastructures for the construction sector. Through its AI projects with respected universities and institutions in the United Kingdom, Depixen continues to build a scalable global structure across the United States, Europe, and the Far East.
Computer vision is one of the critical perception layers of this infrastructure. In construction, architecture, and building products, visual data is not merely unstructured; it is deeply contextual. Product images, technical documents, drawings, site photos, spatial data, material surfaces, and building elements must be interpreted together with verified technical knowledge, semantic classifications, and machine-interpretable data models. This is not a conventional image recognition role. You will help connect visual AI outputs with verified data, taxonomy, ontology, RDF, and knowledge graph layers, turning perception into reliable decision intelligence for the construction industry.
We are seeking a talented Computer Vision Engineer to design, develop, and productionize advanced perception systems for real-world construction industry use cases. In this role, you will work on object detection, segmentation, OCR, visual matching, product recognition, building element analysis, site image interpretation, video analytics, and multimodal vision-language applications. The ideal candidate is analytical, research-driven, collaborative, and capable of turning advanced computer vision ideas into reliable production systems that address real industry problems.
Responsibilities
- Design, develop, and evaluate robust, scalable, and verifiable computer vision models and pipelines using modern deep learning frameworks.
- Build and optimize end-to-end vision systems covering data preprocessing, model development, deployment, monitoring, and continuous improvement.
- Develop systems for object detection, segmentation, tracking, OCR, image classification, visual matching, product recognition, and video analytics as required.
- Collaborate with data and modelling teams to connect visual AI outputs with taxonomy, ontology, RDF, and knowledge graph layers.
- Work with cross-functional teams to understand product requirements and translate them into scalable technical solutions.
- Develop automated testing, benchmarking, and evaluation workflows to ensure the performance, reliability, and safety of computer vision applications.
- Optimize models and inference pipelines for scalability, latency, and cost-efficiency across GPU and CPU environments.
- Contribute to dataset design, annotation processes, data quality validation, and tools that improve model performance and reliability.
- Translate research-level computer vision and multimodal AI approaches into production-ready technical solutions.
Required Qualifications
- Bachelor’s degree in Computer Science, Electrical Engineering, Computer Engineering, Artificial Intelligence Engineering, or a related field.
- 3-6 years of experience in computer vision, deep learning, or a related AI field.
- Strong proficiency in Python.
- Hands-on experience with deep learning frameworks such as PyTorch, TensorFlow, or similar.
- Practical experience developing, testing, and deploying computer vision models in production environments.
- Solid technical understanding of convolutional and transformer-based architectures, such as CNNs, ViT, YOLO, and Detectron2.
- Hands-on experience in several of the following areas: object detection, segmentation, OCR, tracking, image classification, or video analytics.
- Experience with ML Ops practices and tools such as Docker, Kubernetes, MLflow, and Weights & Biases.
- Systematic approach to model evaluation, benchmarking, data quality control, and error analysis.
- Familiarity with GPU/CPU inference optimization, latency management, and model deployment workflows.
- Ability to analyse technical problems clearly, document solutions effectively, and communicate across teams.
Preferred Qualifications
- Master’s or PhD degree in a relevant field.
- Experience implementing or fine-tuning vision-language models such as CLIP, BLIP, or SAM.
- Experience with multimodal AI, visual grounding, open-vocabulary detection, or image-text retrieval.
- Familiarity with edge deployment frameworks such as TensorRT, OpenVINO, or ONNX Runtime.
- Experience with 3D vision, point clouds, depth estimation, SLAM, spatial intelligence, or digital twin-based visual analysis.
- Experience working with construction, architecture, construction technologies, BIM, technical document analysis, or product data systems.
- Experience with OCR, technical document processing, drawing analysis, catalogue data extraction, or visual product matching.
- Contributions to open-source computer vision projects.
- Experience deploying models on cloud platforms such as AWS, GCP, or Azure.
- Experience with data annotation, synthetic data, active learning, or dataset quality management.
Problem Areas You May Work On
- Visual recognition and classification of building products.
- Matching product images with technical data, catalogue information, and semantic classifications.
- OCR-based data extraction from technical documents, catalogues, and PDFs.
- Joint analysis of architectural drawings, site photos, and product images.
- Detection of building elements and material surfaces.
- Linking visual data with taxonomy, ontology, and knowledge graph structures.
- Applying vision-language models in the construction industry context.
- Quality, compliance, or contextual analysis from site imagery.
- Connecting visual AI outputs to verified, structured, and machine-interpretable data infrastructure.
Why This Role Different
The construction industry is one of the most complex application areas for computer vision because it brings together architecture, engineering, construction, building materials, and site operations. In this domain, the meaning of an image cannot be derived from pixels alone. The product class, technical standard, usage context, relationship to building elements, material characteristics, performance values, and verifiable data counterpart must be considered together. At Depixen, computer vision outputs are not treated as isolated predictions. They are treated as decision components connected to verified data, semantic classification, ontology, RDF, and knowledge graph layers. This makes the role not only about model development, but about building reliable, contextual, and verifiable AI systems for the construction industry.
AI Engineer [Computer Vision] in City of London employer: Depixen
Depixen is an exceptional employer, offering a dynamic work environment in the heart of London where innovation meets collaboration. Employees benefit from a culture that prioritises professional growth through hands-on experience with cutting-edge AI technologies and opportunities to engage in impactful projects within the construction industry. With a commitment to developing scalable solutions and a focus on meaningful contributions, Depixen empowers its team to turn advanced ideas into reliable systems that drive real-world change.
StudySmarter Expert Advice🤫
We think this is how you could land AI Engineer [Computer Vision] in City of London
✨Tip Number 1
Network like a pro! Attend industry meetups, conferences, or online webinars related to AI and computer vision. It's a great way to meet people in the field, learn about job openings, and get your name out there.
✨Tip Number 2
Show off your skills! Create a portfolio showcasing your projects, especially those related to computer vision. Include links to GitHub repos or any live demos. This will give potential employers a taste of what you can do.
✨Tip Number 3
Don’t just apply blindly! Tailor your approach for each company. Research Depixen and understand their projects. When you reach out, mention how your skills align with their needs in the construction tech space.
✨Tip Number 4
Use our website to apply! We make it easy for you to submit your application directly. Plus, it shows you're genuinely interested in joining our team at StudySmarter and working on exciting AI projects.
We think you need these skills to ace AI Engineer [Computer Vision] in City of London
Some tips for your application 🫡
Tailor Your CV:Make sure your CV is tailored to the AI Engineer role. Highlight your experience with computer vision, deep learning frameworks, and any relevant projects you've worked on. We want to see how your skills align with what we're looking for!
Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Use it to explain why you're passionate about computer vision in the construction industry. Share specific examples of your work and how it relates to our mission at Depixen. Let us know why you’re the perfect fit!
Showcase Your Projects:If you've got any projects or contributions to open-source that demonstrate your skills, make sure to include them! We love seeing practical applications of your knowledge, especially in areas like object detection or video analytics.
Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re serious about joining the Depixen team!
How to prepare for a job interview at Depixen
✨Know Your Tech Inside Out
Make sure you’re well-versed in the deep learning frameworks mentioned in the job description, like PyTorch and TensorFlow. Brush up on your knowledge of convolutional and transformer-based architectures, as these will likely come up during technical discussions.
✨Showcase Real-World Applications
Prepare to discuss specific projects where you've developed or deployed computer vision models. Highlight how you tackled challenges in object detection, segmentation, or OCR, and be ready to explain how your solutions can apply to the construction industry.
✨Collaborate Like a Pro
Since this role involves working with cross-functional teams, think of examples that demonstrate your collaborative skills. Be ready to talk about how you’ve effectively communicated technical concepts to non-technical stakeholders or worked alongside data and modelling teams.
✨Ask Insightful Questions
Prepare thoughtful questions about Depixen’s projects and their approach to integrating visual AI outputs with verified data. This shows your genuine interest in the role and helps you understand how you can contribute to their mission in the construction sector.