Master thesis : 3D object registration with deep learning
Wallon, Robin
Promoteur(s) : Van Droogenbroeck, Marc
Date de soutenance : 27-jui-2022/28-jui-2022 • URL permanente : http://hdl.handle.net/2268.2/15685
Détails
Titre : | Master thesis : 3D object registration with deep learning |
Auteur : | Wallon, Robin |
Date de soutenance : | 27-jui-2022/28-jui-2022 |
Promoteur(s) : | Van Droogenbroeck, Marc |
Membre(s) du jury : | Boigelot, Bernard
Phillips, Christophe Greffe, Nathan |
Langue : | Anglais |
Discipline(s) : | Ingénierie, informatique & technologie > Sciences informatiques |
Institution(s) : | Université de Liège, Liège, Belgique |
Diplôme : | Master : ingénieur civil en science des données, à finalité spécialisée |
Faculté : | Mémoires de la Faculté des Sciences appliquées |
Résumé
[en] This thesis was realized within the company Euresys. The main objective is to implement an easy-to-implement deep learning solution to infer the pose of an object on a Zmap image acquisition to replace the existing non-deep learning solution from Euresys whose robustness could be improved. Indeed, the previous solution encounters difficulties when the object is far from being perpendicular to the point of view, as the solution is based mainly on 2D alignment. This new deep learning solution offers more flexibility to capture the object in any view.
Since obtaining annotation for the 3D object registration can be costly and time-consuming. Thus, synthetic images of the Zmap format are created by projecting a 3D model on a reference plane.
By exploring different existing methods, an extension of the model YOLOV5small to solve this task is proposed in two steps. First, a 2D object detector is designed to show us the feasibility of training a model on the synthetic dataset to infer predictions on the acquisition. In the process, we noticed that good results can be achieved by changing the reference plane of the Zmap acquisition during the test and by using appropriate data augmentation during training.
Then, an extension of the 2D model detector to infer the 3D object registration is proposed by discretizing the model space into viewpoints and adding a regression parameter to infer the in-plane rotation. Unfortunately, we did not obtain metrics of the accuracy of the prediction on the acquisition but proposed a comparaison between the acquisition and a synthetic reproduction. This gave us interesting results but we thought that they could be improved. For example, by using the refinement step (ICP) already used by Euresys for the previous solution which could help to have better results. This thesis concludes with information on potential improvements and scope extensions that could be made for further work.
Fichier(s)
Document(s)
Description: One page summary
Taille: 209.85 kB
Format: Adobe PDF
Description: Thesis report
Taille: 17.36 MB
Format: Adobe PDF
Annexe(s)
Description: Annexe to illustrate the synthetic dataset used for the 2D object detector
Taille: 5.16 MB
Format: Unknown
Description: Annexe to illustrate the synthetic dataset used
Taille: 5.75 MB
Format: Unknown
Description: Annexe to illustrate the synthetic dataset used
Taille: 18.89 MB
Format: Unknown
Description: Annexe to illustrate the synthetic dataset used
Taille: 43.11 MB
Format: Unknown
Citer ce mémoire
L'Université de Liège ne garantit pas la qualité scientifique de ces travaux d'étudiants ni l'exactitude de l'ensemble des informations qu'ils contiennent.