Feedback

Faculté des Sciences appliquées
Faculté des Sciences appliquées
Mémoire
VIEW 96 | DOWNLOAD 1

Master thesis : Depth estimation from sports broadcast videos

Télécharger
Liesse, Valentine ULiège
Promoteur(s) : Van Droogenbroeck, Marc ULiège
Date de soutenance : 27-jui-2022/28-jui-2022 • URL permanente : http://hdl.handle.net/2268.2/14382
Détails
Titre : Master thesis : Depth estimation from sports broadcast videos
Auteur : Liesse, Valentine ULiège
Date de soutenance  : 27-jui-2022/28-jui-2022
Promoteur(s) : Van Droogenbroeck, Marc ULiège
Membre(s) du jury : Barnich, Olivier 
Cioppa, Anthony ULiège
Langue : Anglais
Nombre de pages : 109
Mots-clés : [en] Deep learning
[en] computer vision
[en] depth estimation
[en] self-supervised
[en] monodepth2
[en] stereo vision
[en] monocular vision
Discipline(s) : Ingénierie, informatique & technologie > Sciences informatiques
Institution(s) : Université de Liège, Liège, Belgique
Diplôme : Master : ingénieur civil électricien, à finalité spécialisée en "signal processing and intelligent robotics"
Faculté : Mémoires de la Faculté des Sciences appliquées

Résumé

[en] Nowadays, broadcasters must continually innovate to create compelling content that evokes emotions in viewers. Photo-realistic depth of field effect, augmented reality, or even 3D reconstruction could be one way to achieve this. Depth estimation is one of the steps to video understanding that allows these applications.

In this master thesis, methods to estimate a depth map from a single image were explored. Especially, deep learning methods were preferred to those based on geometry or the use of sensors. Thus, the solution provided by this work is entirely software-based.

In particular, a custom model has been created for a sports broadcast dataset. However, this dataset does not contain any ground truth depth map. Hence, self-supervised monocular depth estimation methods were investigated in this document. Specifically, trainings were performed with non-rectified and synchronized stereo images captured with uncalibrated cameras. In contrast to trainings based on monocular sequences, the scenes are static and the only movement is that of the camera. This is the ideal configuration for training with self-supervised methods. However, to our knowledge, we are the only ones to have attempted to train self-supervised monocular depth estimation methods using non-rectified stereo images captured with uncalibrated cameras. More precisely, in this work, the intrinsic and extrinsic parameters of the cameras are estimated during training along with the depth map.

During the course of this research, we were able to demonstrate the ability of self-supervised methods to predict coherent depth maps when trained on non-rectified stereo sports broadcast videos captured with uncalibrated cameras. Moreover, we addressed the limitations regarding the generation of depth maps under those conditions.


Fichier(s)

Document(s)

File
Access Master_thesis_LIESSE_Valentine.pdf
Description:
Taille: 172.19 MB
Format: Adobe PDF

Annexe(s)

File
Access Images_LIESSE_Valentine.pdf
Description:
Taille: 1.91 MB
Format: Adobe PDF
File
Access Abstract_LIESSE_Valentine.pdf
Description:
Taille: 174.93 kB
Format: Adobe PDF

Auteur

  • Liesse, Valentine ULiège Université de Liège > Master ingé. civ. électr., à fin.

Promoteur(s)

Membre(s) du jury

  • Barnich, Olivier EVS, rue Bois Saint Jean 13, 4102 SERAING
  • Cioppa, Anthony ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Robotique intelligente
    ORBi Voir ses publications sur ORBi
  • Nombre total de vues 96
  • Nombre total de téléchargements 1










Tous les documents disponibles sur MatheO sont protégés par le droit d'auteur et soumis aux règles habituelles de bon usage.
L'Université de Liège ne garantit pas la qualité scientifique de ces travaux d'étudiants ni l'exactitude de l'ensemble des informations qu'ils contiennent.