Faculté des Sciences appliquées
MASTER THESIS
VIEW 145 | DOWNLOAD 775

Energy-based Multi-Modal Attention

Werenne, Aurélien ULiège
Promotor(s) : Marée, Raphaël ULiège
Date of defense : 9-Sep-2019/10-Sep-2019 • Permalink : http://hdl.handle.net/2268.2/7854
Details
Title : Energy-based Multi-Modal Attention
Author : Werenne, Aurélien ULiège
Date of defense  : 9-Sep-2019/10-Sep-2019
Advisor(s) : Marée, Raphaël ULiège
Committee's member(s) : Geurts, Pierre ULiège
Louppe, Gilles ULiège
Embrechts, Jean-Jacques ULiège
Language : English
Number of pages : 74
Keywords : [en] Multimodal, Deep Learning, Attention, Robustness
Discipline(s) : Engineering, computing & technology > Computer science
Target public : Researchers
Professionals of domain
Student
General public
Complementary URL : https://github.com/Werenne/energy-based-multimodal-attention
Institution(s) : Université de Liège, Liège, Belgique
Degree: Master en ingénieur civil en informatique, à finalité spécialisée en "intelligent systems"
Faculty: Master thesis of the Faculté des Sciences appliquées

Abstract

[en] A multi-modal neural network exploits information from different channels and in different forms (e.g., images, text, sounds, sensor measurements), in the hope that the information carried by each mode is complementary and thereby improves the predictions of the neural network. In realistic situations, however, perturbations of varying intensity can affect the data of the modes, which may degrade the quality of the inference process. An additional difficulty is that these perturbations vary between modes and on a per-sample basis. This work presents a solution to this problem. The three main contributions are described below.
First, a novel attention module is designed, analysed and implemented. This attention module is constructed to help multi-modal networks handle modes with perturbations.

Secondly, two new regularizers are developed so that the robustness gain generalizes better to modes that fail more severely than those seen in the training set.

Lastly, a unified multi-modal attention module is presented, combining the main types of attention mechanisms in the deep learning literature with our module. We suggest that this unified module could be coupled with a prediction model to enable the latter to face unexpected situations and to improve the extraction of relevant information from the data.


File(s)

Document(s)

File
Access main.pdf
Description: Master Thesis
Size: 5.33 MB
Format: Adobe PDF

Annexe(s)

File
Access summary.pdf
Description: Summary
Size: 45.9 kB
Format: Adobe PDF

Author

  • Werenne, Aurélien ULiège Université de Liège > Master ingé. civ. info., à fin.

Promotor(s)

  • Marée, Raphaël ULiège

Committee's member(s)

  • Geurts, Pierre ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Algorith. des syst. en interaction avec le monde physique
  • Louppe, Gilles ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Big Data
  • Embrechts, Jean-Jacques ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Techniques du son et de l'image

All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.