
Faculty of Applied Sciences
Master's thesis

ChatBot with GANs

Castillo Lenz, Sergio Miguel ULiège
Supervisor(s): Ittoo, Ashwin ULiège
Defense date: 22 Jan 2021 • Permanent URL: http://hdl.handle.net/2268.2/11395
Details
Title: ChatBot with GANs
Translated title: [fr] ChatBot avec GANs
Author: Castillo Lenz, Sergio Miguel ULiège
Defense date: 22 Jan 2021
Supervisor(s): Ittoo, Ashwin ULiège
Jury member(s): Hiard, Samuel ULiège
Louppe, Gilles ULiège
Language: English
Keywords: [en] machine learning
[en] gan
[en] auto-encoders
[en] daily dialog
[en] deep learning
Discipline(s): Engineering, computing & technology > Computer science
Target audience: Researchers
Professionals in the field
Students
Institution(s): Université de Liège, Liège, Belgium
Degree: Master's in computer science, with a specialized focus in "intelligent systems"
Faculty: Master's theses of the Faculty of Applied Sciences

Abstract

[en] Since their introduction in 2014 [Goodfellow et al., 2014], Generative Adversarial Networks (GANs) have gone through several evolutions, reaching a state where they can generate realistic images for any given context. These improvements, both in complexity and in stability, have enabled successful applications of GAN frameworks in computer vision and transfer learning. By contrast, GANs have seen few successful applications in Natural Language Processing (NLP), where models based on the Transformer architecture, such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Training (GPT), remain the state of the art for various NLP tasks.

Given this situation, this thesis investigates why GANs remain underused for NLP tasks. To that end, we explore several proposals from the research literature in the area of dialog systems, using data from the Daily Dialog dataset, a human-written, multi-turn dialog set reflecting daily human communication.

Moreover, we investigate the influence of the embedding layer in the proposed GAN models. First, we test pre-trained word-level embeddings, such as Stanford's GloVe and spaCy embeddings.

Second, we train the model using our own word embeddings, learned from the Daily Dialog dataset with the Word2Vec algorithm. Third, we explore the idea of using BERT as a source of contextualized word embeddings. These experiments show that using pre-trained embeddings not only accelerates convergence during training but also improves the quality of the samples produced by the model, to some extent delaying the onset of mode collapse.
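
As a rough illustration of what using pre-trained embeddings means for an embedding layer, the sketch below builds an initial embedding matrix from a lookup table of pre-trained vectors and falls back to small random vectors for words the table does not cover. It is a minimal sketch under stated assumptions, not the thesis's implementation: the function name `build_embedding_matrix`, the toy vocabulary, and the three-dimensional vectors are all hypothetical and stand in for real GloVe, spaCy, or Word2Vec vectors.

```python
import random


def build_embedding_matrix(vocab, pretrained, dim, seed=0):
    """Return one vector per vocabulary word, in vocabulary order.

    Words found in `pretrained` reuse their vector; all other words
    get a small random vector of length `dim`.
    """
    rng = random.Random(seed)
    matrix = []
    for word in vocab:
        vector = pretrained.get(word)
        if vector is None:
            # Out-of-vocabulary word: small random initialization.
            vector = [rng.uniform(-0.1, 0.1) for _ in range(dim)]
        matrix.append(list(vector))
    return matrix


# Hypothetical toy values, not real pre-trained vectors.
toy_pretrained = {"hello": [0.1, 0.2, 0.3], "world": [0.4, 0.5, 0.6]}
toy_vocab = ["<pad>", "hello", "world", "liège"]
embedding_matrix = build_embedding_matrix(toy_vocab, toy_pretrained, dim=3)
```

In a real model, a matrix like this would typically be copied into the weights of the embedding layer before training, so that the generator starts from word representations that already encode some semantics.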

In conclusion, despite their limited success in NLP, GAN-trained models offer an interesting approach to training: the generator G can produce different but potentially correct responses and is not penalized for failing to produce the single most likely sequence of words. This mirrors an important characteristic of the human learning process. Overall, this thesis explores proposals made to tackle the drawbacks of the GAN architecture in NLP and opens the door to further progress in the area.
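
The contrast drawn in the conclusion can be made concrete with a toy calculation. The sketch below compares a maximum-likelihood loss, which scores a generator only against one reference response, with the non-saturating generator loss from the original GAN formulation, -log D(G(z)), which depends only on the discriminator's score for the sampled response. Everything here is an assumption made for illustration: the probabilities, the set of acceptable responses, and the hand-written `toy_discriminator` are hypothetical and are not taken from the thesis's models.

```python
import math


def mle_loss(token_probs, reference):
    """Average negative log-likelihood of a single reference sequence."""
    total = sum(math.log(step[token]) for step, token in zip(token_probs, reference))
    return -total / len(reference)


def generator_loss(discriminator_score):
    """Non-saturating GAN generator loss: -log D(G(z))."""
    return -math.log(discriminator_score)


def toy_discriminator(response):
    """Hypothetical discriminator: rates any acceptable reply as likely real."""
    acceptable = {("good", "thanks"), ("fine", "thanks")}
    return 0.9 if tuple(response) in acceptable else 0.1


# The generator prefers "fine thanks"; the dataset reference is "good thanks".
token_probs = [{"fine": 0.9, "good": 0.1}, {"thanks": 1.0}]
reference = ["good", "thanks"]
sampled = ["fine", "thanks"]

reference_nll = mle_loss(token_probs, reference)
adversarial_loss = generator_loss(toy_discriminator(sampled))
```

Under these toy assumptions, the likelihood-based loss is high because the generator put little probability on the reference wording, while the adversarial loss is low because the sampled reply is one the discriminator accepts.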


File(s)

Document(s)

File
Access TFE_scastillo.pdf
Description:
Size: 2.44 MB
Format: Adobe PDF
File
Access abstract_TFE_scastillo(1).pdf
Description:
Size: 66.41 kB
Format: Adobe PDF

Author

  • Castillo Lenz, Sergio Miguel ULiège Université de Liège > Master sc. informatiques, à fin.

Supervisor(s)

Jury member(s)

  • Hiard, Samuel ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Dép. d'électric., électron. et informat. (Inst.Montefiore)
  • Louppe, Gilles ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Big Data
  • Total number of views: 160
  • Total number of downloads: 667










All documents available on MatheO are protected by copyright and subject to the usual rules of fair use.
The University of Liège does not guarantee the scientific quality of these student works or the accuracy of all the information they contain.