ChatBot with GANs

ChatBot with GANs

Castillo Lenz, Sergio Miguel

Date de soutenance : 22-jan-2021 • URL permanente : `http://hdl.handle.net/2268.2/11395`

Détails

Titre :	ChatBot with GANs
Titre traduit :	[fr] ChatBot avec GANs
Auteur :	Castillo Lenz, Sergio Miguel
Date de soutenance :	22-jan-2021
Promoteur(s) :	Ittoo, Ashwin
Membre(s) du jury :	Hiard, Samuel Louppe, Gilles
Langue :	Anglais
Mots-clés :	[en] machine learning [en] gan [en] auto-encoders [en] daily dialog [en] deep learning
Discipline(s) :	Ingénierie, informatique & technologie > Sciences informatiques
Public cible :	Chercheurs Professionnels du domaine Etudiants
Institution(s) :	Université de Liège, Liège, Belgique
Diplôme :	Master en sciences informatiques, à finalité spécialisée en "intelligent systems"
Faculté :	Mémoires de la Faculté des Sciences appliquées

Résumé

[en] Since its introduction in 2014 [Goodfellow et al., 2014], the architecture of Generative Adversarial Networks (GANs) have experienced various evolutions to reach its current state where it is capable to recreate realistic images of any given context. Those improvements, both in terms of complexity and stability, enabled successful applications of GANs frameworks in the field of computer vision and transfer learning. On the other hand, GANs lack of successful applications within the field of Natural Language Processing (NLP) where models based on Transformers architecture, such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Training (GPT), remain the current state-of-the-art for various NLP tasks.

Given this current situation, this thesis investigates why GANs remain underused for NLP tasks. As such, we explore some researchers’ proposals within the area of Dialog Systems by using data from the Daily Dialog dataset, a human-written and multi-turn dialog set reflecting daily human communication.

Moreover, we investigate the influence of an embedding layer of the proposed GAN models. In order to do so first, we test pre-trained “word-level” embeddings, such as Stanford's Glove and Spacy embeddings.

Second, we train the model by using our own word embeddings coming from the Daily Dialog dataset. The Word2Vec algorithm is used in this case. Third, we explore the idea of using BERT as a contextualized word embeddings. From these experiments it was observed that the use of pre-trained embeddings, not only accelerates the convergence during the training but also, improves the quality of the produced samples by the model, to some extents avoiding an early arrival of mode collapse.

In conclusion, despite their limited success in the NLP area, GAN-trained models offer an interesting approach during the training phase, as the generator G is able to produce different but potentially correct response samples and is not penalized by not producing the most likely single correct sequence of words. This actually follows an important characteristic of the human learning process. Overall, this thesis successfully explores propositions made to tackle drawbacks of the GAN architecture within the NLP area and opens doors for critical progresses in the area.

Fichier(s)

Document(s)

TFE_scastillo.pdf
Description:
Taille: 2.44 MB
Format: Adobe PDF

abstract_TFE_scastillo(1).pdf
Description:
Taille: 66.41 kB
Format: Adobe PDF

Citer ce mémoire

Tous les documents disponibles sur MatheO sont protégés par le droit d'auteur et soumis aux règles habituelles de bon usage.
L'Université de Liège ne garantit pas la qualité scientifique de ces travaux d'étudiants ni l'exactitude de l'ensemble des informations qu'ils contiennent.

Mémoire

ChatBot with GANs

Castillo Lenz, Sergio Miguel

Promoteur(s) : Ittoo, Ashwin

Date de soutenance : 22-jan-2021 • URL permanente : http://hdl.handle.net/2268.2/11395

Détails

Résumé

Fichier(s)

Document(s)

Auteur

Promoteur(s)

Membre(s) du jury

Citer ce mémoire

Date de soutenance : 22-jan-2021 • URL permanente : `http://hdl.handle.net/2268.2/11395`