Feedback

Faculté des Sciences appliquées
Faculté des Sciences appliquées
MASTER THESIS
VIEW 8075 | DOWNLOAD 4445

Master thesis : Automatic Multispeaker Voice Cloning

Download
Jemine, Corentin ULiège
Promotor(s) : Louppe, Gilles ULiège
Date of defense : 26-Jun-2019/27-Jun-2019 • Permalink : http://hdl.handle.net/2268.2/6801
Details
Title : Master thesis : Automatic Multispeaker Voice Cloning
Translated title : [fr] Clonage de la voix en temps réel
Author : Jemine, Corentin ULiège
Date of defense  : 26-Jun-2019/27-Jun-2019
Advisor(s) : Louppe, Gilles ULiège
Committee's member(s) : Geurts, Pierre ULiège
Fonteneau, Raphaël ULiège
Language : English
Number of pages : 37
Keywords : [fr] voix
[fr] audio
[fr] text-to-speech
[fr] tts
[fr] neurone
[fr] réseau
[fr] deep
[fr] deep learning
[fr] machine learning
[fr] transfert
[fr] generation
[en] voice
[en] audio
[en] transfer
[en] generation
[en] text-to-speech
[en] tts
[en] neural
[en] network
[en] deep
[en] deep learning
[en] machine learning
Discipline(s) : Engineering, computing & technology > Computer science
Target public : Professionals of domain
Student
General public
Institution(s) : Université de Liège, Liège, Belgique
Degree: Master en science des données, à finalité spécialisée
Faculty: Master thesis of the Faculté des Sciences appliquées

Abstract

[en] Recent advances in deep learning have shown impressive results in the domain of text-to-speech. To this end, a deep neural network is usually trained using a corpus of several hours of professionally recorded speech from a single speaker. Giving a new voice to such a model is highly expensive, as it requires recording a new dataset and retraining the model. A recent research introduced a three-stage pipeline that allows to clone a voice unseen during training from only a few seconds of reference speech, and without retraining the model. The authors share remarkably natural-sounding results, but provide no implementation. We reproduce this framework and open-source the first public implementation of it. We adapt the framework with a newer vocoder model, so as to make it run in real-time.


File(s)

Document(s)

File
Access s123578Jemine2019.pdf
Description:
Size: 2.61 MB
Format: Adobe PDF

Annexe(s)

File
Access summary_s123578Jemine2019.pdf
Description:
Size: 64.22 kB
Format: Adobe PDF

Author

  • Jemine, Corentin ULiège Université de Liège > Bac. sc. info.

Promotor(s)

Committee's member(s)

  • Geurts, Pierre ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Algorith. des syst. en interaction avec le monde physique
    ORBi View his publications on ORBi
  • Fonteneau, Raphaël ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Dép. d'électric., électron. et informat. (Inst.Montefiore)
    ORBi View his publications on ORBi
  • Total number of views 8075
  • Total number of downloads 4445










All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.