Feedback

Faculté des Sciences appliquées
Faculté des Sciences appliquées
MASTER THESIS

Deep Learning on Tabular Data

Download
Bamboneyeho, Sonny ULiège
Promotor(s) : Geurts, Pierre ULiège
Date of defense : 8-Sep-2025/9-Sep-2025 • Permalink : http://hdl.handle.net/2268.2/24644
Details
Title : Deep Learning on Tabular Data
Translated title : [fr] Apprentissage profond sur des données tabulaires
Author : Bamboneyeho, Sonny ULiège
Date of defense  : 8-Sep-2025/9-Sep-2025
Advisor(s) : Geurts, Pierre ULiège
Committee's member(s) : Louppe, Gilles ULiège
Huynh-Thu, Vân Anh ULiège
Marée, Raphaël ULiège
Language : English
Number of pages : 52
Keywords : [en] Machine Learning
[en] Deep Learning
[en] Artifical Intelligence
[en] Tabular Data
Discipline(s) : Engineering, computing & technology > Computer science
Target public : Researchers
Professionals of domain
Student
Institution(s) : Université de Liège, Liège, Belgique
Degree: Master : ingénieur civil en science des données, à finalité spécialisée
Faculty: Master thesis of the Faculté des Sciences appliquées

Abstract

[en] This thesis examines the capacity of deep learning models to handle tabular data and whether they can surpass traditional methods. Despite the development of deep learning, applying it to tabular data has been less explored. Also, tabular data generation has been investigated to try to improve deep learning performance on tabular data. A related work chapter has been written to review what has already been done for deep learning on tabular data and tabular data generation. Then, key concepts have been defined to clarify the theoretical framework. A special focus has been made on the limitations of deep learning on tabular data. The datasets in this thesis are all designed for regression tasks with numerical features. The methodologies include the explanation of each deep learning model chosen for this thesis, the interpretability tool SHAP (Shapley Additive Explanations), and the tabular data generation method using Large Language Models (LLMs). Models were ranked for each dataset, and an average rank across datasets was obtained. The results proved that the traditional methods outperformed the majority of deep learning models. However, some deep learning models were close to them proving their potential. The SHAP analysis provided insights into the performance of the models by highlighting which features contributed most to their predictions. Generating tabular data with LLMs has been tested. The results are dependent on the dataset used, meaning that performance can improve or deteriorate. To conclude, deep learning can be a viable alternative to traditional methods, but it has limitations, particularly computational.


File(s)

Document(s)

File
Access tfe_s202657.pdf
Description:
Size: 2.76 MB
Format: Adobe PDF

Annexe(s)

File
Access tfe_s202657_images.zip
Description:
Size: 878.27 kB
Format: Unknown
File
Access tfe_s202657_summary_EN_FR.pdf
Description:
Size: 75.89 kB
Format: Adobe PDF

Author

  • Bamboneyeho, Sonny ULiège Université de Liège > Mast. ing. civ. sc. don. fin. spéc.

Promotor(s)

Committee's member(s)

  • Louppe, Gilles ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Big Data
    ORBi View his publications on ORBi
  • Huynh-Thu, Vân Anh ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique
    ORBi View his publications on ORBi
  • Marée, Raphaël ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Méthodes stochastiques
    ORBi View his publications on ORBi








All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.