Deep Learning on Tabular Data

Deep Learning on Tabular Data

Bamboneyeho, Sonny

Date of defense : 8-Sep-2025/9-Sep-2025 • Permalink : `http://hdl.handle.net/2268.2/24644`

Details

Title :	Deep Learning on Tabular Data
Translated title :	[fr] Apprentissage profond sur des données tabulaires
Author :	Bamboneyeho, Sonny
Date of defense :	8-Sep-2025/9-Sep-2025
Advisor(s) :	Geurts, Pierre
Committee's member(s) :	Louppe, Gilles Huynh-Thu, Vân Anh Marée, Raphaël
Language :	English
Number of pages :	52
Keywords :	[en] Machine Learning [en] Deep Learning [en] Artifical Intelligence [en] Tabular Data
Discipline(s) :	Engineering, computing & technology > Computer science
Target public :	Researchers Professionals of domain Student
Institution(s) :	Université de Liège, Liège, Belgique
Degree:	Master : ingénieur civil en science des données, à finalité spécialisée
Faculty:	Master thesis of the Faculté des Sciences appliquées

Abstract

[en] This thesis examines the capacity of deep learning models to handle tabular data and whether they can surpass traditional methods. Despite the development of deep learning, applying it to tabular data has been less explored. Also, tabular data generation has been investigated to try to improve deep learning performance on tabular data. A related work chapter has been written to review what has already been done for deep learning on tabular data and tabular data generation. Then, key concepts have been defined to clarify the theoretical framework. A special focus has been made on the limitations of deep learning on tabular data. The datasets in this thesis are all designed for regression tasks with numerical features. The methodologies include the explanation of each deep learning model chosen for this thesis, the interpretability tool SHAP (Shapley Additive Explanations), and the tabular data generation method using Large Language Models (LLMs). Models were ranked for each dataset, and an average rank across datasets was obtained. The results proved that the traditional methods outperformed the majority of deep learning models. However, some deep learning models were close to them proving their potential. The SHAP analysis provided insights into the performance of the models by highlighting which features contributed most to their predictions. Generating tabular data with LLMs has been tested. The results are dependent on the dataset used, meaning that performance can improve or deteriorate. To conclude, deep learning can be a viable alternative to traditional methods, but it has limitations, particularly computational.

File(s)

Document(s)

tfe_s202657.pdf
Description:
Size: 2.76 MB
Format: Adobe PDF

Ask a request copy

Annexe(s)

tfe_s202657_images.zip
Description:
Size: 878.27 kB
Format: Unknown

Ask a request copy

tfe_s202657_summary_EN_FR.pdf
Description:
Size: 75.89 kB
Format: Adobe PDF

Ask a request copy

Cite this master thesis

All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.

MASTER THESIS

Deep Learning on Tabular Data

Bamboneyeho, Sonny

Promotor(s) : Geurts, Pierre

Date of defense : 8-Sep-2025/9-Sep-2025 • Permalink : http://hdl.handle.net/2268.2/24644

Details

Abstract

File(s)

Document(s)

Annexe(s)

Author

Promotor(s)

Committee's member(s)

Cite this master thesis

Date of defense : 8-Sep-2025/9-Sep-2025 • Permalink : `http://hdl.handle.net/2268.2/24644`