Deep Learning on Tabular Data
Bamboneyeho, Sonny
Promotor(s) :
Geurts, Pierre
Date of defense : 8-Sep-2025/9-Sep-2025 • Permalink : http://hdl.handle.net/2268.2/24644
Details
| Title : | Deep Learning on Tabular Data |
| Translated title : | [fr] Apprentissage profond sur des données tabulaires |
| Author : | Bamboneyeho, Sonny
|
| Date of defense : | 8-Sep-2025/9-Sep-2025 |
| Advisor(s) : | Geurts, Pierre
|
| Committee's member(s) : | Louppe, Gilles
Huynh-Thu, Vân Anh
Marée, Raphaël
|
| Language : | English |
| Number of pages : | 52 |
| Keywords : | [en] Machine Learning [en] Deep Learning [en] Artifical Intelligence [en] Tabular Data |
| Discipline(s) : | Engineering, computing & technology > Computer science |
| Target public : | Researchers Professionals of domain Student |
| Institution(s) : | Université de Liège, Liège, Belgique |
| Degree: | Master : ingénieur civil en science des données, à finalité spécialisée |
| Faculty: | Master thesis of the Faculté des Sciences appliquées |
Abstract
[en] This thesis examines the capacity of deep learning models to handle tabular data and whether they can surpass traditional methods. Despite the development of deep learning, applying it to tabular data has been less explored. Also, tabular data generation has been investigated to try to improve deep learning performance on tabular data. A related work chapter has been written to review what has already been done for deep learning on tabular data and tabular data generation. Then, key concepts have been defined to clarify the theoretical framework. A special focus has been made on the limitations of deep learning on tabular data. The datasets in this thesis are all designed for regression tasks with numerical features. The methodologies include the explanation of each deep learning model chosen for this thesis, the interpretability tool SHAP (Shapley Additive Explanations), and the tabular data generation method using Large Language Models (LLMs). Models were ranked for each dataset, and an average rank across datasets was obtained. The results proved that the traditional methods outperformed the majority of deep learning models. However, some deep learning models were close to them proving their potential. The SHAP analysis provided insights into the performance of the models by highlighting which features contributed most to their predictions. Generating tabular data with LLMs has been tested. The results are dependent on the dataset used, meaning that performance can improve or deteriorate. To conclude, deep learning can be a viable alternative to traditional methods, but it has limitations, particularly computational.
File(s)
Document(s)
Annexe(s)
Cite this master thesis
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.

Master Thesis Online


tfe_s202657.pdf