Use of Flink Hybrid sources to initialise transformations
Azoury, Elie Junior
Promoteur(s) : Debruyne, Christophe
Date de soutenance : 24-jui-2024/25-jui-2024 • URL permanente : http://hdl.handle.net/2268.2/20383
Détails
Titre : | Use of Flink Hybrid sources to initialise transformations |
Auteur : | Azoury, Elie Junior |
Date de soutenance : | 24-jui-2024/25-jui-2024 |
Promoteur(s) : | Debruyne, Christophe |
Membre(s) du jury : | Fontaine, Pascal
Leduc, Guy Bruggeman, Jehan |
Langue : | Anglais |
Mots-clés : | [en] Apache Flink, Flink Hybrid Sources, Data transformation, Digazu, Stream processing, Batch processing |
Discipline(s) : | Ingénierie, informatique & technologie > Sciences informatiques |
Public cible : | Chercheurs Professionnels du domaine Etudiants Grand public |
Institution(s) : | Université de Liège, Liège, Belgique |
Diplôme : | Master en sciences informatiques, à finalité spécialisée en "computer systems security" |
Faculté : | Mémoires de la Faculté des Sciences appliquées |
Résumé
[en] This master's thesis explores using Apache Flink's HybridSource technology to enhance the data transformation processes within the Digazu framework, a data processing platform developed by EuraNova. The thesis addresses the complexities of transitioning from batch to stream processing, traditionally managed through custom orchestration solutions. By leveraging HybridSource, which facilitates seamless data ingestion from multiple heterogeneous sources, the study aims to reduce orchestration code complexity, optimize resource usage, and improve latency performance.
The research involves a comprehensive literature review on streaming frameworks, a technical analysis of the HybridSource implementation in Apache Flink, and empirical validation through proof-of-concept experiments. By practically implementing HybridSource within Digazu's infrastructure, the study evaluates its feasibility and effectiveness in streamlining data transformation processes. This investigation demonstrates significant reductions in orchestration code complexity and optimized resource usage and highlights improvements in latency performance.
Fichier(s)
Document(s)
Citer ce mémoire
L'Université de Liège ne garantit pas la qualité scientifique de ces travaux d'étudiants ni l'exactitude de l'ensemble des informations qu'ils contiennent.