Evaluating LLMs on large contexts : a RAG approach on text comprehension
Lu, Benoît
Promotor(s) : Ittoo, Ashwin
Date of defense : 5-Sep-2024/6-Sep-2024 • Permalink : http://hdl.handle.net/2268.2/21150
Details
Title : | Evaluating LLMs on large contexts : a RAG approach on text comprehension |
Author : | Lu, Benoît |
Date of defense : | 5-Sep-2024/6-Sep-2024 |
Advisor(s) : | Ittoo, Ashwin |
Committee's member(s) : | Poumay, Judicaël
Geurts, Pierre |
Language : | English |
Number of pages : | 57 |
Keywords : | [en] Large Language Model [en] Retrieval Augmented Generation [en] Natural Language Processing [en] Context Window [en] Text Comprehension |
Discipline(s) : | Engineering, computing & technology > Computer science |
Target public : | Professionals of domain Student |
Institution(s) : | Université de Liège, Liège, Belgique |
Degree: | Master en science des données, à finalité spécialisée |
Faculty: | Master thesis of the Faculté des Sciences appliquées |
Abstract
[en] While the latest Large Language Models (LLMs) continue to expand in size and context window capacity, their knowledge base remains constrained by their training corpus. Retrieval Augmented Generation (RAG) offers a solution to this limitation by enhancing the LLM’s responses with relevant information retrieved from external sources. In contrast to the rapidly growing context windows, which now extend to millions of tokens, this study evaluates the effectiveness of augmenting prompts as an alternative approach to using large contexts, this is done by evaluating multiple-choice questions originally made for long context settings. By using parts of the context with RAG, I demonstrate that a well-constructed RAG system
can achieve strong performance with significantly reduced token usage. However, the results also reveal challenges related to prompt sensitivity. Despite these challenges, the potential reduction in inference costs due to lower token usage makes this approach particularly appealing, depending on the application
context.
Cite this master thesis
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.