Faculté des Sciences appliquées
Faculté des Sciences appliquées

Term extraction from domain specific texts

Poumay, Judicaël ULiège
Promotor(s) : Ittoo, Ashwin ULiège
Date of defense : 9-Sep-2019/10-Sep-2019 • Permalink :
Title : Term extraction from domain specific texts
Author : Poumay, Judicaël ULiège
Date of defense  : 9-Sep-2019/10-Sep-2019
Advisor(s) : Ittoo, Ashwin ULiège
Committee's member(s) : Jamar, Julie ULiège
Gribomont, Pascal ULiège
Language : English
Number of pages : 36
Keywords : [en] term extraction
[en] terminology extraction
[en] financial text
[en] information extraction
[en] abbreviation extraction
[en] long term
[en] complex terminology
[en] multi word term
[en] termhood
[en] unithood
[en] unsupervised
Discipline(s) : Engineering, computing & technology > Computer science
Target public : Researchers
Professionals of domain
Institution(s) : Université de Liège, Liège, Belgique
Degree: Master en science des données, à finalité spécialisée
Faculty: Master thesis of the Faculté des Sciences appliquées


[en] In the thesis, we developed a novel unsupervised algorithm for terminology extraction (TE).
TE consists in detecting and ranking possible terms from a given document. While a term is a sequence of words that refers to a particular concept in a given domain.
This thesis also brings with it two other ancillary contributions. A new relevancy measure for term ranking; which uses a mix of a termhood, a unithood, and a noise measure to provide a reliable score. And an abbreviation extractor which discovers and extracts the extended form of abbreviated terms using a simple heuristic.
Many algorithms already exist for extracting terms but they have limitations. Primarily, we found that no current method was capable of reliably extracting long and complex terminology. Therefore, the algorithm we proposed was designed to handle such task.



Access Erratum_Master_thesis_2019.pdf
Description: -
Size: 712.45 kB
Format: Adobe PDF
Access Master_thesis_2019.pdf
Description: -
Size: 753.17 kB
Format: Adobe PDF
Access summary_TFE_2019.pdf
Description: -
Size: 51.27 kB
Format: Adobe PDF


  • Poumay, Judicaël ULiège Université de Liège > Bac. sc. info.


Committee's member(s)

  • Jamar, Julie ULiège Université de Liège - ULiège > HEC Liège : UER > UER Opérations
    ORBi View his publications on ORBi
  • Gribomont, Pascal ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique et intelligence artificielle
    ORBi View his publications on ORBi
  • Total number of views 71
  • Total number of downloads 522

All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.