Topic modeling of investment style news.
Boemer, Dominik
Promotor(s) : Ittoo, Ashwin
Date of defense : 24-Aug-2020/8-Sep-2020 • Permalink : http://hdl.handle.net/2268.2/10346
Details
Title : | Topic modeling of investment style news. |
Author : | Boemer, Dominik |
Date of defense : | 24-Aug-2020/8-Sep-2020 |
Advisor(s) : | Ittoo, Ashwin |
Committee's member(s) : | Gillain, Cédric ; Pietquin, John |
Language : | English |
Number of pages : | 140 |
Keywords : | [en] style investing ; [en] news coverage ; [en] topic modeling ; [en] latent Dirichlet allocation |
Discipline(s) : | Business & economic sciences > Finance |
Target public : | Researchers ; Professionals of domain ; Students |
Institution(s) : | Université de Liège, Liège, Belgique |
Degree: | Master en sciences de gestion, à finalité spécialisée en management général (Horaire décalé) |
Faculty: | Master thesis of the HEC-Ecole de gestion de l'Université de Liège |
Abstract
[en] Smart beta exchange-traded funds (ETFs) are increasingly popular investment products among institutional investors. These ETFs can be categorized into different styles depending on the systematic risk factors to which they provide exposure. Hence, the question arises whether certain topics within the news coverage of specific styles influence the investment decision and thereby fund flows towards respective smart beta ETFs. This thesis focuses on partially answering this question by identifying the major topics in investment style news and their importance measured by their frequency of occurrence.
Based on a review of topic models, which are machine learning methods to discover topics in large collections of documents, latent Dirichlet allocation (LDA) is selected to identify the topics in investment style news. Moreover, the most extensive literature survey of LDA in finance (to the best of our knowledge) is compiled in order to optimally apply this method.
Subsequently, the major topics in a unique corpus, which has never before been investigated by topic models (to the best of our knowledge), are identified by LDA. This corpus consists of 1720 articles related to small-cap investing from 9 magazines targeting institutional investors.
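As an illustration of how LDA assigns topic proportions to documents, the following is a minimal sketch of a collapsed Gibbs sampler in plain Python. It is not the implementation used in the thesis: the abstract does not state the inference method or the hyperparameters, so the function `fit_lda`, the values of `alpha`, `beta` and `iterations`, and the toy tokenized documents are all assumptions made for this example.

```python
import random
from collections import Counter


def fit_lda(docs, num_topics, alpha=0.1, beta=0.01, iterations=200, seed=0):
    """Collapsed Gibbs sampling for LDA on tokenized documents.

    alpha, beta and iterations are illustrative defaults, not thesis values.
    Returns (theta, topic_word): per-document topic proportions and
    per-topic word counts.
    """
    rng = random.Random(seed)
    vocab_size = len({w for doc in docs for w in doc})

    doc_topic = [[0] * num_topics for _ in docs]         # tokens of doc d in topic k
    topic_word = [Counter() for _ in range(num_topics)]  # count of word w in topic k
    topic_total = [0] * num_topics                       # tokens assigned to topic k
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the current assignment of this token from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Conditional probability of each topic, up to a constant.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1

    # Smoothed topic proportions of each document (each row sums to 1).
    theta = [
        [(c + alpha) / (len(doc) + num_topics * alpha) for c in counts]
        for counts, doc in zip(doc_topic, docs)
    ]
    return theta, topic_word


def top_words(topic_word, n=3):
    """Most frequent words of each topic, ties broken alphabetically."""
    result = []
    for counts in topic_word:
        ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
        result.append([w for w, c in ranked[:n] if c > 0])
    return result


# Hypothetical toy corpus; the thesis corpus of 1720 articles is not reproduced here.
docs = [
    ["fund", "manager", "launch", "fund"],
    ["index", "etf", "performance", "index"],
    ["fund", "launch", "manager"],
    ["etf", "index", "performance", "etf"],
]
theta, topic_word = fit_lda(docs, num_topics=2)
for row in theta:
    print([round(p, 2) for p in row])
print(top_words(topic_word))
```

The number of topics is fixed in advance, which is why the abstract can compare the results obtained with 5 topics to those obtained with more.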
The 5 major topics are "equity market (economy)", "analyst research, trading and banking", "retirement planning", "indexes, ETFs and performance" and "fund management and fund launches". These topics either persist, disappear or specialize when the number of topics to identify is increased. Dominant topics of individual magazines correspond to those proposed by the corpus specialist and the short descriptions of the magazines. The dominant topic over time is "fund management and fund launches", which follows a seasonal trend characterized by lower coverage at the end of the year and higher coverage in January, thus suggesting that changes of fund management and fund launches preferentially occur at the beginning of the year.
Since the topic proportions of each article are identified, the correlation between the importance of topics over time and corresponding fund flows can be studied in future research.
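One possible way to set up such a study is sketched below, under stated assumptions: the importance of a topic in a given month is taken as the mean of its proportions over the articles published in that month, and its relation to fund flows is measured by the Pearson correlation coefficient. The functions `monthly_topic_importance` and `pearson`, the article records and the fund flow figures are hypothetical and do not come from the thesis.

```python
from collections import defaultdict
from math import sqrt


def monthly_topic_importance(articles, num_topics):
    """Mean topic proportions of the articles published in each month.

    articles is a list of (month, proportions) pairs, where proportions
    holds one value per topic.
    """
    sums = defaultdict(lambda: [0.0] * num_topics)
    counts = defaultdict(int)
    for month, proportions in articles:
        for k, p in enumerate(proportions):
            sums[month][k] += p
        counts[month] += 1
    return {m: [s / counts[m] for s in sums[m]] for m in sorted(sums)}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / (norm_x * norm_y)


# Hypothetical per-article topic proportions (two topics) with publication month.
articles = [
    ("2019-01", [0.8, 0.2]),
    ("2019-01", [0.6, 0.4]),
    ("2019-02", [0.5, 0.5]),
    ("2019-03", [0.2, 0.8]),
    ("2019-03", [0.4, 0.6]),
]
# Hypothetical monthly net fund flows, in arbitrary units.
flows = {"2019-01": 12.0, "2019-02": 8.0, "2019-03": 5.0}

importance = monthly_topic_importance(articles, num_topics=2)
months = sorted(importance)
topic0_series = [importance[m][0] for m in months]
flow_series = [flows[m] for m in months]
r = pearson(topic0_series, flow_series)
print(months, topic0_series, round(r, 3))
```

A correlation computed this way says nothing about causality or timing; a real study would likely also consider lagged flows and the seasonality noted above.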
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.