Feedback

Faculté des Sciences appliquées
Faculté des Sciences appliquées
MASTER THESIS
VIEW 79 | DOWNLOAD 1

Master's Thesis : Coordination on the battlefield by multi-agent reinforcement learning.

Download
Fombellida-Lopez, Arnaud ULiège
Promotor(s) : Ernst, Damien ULiège
Date of defense : 25-Jun-2020/26-Jun-2020 • Permalink : http://hdl.handle.net/2268.2/9072
Details
Title : Master's Thesis : Coordination on the battlefield by multi-agent reinforcement learning.
Author : Fombellida-Lopez, Arnaud ULiège
Date of defense  : 25-Jun-2020/26-Jun-2020
Advisor(s) : Ernst, Damien ULiège
Committee's member(s) : Geurts, Pierre ULiège
Wehenkel, Louis ULiège
Leroy, Pascal ULiège
Vanlerberghe, Shani 
Language : English
Discipline(s) : Engineering, computing & technology > Computer science
Institution(s) : Université de Liège, Liège, Belgique
Degree: Master en science des données, à finalité spécialisée
Faculty: Master thesis of the Faculté des Sciences appliquées

Abstract

[en] This study investigates Multi-Agent Reinforcement Learning algorithms that improve agent cooperation and coordination through agent-to-agent communication. The basics of Reinforcement Learning are presented after introducing the IRIS project and the high-level objectives of this work. Multi-Agent Reinforcement Learning's State-of-the-Art methods are then presented with a focus on algorithms which allow agents to learn to communicate. An algorithm designed to allow use different methods to aggregate messages and that is able to learn when to send message is presented. Moreover, the possibility to leverage Multi-Agent critics is shown. Within the Starcraft II Multi-Agent Challenge, variations of the communicating algorithm are tested on a hand-crafted scenario. Various performance and behavior metrics are analyzed and compared with a non-communicating algorithm. Results show that, although final performance is similar, allowing agents to communicate increases the learning rate and leads to behaviorally different agents.


File(s)

Document(s)

File
Access Master Thesis - Fombellida.pdf
Description: -
Size: 18.21 MB
Format: Adobe PDF

Annexe(s)

File
Access Code.zip
Description: -
Size: 102.57 MB
Format: Unknown

Author

  • Fombellida-Lopez, Arnaud ULiège Université de Liège > Mast. sc. don. à fin.

Promotor(s)

Committee's member(s)

  • Geurts, Pierre ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Algorith. des syst. en interaction avec le monde physique
    ORBi View his publications on ORBi
  • Wehenkel, Louis ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Méthodes stochastiques
    ORBi View his publications on ORBi
  • Leroy, Pascal ULiège Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Smart grids
    ORBi View his publications on ORBi
  • Vanlerberghe, Shani John Cockerill Defence
  • Total number of views 79
  • Total number of downloads 1










All documents available on MatheO are protected by copyright and subject to the usual rules for fair use.
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.