Master's Thesis : Coordination on the battlefield by multi-agent reinforcement learning.
Fombellida-Lopez, Arnaud
Promotor(s) : Ernst, Damien
Date of defense : 25-Jun-2020/26-Jun-2020 • Permalink : http://hdl.handle.net/2268.2/9072
Details
Title : | Master's Thesis : Coordination on the battlefield by multi-agent reinforcement learning. |
Author : | Fombellida-Lopez, Arnaud |
Date of defense : | 25-Jun-2020/26-Jun-2020 |
Advisor(s) : | Ernst, Damien |
Committee's member(s) : | Geurts, Pierre
Wehenkel, Louis Leroy, Pascal Vanlerberghe, Shani |
Language : | English |
Discipline(s) : | Engineering, computing & technology > Computer science |
Institution(s) : | Université de Liège, Liège, Belgique |
Degree: | Master en science des données, à finalité spécialisée |
Faculty: | Master thesis of the Faculté des Sciences appliquées |
Abstract
[en] This study investigates Multi-Agent Reinforcement Learning algorithms that improve agent cooperation and coordination through agent-to-agent communication. The basics of Reinforcement Learning are presented after introducing the IRIS project and the high-level objectives of this work. Multi-Agent Reinforcement Learning's State-of-the-Art methods are then presented with a focus on algorithms which allow agents to learn to communicate. An algorithm designed to allow use different methods to aggregate messages and that is able to learn when to send message is presented. Moreover, the possibility to leverage Multi-Agent critics is shown. Within the Starcraft II Multi-Agent Challenge, variations of the communicating algorithm are tested on a hand-crafted scenario. Various performance and behavior metrics are analyzed and compared with a non-communicating algorithm. Results show that, although final performance is similar, allowing agents to communicate increases the learning rate and leads to behaviorally different agents.
File(s)
Document(s)
Description: -
Size: 18.21 MB
Format: Adobe PDF
Annexe(s)
Description: -
Size: 102.57 MB
Format: Unknown
Cite this master thesis
The University of Liège does not guarantee the scientific quality of these students' works or the accuracy of all the information they contain.