Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models

Martínez Gil, Francisco; Lozano, Miguel; Fernández Rebollo, Fernando

Publication:
Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models

dc.affiliation.dpto	UC3M. Departamento de Informática	es
dc.affiliation.grupoinv	UC3M. Grupo de Investigación: Planificación y Aprendizaje	es
dc.contributor.author	Martínez Gil, Francisco
dc.contributor.author	Lozano, Miguel
dc.contributor.author	Fernández Rebollo, Fernando
dc.contributor.funder	Ministerio de Economía y Competitividad (España)	es
dc.date.accessioned	2020-01-15T16:45:38Z
dc.date.available	2020-01-15T16:45:38Z
dc.date.issued	2017-05-01
dc.description.abstract	This paper analyzes the emergent behaviors of pedestrian groups that learn through the multiagent reinforcement learning model developed in our group. Five scenarios studied in the pedestrian model literature, and with different levels of complexity, were simulated in order to analyze the robustness and the scalability of the model. Firstly, a reduced group of agents must learn by interaction with the environment in each scenario. In this phase, each agent learns its own kinematic controller, that will drive it at a simulation time. Secondly, the number of simulated agents is increased, in each scenario where agents have previously learnt, to test the appearance of emergent macroscopic behaviors without additional learning. This strategy allows us to evaluate the robustness and the consistency and quality of the learned behaviors. For this purpose several tools from pedestrian dynamics, such as fundamental diagrams and density maps, are used. The results reveal that the developed model is capable of simulating human-like micro and macro pedestrian behaviors for the simulation scenarios studied, including those where the number of pedestrians has been scaled by one order of magnitude with respect to the situation learned.	es
dc.description.sponsorship	This work has been supported by grant TIN2015-65686-C5-1-R of Ministerio de Economía y Competitividad.	en
dc.identifier.bibliographicCitation	F. Martínez, M. A. Lozano, F. Fernández. (2017). Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models. Simulation Modelling Practice and Theory, 74, pp. 117-133	en
dc.identifier.doi	https://doi.org/10.1016/j.simpat.2017.03.003
dc.identifier.issn	1569-190X
dc.identifier.publicationfirstpage	117
dc.identifier.publicationlastpage	133
dc.identifier.publicationtitle	Simulation Modelling Practice and theory	en
dc.identifier.publicationvolume	74
dc.identifier.uri	https://hdl.handle.net/10016/29470
dc.identifier.uxxi	AR/0000020003
dc.language.iso	eng	es
dc.publisher	Elsevier	en
dc.relation.projectID	Gobierno de España. TIN2015-65686-C5-1-R	es
dc.rights	© 2017 Elsevier B.V. All rights reserved.	es
dc.rights	Atribución-NoComercial-SinDerivadas 3.0 España	*
dc.rights.accessRights	open access	en
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/es/	*
dc.subject.eciencia	Informática	es
dc.subject.other	Pedestrian simulation and modeling	en
dc.subject.other	Multi-agent reinforcement learning (Marl)	en
dc.subject.other	Behavioural simulation	en
dc.subject.other	Emergent behaviours	en
dc.title	Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models	en
dc.type	research article	*
dc.type.hasVersion	AM	*
dspace.entity.type	Publication