Learning to Avoid Risky Actions

Malfaz Vázquez, María Ángeles; Salichs Sánchez-Caballero, Miguel

dc.affiliation.dpto	UC3M. Departamento de Ingeniería de Sistemas y Automática	es
dc.affiliation.grupoinv	UC3M. Grupo de Investigación: Laboratorio de Robótica (Robotics Lab)	es
dc.contributor.author	Malfaz Vázquez, María Ángeles
dc.contributor.author	Salichs Sánchez-Caballero, Miguel
dc.date.accessioned	2014-03-27T12:33:43Z
dc.date.available	2014-03-27T12:33:43Z
dc.date.issued	2011-12
dc.description.abstract	When a reinforcement learning agent executes actions that can cause frequent damage to itself, it can learn, by using Q-learning, that these actions must not be executed again. However, there are other actions that do not cause damage frequently but only once in a while, for example, risky actions such as parachuting. These actions may imply punishment to the agent and, depending on its personality, it would be better to avoid them. Nevertheless, using the standard Q-learning algorithm, the agent is not able to learn to avoid them, because the result of these actions can be positive on average. In this article, an additional mechanism of Q-learning, inspired by the emotion of fear, is introduced in order to deal with those risky actions by considering the worst results. Moreover, there is a daring factor for adjusting the consideration of the risk. This mechanism is implemented on an autonomous agent living in a virtual environment. The results present the performance of the agent with different daring degrees.	en
dc.description.sponsorship	Funding was provided by the Spanish Government through the project “A New Approach to Social Robotics” (AROS) of MICINN (Ministry of Science and Innovation), and through the RoboCity2030-IICM project (S2009/DPI-1559), funded by Programas de Actividades I+D en la Comunidad de Madrid and co-funded by Structural Funds of the EU.	en
dc.format.extent	22
dc.format.mimetype	application/pdf
dc.identifier.bibliographicCitation	Cybernetics and Systems: An International Journal, 2011, vol. 42 (8), pp. 636-658	en
dc.identifier.doi	10.1080/01969722.2011.634681
dc.identifier.issn	0196-9722 (print)
dc.identifier.issn	1087-6553 (online)
dc.identifier.publicationfirstpage	636
dc.identifier.publicationissue	8
dc.identifier.publicationlastpage	658
dc.identifier.publicationtitle	Cybernetics and Systems: An International Journal	en
dc.identifier.publicationvolume	42
dc.identifier.uri	https://hdl.handle.net/10016/18621
dc.identifier.uxxi	AR/0000009819
dc.language.iso	eng
dc.publisher	Taylor & Francis Group	en
dc.relation.projectID	Comunidad de Madrid. S2009/DPI-1559/ROBOCITY2030 II	es
dc.relation.publisherversion	http://dx.doi.org/10.1080/01969722.2011.634681
dc.rights.accessRights	open access
dc.subject.eciencia	Robótica e Informática Industrial	es
dc.subject.other	Autonomous agent	en
dc.subject.other	Decision making system	en
dc.subject.other	Fear	en
dc.subject.other	Reinforcement learning	en
dc.subject.other	Risky actions	en
dc.title	Learning to Avoid Risky Actions	en
dc.type	research article	*
dc.type.hasVersion	AM	*
dspace.entity.type	Publication
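
The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: the class name `FearfulQLearner`, the rule of tracking the worst observed outcome with `min`, the linear blend controlled by `daring`, and the reward values are all assumptions chosen for illustration. The sketch keeps a standard Q-table alongside a worst-case table and ranks actions by a weighted mix of the two.

```python
from collections import defaultdict


class FearfulQLearner:
    """Hypothetical sketch: Q-learning plus a worst-case table and a daring factor."""

    def __init__(self, actions, alpha=0.05, gamma=0.9, daring=1.0):
        self.actions = list(actions)
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.daring = daring    # 1.0 ignores risk, 0.0 looks only at the worst case
        self.q = defaultdict(float)   # standard (average) action values
        self.q_worst = {}             # worst outcome seen per (state, action)

    def worst(self, state, action):
        # Unseen pairs default to 0.0 (an assumption of this sketch).
        return self.q_worst.get((state, action), 0.0)

    def update(self, state, action, reward, next_state=None):
        if next_state is None:  # terminal transition
            future = future_worst = 0.0
        else:
            future = max(self.q[(next_state, b)] for b in self.actions)
            future_worst = max(self.worst(next_state, b) for b in self.actions)
        key = (state, action)
        # Standard Q-learning update toward the bootstrapped target.
        self.q[key] += self.alpha * (reward + self.gamma * future - self.q[key])
        # Worst-case table keeps the lowest target observed so far.
        target_worst = reward + self.gamma * future_worst
        self.q_worst[key] = min(self.q_worst.get(key, target_worst), target_worst)

    def score(self, state, action):
        # Assumed blend of the average value and the worst observed value.
        d = self.daring
        return d * self.q[(state, action)] + (1.0 - d) * self.worst(state, action)

    def choose(self, state):
        return max(self.actions, key=lambda a: self.score(state, a))


def train(daring, steps=500):
    agent = FearfulQLearner(["safe", "risky"], daring=daring)
    for step in range(steps):
        agent.update("s", "safe", 1.0)
        # Risky action: usually +10, every tenth try -30 (positive on average).
        agent.update("s", "risky", -30.0 if step % 10 == 9 else 10.0)
    return agent


bold = train(daring=1.0)
cautious = train(daring=0.5)
print(bold.choose("s"), cautious.choose("s"))
```

With `daring=1.0` the agent reduces to plain Q-learning and prefers the risky action because its average payoff is higher; lowering `daring` lets the rare punishment dominate the score, so the safe action wins.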

Files

Original bundle

Name: learning_CSIJ_2011_ps.pdf
Size: 666.41 KB
Format: Adobe Portable Document Format

Collections

DISA - LR - Artículos de Revistas