Español English Contacte con nosotros http://www.uc3m.es/portal/page/portal/biblioteca
DSpace e-Archivo

Archivo Abierto Institucional de la Universidad Carlos III de Madrid > Investigación > Departamentos > Departamento de Informática > Grupo de Investigación en Planificación y Aprendizaje Automático (PLG) > DI - PLG - Artículos de Revistas >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/6795

Google™ Scholar. Others By: Fernández, Fernando - Borrajo, Daniel - Parker, Lynne E.
Files in This Item:
reinforcement_fernandez_JIRS_2005_ps.pdf636,52 kBAdobe PDFformato pdf
Title: A reinforcement learning algorithm in cooperative multi-robot domains
Author(s): Fernández, Fernando
Borrajo, Daniel
Parker, Lynne E.
Publisher: Springer
Issued date: Aug-2005
Citation: Journal of Intelligent and Robotic Systems, August 2005, vol. 43, n. 2-4, p. 161-174
URI: http://hdl.handle.net/10016/6795
ISSN: 0921-0296 (Print)
1573-0409 (Online)
DOI: http://dx.doi.org/10.1007/s10846-005-5137-x
Abstract: Reinforcement learning has been widely applied to solve a diverse set of learning tasks, from board games to robot behaviours. In some of them, results have been very successful, but some tasks present several characteristics that make the application of reinforcement learning harder to define. One of these areas is multi-robot learning, which has two important problems. The first is credit assignment, or how to define the reinforcement signal to each robot belonging to a cooperative team depending on the results achieved by the whole team. The second one is working with large domains, where the amount of data can be large and different in each moment of a learning step. This paper studies both issues in a multi-robot environment, showing that introducing domain knowledge and machine learning algorithms can be combined to achieve successful cooperative behaviours.
Sponsor: This work has been partially funded by grants This work has been partially funded by grants from Spanish Science and Technology Department number TAP1999-0535-C02-02, and TIC2002-04146-C05-05.
Review: PeerReviewed
Publisher version: http://dx.doi.org/10.1007/s10846-005-5137-x
Keywords: Reinforcement learning
Function approximation
State space discretizations
Collaborative multi-robot domains
Rights: © Springer
Appears in Collections:DI - PLG - Artículos de Revistas

Refworks Export

SFX Query

Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! © Universidad Carlos III de Madrid - Software DSpace - Terms of use - Feedback