Please use this identifier to cite or link to this item:
http://hdl.handle.net/10016/7369
| Title: | VQQL. Applying vector quantization to reinforcement learning |
| Author(s): | Fernández, Fernando; Borrajo, Daniel |
| Publisher: | Springer |
| Issued date: | 2000 |
| Citation: | RoboCup-99: Robot Soccer World Cup III, Springer-Verlag, Stockholm (Sweden), 2000, p. 49-57 |
| URI: | http://hdl.handle.net/10016/7369 |
| ISBN: | 978-3-540-41043-0 |
| ISSN: | 0302-9743 (Print) 1611-3349 (Online) |
| DOI: | http://dx.doi.org/10.1007/3-540-45327-X_24 |
| Description: | Proceedings of: RoboCup-99: Robot Soccer World Cup III, July 27 to August 6, 1999, Stockholm, Sweden |
| Abstract: | Reinforcement learning has proven to be a set of successful techniques for finding optimal policies in uncertain and/or dynamic domains, such as the RoboCup. One of the problems in using such techniques arises with large state and action spaces, as is the case with input information coming from the Robosoccer simulator. In this paper, we describe a new mechanism for solving the state generalization problem in reinforcement learning algorithms. This clustering mechanism is based on the vector quantization technique for analog-to-digital signal conversion and compression, and on the Generalized Lloyd Algorithm for the design of vector quantizers. Furthermore, we present the VQQL model, which integrates Q-Learning as the reinforcement learning technique and vector quantization as the state generalization technique. We show some results of applying this model to learning the interception skill for Robosoccer agents. |
| Review: | PeerReviewed |
| Series / No.: | Lecture Notes in Computer Science 1856/2000 |
| Publisher version: | http://dx.doi.org/10.1007/3-540-45327-X_24 |
| Rights: | © Springer-Verlag Berlin Heidelberg |
| Appears in Collections: | DI - PLG - Capítulos de Monografías; DI - PLG - Comunicaciones en Congresos y otros eventos |
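
The abstract describes combining a vector quantizer with Q-Learning. The following is a minimal sketch of that idea and not the authors' implementation: plain Lloyd iterations with random initialization stand in for the Generalized Lloyd Algorithm, and the names `nearest`, `lloyd`, and `q_update`, the learning rate `alpha`, and the discount `gamma` are all assumptions made for illustration.

```python
import random

def nearest(codebook, x):
    """Index of the codeword closest to x (squared Euclidean distance)."""
    return min(range(len(codebook)),
               key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], x)))

def lloyd(samples, k, iters=20, seed=0):
    """Assumed simplification: random initial codewords, then repeated
    nearest-codeword assignment and centroid update."""
    rng = random.Random(seed)
    codebook = [list(s) for s in rng.sample(samples, k)]
    for _ in range(iters):
        cells = [[] for _ in range(k)]
        for s in samples:
            cells[nearest(codebook, s)].append(s)
        for i, cell in enumerate(cells):
            if cell:  # an empty cell keeps its previous codeword
                codebook[i] = [sum(col) / len(cell) for col in zip(*cell)]
    return codebook

def q_update(q, codebook, state, action, reward, next_state,
             alpha=0.1, gamma=0.9):
    """One tabular Q-Learning step over quantized (discrete) states."""
    s = nearest(codebook, state)        # continuous state -> codeword index
    s2 = nearest(codebook, next_state)
    target = reward + gamma * max(q[s2])
    q[s][action] += alpha * (target - q[s][action])
    return s

# Hypothetical usage: quantize sampled 2-D states, then learn over the indices.
samples = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
codebook = lloyd(samples, k=2)
q = [[0.0, 0.0] for _ in codebook]      # two hypothetical actions per state
q_update(q, codebook, (0.0, 0.0), action=1, reward=1.0, next_state=(5.0, 5.0))
```

The codebook is built once from sampled states, so the Q-table has one row per codeword rather than one per raw sensor reading.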