|
Archivo Abierto Institucional de la Universidad Carlos III de Madrid >
Investigación >
Departamentos >
Departamento de Informática >
Grupo de Computación Evolutiva y Redes Neuronales (EVANNAI) >
DI - GCERN - Artículos de revistas científicas >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10016/6598
|
| Title: | OMBO: An opponent modeling approach |
| Author(s): | Ledezma, Agapito Aler, Ricardo Sanchis, Araceli Borrajo, Daniel |
| Publisher: | IOS Press |
| Issued date: | 2009 |
| Citation: | AI Communications, 22, 1, (2009), 21-35 |
| URI: | http://hdl.handle.net/10016/6598 |
| ISSN: | 0921-7126 |
| DOI: | http://dx.doi.org/10.3233/AIC-2009-0442 |
| Abstract: | In competitive domains, some knowledge about the opponent can give players a clear advantage. This idea led many people to propose approaches that automatically acquire models of opponents, based only on the observation of their input–output behavior. If opponent outputs could be accessed directly, a model can be constructed by feeding a machine learning method with traces of the behavior of the opponent. However, that is not the case in the RoboCup domain where an agent does not have direct access to the opponent inputs and outputs. Rather, the agent sees the opponent behavior from its own point of view and inputs and outputs (actions) have to be inferred from observation. In this paper, we present an approach to model low-level behavior of individual opponent agents. First, we build a classifier to infer and label opponent actions based on observation. Second, our agent observes an opponent and labels its actions using the previous classifier. From these observations, machine learning techniques generate a model that predicts the opponent actions. Finally, the agent uses the model to anticipate opponent actions. In order to test our ideas, we have created an architecture called OMBO (Opponent Modeling Based on Observation). Using OMBO, a striker agent can anticipate goalie actions. Results show that in this striker-goalie scenario, scores are significantly higher using the acquired opponent's model of actions. |
| Sponsor: | This work has been partially supported by the Spanish MCyT under projects TRA2007-67374- C02-02 and TIN-2005-08818-C04.Also, it has been supported under MEC grant by TIN2005-08945- C06-05. We thank anonymous reviewers for their helpful comments. |
| Review: | PeerReviewed |
| Publisher version: | http://dx.doi.org/10.3233/AIC-2009-0442 |
| Keywords: | Opponent modeling Learning about agents |
| Rights: | © IOS Press |
| Appears in Collections: | DI - GCERN - Artículos de revistas científicas DI - PLG - Artículos de Revistas
|
This item is licensed under a Creative Commons License
Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.
|