Español English Contacte con nosotros http://www.uc3m.es/portal/page/portal/biblioteca
DSpace e-Archivo

Archivo Abierto Institucional de la Universidad Carlos III de Madrid > Investigación > Departamentos > Departamento de Informática > Grupo de Computación Evolutiva y Redes Neuronales (EVANNAI) > DI - GCERN - Artículos de revistas científicas >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/6598

Google™ Scholar. Others By: Ledezma, Agapito - Aler, Ricardo - Sanchis, Araceli - Borrajo, Daniel
Files in This Item:
ombo_aler_AIC_2009_ps.pdf165,32 kBAdobe PDFformato pdf
Title: OMBO: An opponent modeling approach
Author(s): Ledezma, Agapito
Aler, Ricardo
Sanchis, Araceli
Borrajo, Daniel
Publisher: IOS Press
Issued date: 2009
Citation: AI Communications, 22, 1, (2009), 21-35
URI: http://hdl.handle.net/10016/6598
ISSN: 0921-7126
DOI: http://dx.doi.org/10.3233/AIC-2009-0442
Abstract: In competitive domains, some knowledge about the opponent can give players a clear advantage. This idea led many people to propose approaches that automatically acquire models of opponents, based only on the observation of their input–output behavior. If opponent outputs could be accessed directly, a model can be constructed by feeding a machine learning method with traces of the behavior of the opponent. However, that is not the case in the RoboCup domain where an agent does not have direct access to the opponent inputs and outputs. Rather, the agent sees the opponent behavior from its own point of view and inputs and outputs (actions) have to be inferred from observation. In this paper, we present an approach to model low-level behavior of individual opponent agents. First, we build a classifier to infer and label opponent actions based on observation. Second, our agent observes an opponent and labels its actions using the previous classifier. From these observations, machine learning techniques generate a model that predicts the opponent actions. Finally, the agent uses the model to anticipate opponent actions. In order to test our ideas, we have created an architecture called OMBO (Opponent Modeling Based on Observation). Using OMBO, a striker agent can anticipate goalie actions. Results show that in this striker-goalie scenario, scores are significantly higher using the acquired opponent's model of actions.
Sponsor: This work has been partially supported by the Spanish MCyT under projects TRA2007-67374- C02-02 and TIN-2005-08818-C04.Also, it has been supported under MEC grant by TIN2005-08945- C06-05. We thank anonymous reviewers for their helpful comments.
Review: PeerReviewed
Publisher version: http://dx.doi.org/10.3233/AIC-2009-0442
Keywords: Opponent modeling
Learning about agents
Rights: © IOS Press
Appears in Collections:DI - GCERN - Artículos de revistas científicas
DI - PLG - Artículos de Revistas

Refworks Export

SFX Query

This item is licensed under a Creative Commons License
Creative Commons

Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! © Universidad Carlos III de Madrid - Software DSpace - Terms of use - Feedback