DSpace e-Archivo


Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/6790

Files in This Item:
context_billhardt_JASIST_2002_ps.pdf (393.14 kB, Adobe PDF)
Title: A context vector model for information retrieval
Author(s): Billhardt, Holger
Borrajo, Daniel
Maojo, Víctor
Publisher: Wiley & Sons
Issued date: 2002
Citation: Journal of the American Society for Information Science and Technology, 2002, vol. 53, n. 3, p. 236-249
URI: http://hdl.handle.net/10016/6790
DOI: http://dx.doi.org/10.1002/asi.10032
Abstract: In the vector space model for information retrieval, term vectors are pair-wise orthogonal, that is, terms are assumed to be independent. It is well known that this assumption is too restrictive. In this article, we present our work on an indexing and retrieval method that, based on the vector space model, incorporates term dependencies and thus obtains semantically richer representations of documents. First, we generate term context vectors based on the co-occurrence of terms in the same documents. These vectors are used to calculate context vectors for documents. We present different techniques for estimating the dependencies among terms. We also define term weights that can be employed in the model. Experimental results on four text collections (MED, CRANFIELD, CISI, and CACM) show that the incorporation of term dependencies in the retrieval process performs statistically significantly better than the classical vector space model with IDF weights. We also show that the degree of semantic matching versus direct word matching that performs best varies on the four collections. We conclude that the model performs well for certain types of queries and, generally, for information tasks with high recall requirements. Therefore, we propose the use of the context vector model in combination with other, direct word-matching methods.
Review: Peer reviewed
Publisher version: http://dx.doi.org/10.1002/asi.10032
Keywords: Vector space models
Document retrieval
Vector analysis
Co-occurrence analysis
Contextual information
Rights: © Wiley Periodicals
Appears in Collections: DI - PLG - Artículos de Revistas
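
Illustrative sketch: the abstract describes building term context vectors from co-occurrence in the same documents, deriving document context vectors from them, and matching queries semantically rather than by direct word overlap. The Python below is a minimal sketch of that general idea only. The abstract gives no formulas, so the co-occurrence estimate (number of documents shared by two terms), the term-frequency weighting, and every function name here are assumptions made for illustration, not the authors' definitions.

```python
import math
from collections import Counter


def normalize(vec):
    """Scale a vector to unit length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def term_context_vectors(docs, vocab):
    """Row i describes term i by the terms it shares documents with.

    Assumption: dependency strength is the number of documents in which
    both terms occur. The paper compares several estimators instead.
    """
    index = {term: i for i, term in enumerate(vocab)}
    ctx = [[0.0] * len(vocab) for _ in vocab]
    for doc in docs:
        present = {index[term] for term in doc}
        for i in present:
            for j in present:
                ctx[i][j] += 1.0
    return [normalize(row) for row in ctx]


def context_vector(tokens, vocab, term_ctx):
    """Combine the context vectors of a text's terms, weighted by term frequency."""
    index = {term: i for i, term in enumerate(vocab)}
    counts = Counter(t for t in tokens if t in index)  # skip unknown terms
    vec = [0.0] * len(vocab)
    for term, tf in counts.items():
        row = term_ctx[index[term]]
        for j, value in enumerate(row):
            vec[j] += tf * value
    return normalize(vec)


def cosine(a, b):
    """Cosine similarity of two vectors that are already unit length."""
    return sum(x * y for x, y in zip(a, b))


# Toy collection (hypothetical data, not from the paper's test collections).
docs = [
    ["heart", "attack", "treatment"],
    ["cardiac", "attack", "treatment"],
    ["database", "index", "query"],
]
vocab = sorted({term for doc in docs for term in doc})
term_ctx = term_context_vectors(docs, vocab)
doc_vecs = [context_vector(doc, vocab, term_ctx) for doc in docs]

query_vec = context_vector(["heart"], vocab, term_ctx)
scores = [cosine(query_vec, doc_vec) for doc_vec in doc_vecs]
for doc, score in zip(docs, scores):
    print(f"{score:.3f}  {' '.join(doc)}")
```

In this toy collection the query "heart" shares no word with the second document, so direct word matching would score that document zero. The context vectors still give it a positive score, because "heart" co-occurs with "attack" and "treatment". This is the kind of semantic matching the abstract contrasts with direct word matching.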


Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 
