RT Generic
T1 Elementos de posicionamiento de los sistemas de pregunta respuesta en la web
A1 Brita-Paja Núñez, Pablo
AB El presente trabajo surge de la unión de dos elementos los cuales, a pesar de que fueronconcebidos por separado, han resultado ser complementarios el uno del otro. Por unlado, están los sistemas pregunta-respuesta, nacidos en la década de los 60 y, por otrolado, el internet, nacido a principios de la década de los 90. Ambos tienen una finalidadque emana del acceso y manejo de la información. Es por ello que cuando les damos unuso conjunto, conformamos dispositivos más sofisticados, útiles y complejos.A través del procesamiento del lenguaje natural (Python), el análisis de datos (R) y laidentificación de patrones (Python), se procederá a determinar los elementos deposicionamiento de los sistemas pregunta-respuesta en la web. Es decir, se pretendeentender qué factores son relevantes para que un buscador elija un documentodeterminado y, dentro de dicho documento, la información que se selecciona y se extraecomo respuesta.Resulta de interés general profundizar en el estudio del funcionamiento de estos factoresdebido a la importancia y el gran uso de este tipo de herramientas en multitud decampos, tanto científicos como cotidianos. El problema es lo difuso que es para lapoblación el funcionamiento de éstas, debido a su complejidad y a lo opacos que son losdiseñadores con su desarrollo.El trabajo se centra en Google por ser el principal buscador utilizado en la actualidadcon una amplia ventaja sobre los demás. Se ha generado el corpus a partir dedocumentos tanto en inglés como en español y se han aplicado distintos algoritmos paraobtener qué factores influyen y determinan su elección.Los resultados del estudio realizado confirman que los factores que más influyen son larelevancia, la fiabilidad y el tiempo de carga del documento para cada consulta.
AB The present work arises from the union of two elements which, although they wereconceived separately, have turned out to be complementary to each other. On the onehand, there are the question-answer systems, born in the 60's, and on the other hand, theInternet, born in the early 90's. Both have a purpose that emanates from the access andmanagement of information. Both have a purpose that emanates from the access andmanagement of information. That is why together they form a more sophisticated,useful and complex device.Through natural language processing (Python), data analysis (R) and patternidentification (Python), we will proceed to determine the positioning elements ofquestion-answer systems on the web. In other words, the aim is to understand whichfactors are relevant both for a search engine to choose a certain document, and to choosewhich section within the document to select and extract as an answer.It is of general interest to deepen the study of the operation of these tools due to theimportance and the great use of this type of tools in many fields, both scientific andevery day. The problem is how fuzzy it is for the population how they work, due to theircomplexity and how opaque designers are with their development.The work will focus on Google because it is the main search engine used today with awide advantage over the others. A corpus will be generated from documents in bothEnglish and Spanish and different algorithms will be applied to obtain which factorsinfluence and determine their choice.The results of the study will confirm that the most influential factors are relevance,reliability and document loading time for each query.
YR 2022
FD 2022
LK https://hdl.handle.net/10016/36335
UL https://hdl.handle.net/10016/36335
LA spa
DS e-Archivo
RD 18 jul. 2024