Publication: Evaluación e implantación de la distribución Cloudera Distribution Hadoop para sistemas Big Data
Loading...
Identifiers
Publication date
2020-07-17
Defense date
2020-07-07
Authors
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
En la era de la información, la sociedad, los clientes y las empresas cada vez
generan e intentan procesar mayores cantidades de datos. Para conseguir obtener y
analizar tanta información surge el término Big Data.
En este proyecto se intenta explicar en qué consiste y qué sistemas están
implicados en el proceso de procesamiento de grandes volúmenes de información.
Se realizará un estudio de los tipos de sistemas Big Data, soluciones comerciales
disponibles actualmente y los componentes de estos sistemas.
Probaremos extensivamente sistemas de virtualización para decidir cuáles son
realmente capaces de gestionar servidores de producción con altas capacidades de
computación.
Para finalizar, se analizará, desarrollará e implantará una solución que sirva para
exponer de forma práctica los conocimientos teóricos adquiridos.
In the Information Age, society, customers and companies are generating and try to process larger amounts of data. The Big Data term arises to get obtain and analyze as much information. This project attempts to explain how it works and what systems are involved in the process of processing large volumes of information. It will be made a study of the types of Big Data systems, currently available commercial solutions and the components of these systems. We will extensively test virtualization systems to decide which are really capable of managing production servers with high computing capabilities. Finally, it will be analyze, develop and set up a solution that fits conveniently expose the theoretical knowledge achieved.
In the Information Age, society, customers and companies are generating and try to process larger amounts of data. The Big Data term arises to get obtain and analyze as much information. This project attempts to explain how it works and what systems are involved in the process of processing large volumes of information. It will be made a study of the types of Big Data systems, currently available commercial solutions and the components of these systems. We will extensively test virtualization systems to decide which are really capable of managing production servers with high computing capabilities. Finally, it will be analyze, develop and set up a solution that fits conveniently expose the theoretical knowledge achieved.
Description
Keywords
Big data, Datos masivos, Ingeniería del conocimiento, Recuperación de la información