Publication: Evaluating data caching techniques in DMCF workflows using Hercules
dc.affiliation.dpto | UC3M. Departamento de Informática | es |
dc.affiliation.grupoinv | UC3M. Grupo de Investigación: Arquitectura de Computadores, Comunicaciones y Sistemas | es |
dc.contributor.author | Rodrigo Duro, Francisco José | |
dc.contributor.author | Marozzo, Fabrizio | |
dc.contributor.author | García Blas, Javier | |
dc.contributor.author | Carretero Pérez, Jesús | |
dc.contributor.author | Talia, Domenico | |
dc.contributor.author | Trunfio, Paolo | |
dc.date.accessioned | 2015-11-18T09:57:26Z | |
dc.date.available | 2015-11-18T09:57:26Z | |
dc.date.issued | 2015-10 | |
dc.description.abstract | The Data Mining Cloud Framework (DMCF) is an environment for designing and executing data analysis workflows in cloud platforms. Currently, DMCF relies on the default storage of the public cloud provider for any I/O related operation. This implies that the I/O performance of DMCF is limited by the performance of the default storage. In this work we propose the usage of the Hercules system within DMCF as an ad-hoc storage system for temporary data produced inside workflow-based applications. Hercules is a distributed in-memory storage system highly scalable and easy to deploy. The proposed solution takes advantage of the scalability capabilities of Hercules to avoid the bandwidth limits of the default storage. Early experimental results are presented in this paper, they show promising performance, particularly for write operations, compared to the performance obtained using the default storage services. | en |
dc.description.sponsorship | This work is partially supported by EU under the COST Program Action IC1305: Network for Sustainable Ultrascale Computing (NESUS). This work is partially supported by the grant TIN2013-41350-P, Scalable Data Management Techniques for High-End Computing Systems from the Spanish Ministry of Economy and Competitiveness. | en |
dc.format.extent | 12 | |
dc.format.mimetype | application/pdf | |
dc.identifier.bibliographicCitation | Carretero Pérez, Jesús; et.al. (eds.). (2015) Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland. Universidad Carlos III de Madrid, pp. 95-106. | en |
dc.identifier.isbn | 978-84-608-2581-4 | |
dc.identifier.publicationfirstpage | 95 | |
dc.identifier.publicationlastpage | 106 | |
dc.identifier.publicationtitle | Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland | en |
dc.identifier.uri | https://hdl.handle.net/10016/22027 | |
dc.identifier.uxxi | CC/0000024008 | |
dc.language.iso | eng | |
dc.relation.eventdate | September 10-11, 2015 | en |
dc.relation.eventnumber | 2 | |
dc.relation.eventplace | Krakow, Poland | en |
dc.relation.eventtitle | International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015) | en |
dc.relation.projectID | Gobierno de España. TIN2013-41350-P | |
dc.rights.accessRights | open access | |
dc.subject.eciencia | Informática | es |
dc.subject.other | DMCF | en |
dc.subject.other | Hercules | en |
dc.subject.other | Data analysis | en |
dc.subject.other | Workflows | en |
dc.subject.other | In-memory storage | en |
dc.subject.other | Microsoft Azure | en |
dc.title | Evaluating data caching techniques in DMCF workflows using Hercules | en |
dc.type | conference paper | * |
dc.type.hasVersion | VoR | * |
dspace.entity.type | Publication |
Files
Original bundle
1 - 1 of 1