Evaluating data caching techniques in DMCF workflows using Hercules

e-Archivo Repository

dc.contributor.author Rodrigo Duro, Francisco José
dc.contributor.author Marozzo, Fabrizio
dc.contributor.author García Blas, Javier
dc.contributor.author Carretero Pérez, Jesús
dc.contributor.author Talia, Domenico
dc.contributor.author Trunfio, Paolo
dc.date.accessioned 2015-11-18T09:57:26Z
dc.date.available 2015-11-18T09:57:26Z
dc.date.issued 2015-10
dc.identifier.bibliographicCitation Carretero Pérez, Jesús; et al. (eds.). (2015) Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland. Universidad Carlos III de Madrid, pp. 95-106.
dc.identifier.isbn 978-84-608-2581-4
dc.identifier.uri http://hdl.handle.net/10016/22027
dc.description.abstract The Data Mining Cloud Framework (DMCF) is an environment for designing and executing data analysis workflows on cloud platforms. Currently, DMCF relies on the default storage of the public cloud provider for all I/O-related operations, so its I/O performance is limited by the performance of that storage. In this work we propose using the Hercules system within DMCF as an ad hoc storage system for temporary data produced inside workflow-based applications. Hercules is a highly scalable, easy-to-deploy distributed in-memory storage system. The proposed solution takes advantage of the scalability of Hercules to avoid the bandwidth limits of the default storage. Early experimental results presented in this paper show promising performance, particularly for write operations, compared with the performance obtained using the default storage services.
dc.description.sponsorship This work is partially supported by the EU under the COST Program Action IC1305: Network for Sustainable Ultrascale Computing (NESUS), and by grant TIN2013-41350-P, Scalable Data Management Techniques for High-End Computing Systems, from the Spanish Ministry of Economy and Competitiveness.
dc.format.extent 12
dc.format.mimetype application/pdf
dc.language.iso eng
dc.subject.other DMCF
dc.subject.other Hercules
dc.subject.other Data analysis
dc.subject.other Workflows
dc.subject.other In-memory storage
dc.subject.other Microsoft Azure
dc.title Evaluating data caching techniques in DMCF workflows using Hercules
dc.type bookPart
dc.type conferenceObject
dc.subject.eciencia Informática
dc.rights.accessRights openAccess
dc.relation.projectID Gobierno de España. TIN2013-41350-P
dc.type.version publishedVersion
dc.relation.eventdate September 10-11, 2015
dc.relation.eventnumber 2
dc.relation.eventplace Krakow, Poland
dc.relation.eventtitle International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015)
dc.relation.eventtype proceeding
dc.identifier.publicationfirstpage 95
dc.identifier.publicationlastpage 106
dc.identifier.publicationtitle Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland
dc.identifier.uxxi CC/0000024008