Rodrigo Duro, Francisco JoséMarozzo, FabrizioGarcía Blas, JavierCarretero Pérez, JesúsTalia, DomenicoTrunfio, Paolo2015-11-182015-11-182015-10Carretero Pérez, Jesús; et.al. (eds.). (2015) Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, Poland. Universidad Carlos III de Madrid, pp. 95-106.978-84-608-2581-4https://hdl.handle.net/10016/22027The Data Mining Cloud Framework (DMCF) is an environment for designing and executing data analysis workflows in cloud platforms. Currently, DMCF relies on the default storage of the public cloud provider for any I/O related operation. This implies that the I/O performance of DMCF is limited by the performance of the default storage. In this work we propose the usage of the Hercules system within DMCF as an ad-hoc storage system for temporary data produced inside workflow-based applications. Hercules is a distributed in-memory storage system highly scalable and easy to deploy. The proposed solution takes advantage of the scalability capabilities of Hercules to avoid the bandwidth limits of the default storage. Early experimental results are presented in this paper, they show promising performance, particularly for write operations, compared to the performance obtained using the default storage services.12application/pdfengDMCFHerculesData analysisWorkflowsIn-memory storageMicrosoft AzureEvaluating data caching techniques in DMCF workflows using Herculesconference paperInformáticaopen access95106Proceedings of the Second International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2015): Krakow, PolandCC/0000024008