Data management techniques
Author(s):
Bilas, Angelos; Carretero Pérez, Jesús; Cortes, Toni; García Blas, Francisco Javier; González Ferez, Pilar; Papagiannis, Anastasios; Queratl, Anna; Marozo, Fabrizio; Saloustros, Giorgios; Shoker, Ali; Talia, Domenico; Trunfio, Paolo
Publisher:
IET - The Institution Of Engineering And Technology
Issued date:
2019-01
Citation:
Bilas, A., Carretero, J., Cortes, T., García-Blas, J. González-Pérez, P., Papagiannis, A., Queralt, A., Marozzo, F., Saloustros, G., Shoker, A., Talia, D., Trunfio, P. (2019). Data management techniques. En J. Carretero (Ed.), Ultrascale Computing Systems (85-126). IET Digital Library
ISBN:
9781785618338
Keywords:
availability
,
scalability
,
CRDT
,
dataclay
,
key-value stores
,
ssds
,
memory mapped I/O
,
RDMA-Based communication
,
data-centric
,
data-locality
,
workflows
,
cloud computing
,
DMCF
,
Hercules
Rights:
© The Institution of Engineering and Technology 2019
Abstract:
Today, it is projected that data storage and management is becoming one of the key challenges in order to achieve ultrascale computing for several reasons. First, data is expected to grow exponentially in the coming years and this progression will imply that d
Today, it is projected that data storage and management is becoming one of the key challenges in order to achieve ultrascale computing for several reasons. First, data is expected to grow exponentially in the coming years and this progression will imply that disruptive technologies will be needed to store large amounts of data and more importantly to access it in a timely manner. Second, the improvement of computing elements and their scalability are shifting application execution from CPU bound to I/O bound. This creates additional challenges for significantly improving the access to data to keep with computation time and thus avoid high-performance computing (HPC) from being underutilized due to large periods of I/O activity. Third, the two initially separate worlds of HPC that mainly consisted on one hand of simulations that are CPU bound and on the other hand of analytics that mainly perform huge data scans to discover information and are I/O bound are blurring. Now, simulations and analytics need to work cooperatively and share the same I/O infrastructure.
[+]
[-]
Show full item record
Impact:
Files in this item
File's Preview
*Click on file's image for preview. (Embargoed files's preview is not supported)
This item appears in the following
Collection(s)