Performance-aware scheduling of parallel applications on non-dedicated clusters

Cascajo García, Alberto; Expósito Singh, David; Carretero Pérez, Jesús

Identifiers

URI: https://hdl.handle.net/10016/29648

ISSN: 2079-9292

DOI: https://doi.org/10.3390/electronics8090982

UXXI: AR/0000024221

Files

performance_ELECTRONICS_2019.pdf (1.1 MB)

Publication date

2019-09-02

Authors

Cascajo García, Alberto

Expósito Singh, David

Carretero Pérez, Jesús

Publisher

MDPI

Abstract

This work presents an HPC framework that provides new strategies for resource management and job scheduling, based on executing different applications in shared compute nodes to maximize platform utilization. The framework includes a scalable monitoring tool that analyzes the utilization of the platform's compute nodes. We also introduce an extension of CLARISSE, a middleware for data-staging coordination and control on large-scale HPC platforms, that combines the information provided by the monitor with application-level analysis to detect performance degradation in the running applications. This degradation, which arises because applications sharing the compute nodes may compete for their resources, is avoided by means of dynamic application migration. We describe the architecture and present a practical evaluation of the proposal, which shows improvements of up to 20% in makespan and 10% in energy consumption compared to a non-optimized execution.

Description

ASPIDE: Exascale programIng models for extreme data processing

Keywords

Scalable tools, Monitoring tools, Scheduling, Malleability

Bibliographic citation

Electronics, 2019, 8, 982, 21 pp.

Collections

ASPIDE: Exascale programIng models for extreme data processing
OpenAIRE: Open Access Infrastructure for Research in Europe
