RT Journal Article
T1 Performance-aware scheduling of parallel applications on non-dedicated clusters
A1 Cascajo García, Alberto
A1 Expósito Singh, David
A1 Carretero Pérez, Jesús
AB This work presents an HPC framework that provides new strategies for resource management and job scheduling, based on executing different applications on shared compute nodes to maximize platform utilization. The framework includes a scalable monitoring tool that analyzes the utilization of the platform's compute nodes. We also introduce an extension of CLARISSE, a middleware for data-staging coordination and control on large-scale HPC platforms, which uses the information provided by the monitor in combination with application-level analysis to detect performance degradation in the running applications. This degradation, which arises because the applications share the compute nodes and may compete for their resources, is avoided by means of dynamic application migration. A description of the architecture is given, and a practical evaluation of the proposal shows significant performance improvements of up to 20% in makespan and 10% in energy consumption compared with a non-optimized execution.
PB MDPI
SN 2079-9292
YR 2019
FD 2019-09-02
LK https://hdl.handle.net/10016/29648
UL https://hdl.handle.net/10016/29648
LA eng
NO ASPIDE: Exascale programIng models for extreme data processing
NO This work was partially supported by the Spanish Ministry of Economy, Industry and Competitiveness under the grant TIN2016-79637-P "Towards Unification of HPC and Big Data Paradigms", and by the European Union's Horizon 2020 research and innovation program under Grant No. 801091, project "Exascale programming models for extreme data processing" (ASPIDE).
DS e-Archivo
RD 12 Sep. 2024