Improving the Performance of the MPI_Allreduce Collective Operation through Rank Renaming

Rico-Gallego, Juan-Antonio; Díaz-Martín, Juan-Carlos

Publication:
Improving the Performance of the MPI_Allreduce Collective Operation through Rank Renaming

Identifiers

URI: https://hdl.handle.net/10016/21978

ISBN: 978-84-617-2251-8

Files

improving_NESUS_2014.pdf (551.92 KB)

Publication date

2014-11

Authors

Rico-Gallego, Juan-Antonio

Díaz-Martín, Juan-Carlos

Impact

Export

Abstract

Collective operations, a key issue in the global efficiency of HPC applications, are optimized in current MPI libraries by choosing at runtime between a set of algorithms, based on platform-dependent beforehand established parameters, as the message size or the number of processes. However, with progressively more cores per node, the cost of a collective algorithm must be mainly imputed to process-to-processor mapping, because its decisive influence over the network traffic. Hierarchical design of collective algorithms pursuits to minimize the data movement through the slowest communication channels of the multi-core cluster. Nevertheless, the hierarchical implementation of some collectives becomes inefficient, and even impracticable, due to the operation definition itself. This paper proposes a new approach that departs from a frequently found regular mapping, either sequential or round-robin. While keeping the mapping, the rank assignation to the processes is temporarily changed prior to the execution of the collective algorithm. The new assignation makes the communication pattern to adapt to the communication channels hierarchy. We explore this technique for the Ring algorithm when used in the well-known MPI_Allreduce collective, and discuss the obtained performance results. Extensions to other algorithms and collective operations are proposed.

Description

Proceedings of: First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014). Porto (Portugal), August 27-28, 2014.

Keywords

MPI Collectives, Parallel algorithms, Message passing interface, Multi-core clusters

Bibliographic citation

Carretero Pérez, Jesús; et.al. (eds.). (2014) Proceedings of the First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014): Porto, Portugal. Universidad Carlos III de Madrid, pp. 1-6.

Collections

First International Workshop on Sustainable Ultrascale Computing Systems (NESUS 2014)

Full item page

Publication:
Improving the Performance of the MPI_Allreduce Collective Operation through Rank Renaming

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication: Improving the Performance of the MPI_Allreduce Collective Operation through Rank Renaming

Identifiers

Files

Publication date

Defense date

Authors

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

Impact

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Collections

Publication:
Improving the Performance of the MPI_Allreduce Collective Operation through Rank Renaming