Español English Contacte con nosotros http://www.uc3m.es/portal/page/portal/biblioteca
DSpace e-Archivo

Archivo Abierto Institucional de la Universidad Carlos III de Madrid > Investigación > Departamentos > Departamento de Física > Grupo de Física de Plasmas > DF - GFP - Artículos de revistas >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10016/8910

Google™ Scholar. Others By: Hirshman, S. P. - Perumalla, K. S. - Lynch, V. E. - Sánchez, Raúl
Files in This Item:
bcyclic_sanchez_jcp_2010_ps.pdfpost-print version1,09 MBAdobe PDFformato pdf
Title: BCYCLIC: A parallel block tridiagonal matrix cyclic solver
Author(s): Hirshman, S. P.
Perumalla, K. S.
Lynch, V. E.
Sánchez, Raúl
Publisher: Elsevier
Issued date: Sep-2010
Citation: Journal of Computational Physics, 2010, v. 229, n. 18, p. 6392-6404
URI: http://hdl.handle.net/10016/8910
ISSN: 0021-9991
DOI: 10.1016/j.jcp.2010.04.049
Description: 13 pages, 6 figures.
Abstract: A block tridiagonal matrix is factored with minimal fill-in using a cyclic reduction algorithm that is easily parallelized. Storage of the factored blocks allows the application of the inverse to multiple right-hand sides which may not be known at factorization time. Scalability with the number of block rows is achieved with cyclic reduction, while scalability with the block size is achieved using multithreaded routines (OpenMP, GotoBLAS) for block matrix manipulation. This dual scalability is a noteworthy feature of this new solver, as well as its ability to efficiently handle arbitrary (non-powers-of-2) block row and processor numbers. Comparison with a state-of-the art parallel sparse solver is presented. It is expected that this new solver will allow many physical applications to optimally use the parallel resources on current supercomputers. Example usage of the solver in magneto-hydrodynamic (MHD), three-dimensional equilibrium solvers for high-temperature fusion plasmas is cited.
Sponsor: This research has been sponsored by the US Department of Energy under Contract DE-AC05-00OR22725 with UT-Battelle, LLC. This research used resources of the National Center for Computational Sciences at Oak Ridge National Laboratory, which is supported by the Office of Science of the Department of Energy under Contract DE-AC05-00OR22725.
Review: PeerReviewed
Publisher version: http://dx.doi.org/10.1016/j.jcp.2010.04.049
Keywords: Cyclic reduction
Block matrix
Dense blocks
Tridiagonal matrix
Thomas algorithm
Parallel computing
Rights: © Elsevier
Appears in Collections:DF - GFP - Artículos de revistas

Refworks Export

SFX Query

Items in E-Archivo are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! © Universidad Carlos III de Madrid - Software DSpace - Terms of use - Feedback