Blocked Schur algorithms for computing the matrix square root

Edvin Deadman, Nicholas J. Higham, Rui Ralha

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    The Schur method for computing a matrix square root reduces the matrix to the Schur triangular form and then computes a square root of the triangular matrix. We show that by using either standard blocking or recursive blocking the computation of the square root of the triangular matrix can be made rich in matrix multiplication. Numerical experiments making appropriate use of level 3 BLAS show significant speedups over the point algorithm, both in the square root phase and in the algorithm as a whole. In parallel implementations, recursive blocking is found to provide better performance than standard blocking when the parallelism comes only from threaded BLAS, but the reverse is true when parallelism is explicitly expressed using OpenMP. The excellent numerical stability of the point algorithm is shown to be preserved by blocking. These results are extended to the real Schur method. Blocking is also shown to be effective for multiplying triangular matrices. © 2013 Springer-Verlag.
    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|Lect. Notes Comput. Sci.
    PublisherSpringer Nature
    Pages171-182
    Number of pages11
    Volume7782
    ISBN (Print)9783642368028
    DOIs
    Publication statusPublished - 2013
    Event11th International Conference on Applied Parallel and Scientific Computing, PARA 2012 - Helsinki
    Duration: 1 Jul 2013 → …

    Publication series

    NameLecture Notes in Computer Science

    Conference

    Conference11th International Conference on Applied Parallel and Scientific Computing, PARA 2012
    CityHelsinki
    Period1/07/13 → …

    Fingerprint

    Dive into the research topics of 'Blocked Schur algorithms for computing the matrix square root'. Together they form a unique fingerprint.

    Cite this