Exploiting fast matrix multiplication within the level 3 BLAS

    Research output: Contribution to journalArticlepeer-review

    Abstract

    The Level 3 BLAS (BLAS3) are a set of specifications of FORTRAN 77 subprograms for carrying out matrix multiplications and the solution of triangular systems with multiple right-hand sides. They are intended to provide efficient and portable building blocks for linear algebra algorithms on high-performance computers. We describe algorithms for the BLAS3 operations that are asymptotically faster than the conventional ones. These algorithms are based on Strassen's method for fast matrix multiplication, which is now recognized to be a practically useful technique once matrix dimensions exceed about 100. We pay particular attention to the numerical stability of these ″fast BLAS3'. Error bounds are given and their significance is explained and illustrated with the aid of numerical experiments. Our conclusion is that the fast BLAS3, although not as strongly stable as conventional implementations, are stable enough to merit careful consideration in many applications.
    Original languageEnglish
    Pages (from-to)352-368
    Number of pages16
    JournalACM Transactions on Mathematical Software
    Volume16
    Issue number4
    DOIs
    Publication statusPublished - Dec 1990

    Fingerprint

    Dive into the research topics of 'Exploiting fast matrix multiplication within the level 3 BLAS'. Together they form a unique fingerprint.

    Cite this