Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs

Hartwig Anzt, Moritz Kreutzer, Eduardo Ponce, Gregory Peterson, Gerhard Wellein, Jack Dongarra

    Research output: Contribution to journalArticlepeer-review

    Abstract

    In this paper, we present an optimized GPU implementation for the induced dimension reduction algorithm. We improve data locality, combine it with an efficient sparse matrix vector kernel, and investigate the potential of overlapping computation with communication as well as the possibility of concurrent kernel execution. A comprehensive performance evaluation is conducted using a suitable performance model. The analysis reveals efficiency of up to 90%, which indicates that the implementation achieves performance close to the theoretically attainable bound.
    Original languageEnglish
    Pages (from-to)1-11
    JournalInternational Journal of High Performance Computing Applications
    Early online date5 May 2016
    DOIs
    Publication statusPublished - 2016

    Fingerprint

    Dive into the research topics of 'Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs'. Together they form a unique fingerprint.

    Cite this