Retrospect: Deterministic replay of MPI applications for interactive distributed debugging

Aurelien Bouteiller, George Bosilca, Jack Dongarra

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

    Abstract

    While high performance computing was eagerly adopted by users as a vehicle for satisfying a growing demand on computational power, some areas are still poorly explored. The MPI paradigm is considered as being the keystone for the large development of the HPC infrastructure over the last decade. However, even today the users have to face the lack of tools able to help increase the stability of the software stack and/or of the applications. In this paper we present and evaluate a tool designed to allow developers to further investigate the execution of parallel applications by enabling them to dynamically move back and forth in the execution timeline of a parallel application. Based on an unobtrusive message logging mechanism, deterministic replay is enforced, leading to a simpler and more efficient way to debug parallel software. © Springer-Verlag Berlin Heidelberg 2007.
    Original language: English
    Title of host publication: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) | Lect. Notes Comput. Sci.
    Publisher: Springer Nature
    Pages: 297-306
    Number of pages: 9
    Volume: 4757
    ISBN (Print): 9783540754152
    Publication status: Published - 2007
    Event: 14th European PVM/MPI Users' Group Meeting on Parallel Virtual Machine and Message Passing Interface - Paris
    Duration: 1 Jul 2007 → …
    http://dblp.uni-trier.de/db/conf/pvm/pvm2007.html#BouteillerBD07
    http://dblp.uni-trier.de/rec/bibtex/conf/pvm/BouteillerBD07.xml
    http://dblp.uni-trier.de/rec/bibtex/conf/pvm/BouteillerBD07

    Publication series

    Name: Lecture Notes in Computer Science

    Conference

    Conference: 14th European PVM/MPI Users' Group Meeting on Parallel Virtual Machine and Message Passing Interface
    City: Paris
    Period: 1/07/07 → …