Protecting QDI interconnects from transient faults using delay-insensitive redundant check codes

Guangda Zhang, Wei Song, Jim Garside, Javier Navaridas Palma, Zhiying Wang

    Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

    Abstract

    Asynchronous circuit design is a promising technology for large-scale multi-core systems. As a family of asynchronous circuits, Quasi-delay-insensitive (QDI) circuits have been widely used to build chip-level long interconnects due to their tolerance to delay variations. However, \{QDI\} interconnects are vulnerable to faults. Traditional fault-tolerant techniques for synchronous circuits cannot be easily used to protect \{QDI\} interconnects. This paper focuses on protecting \{QDI\} interconnects from transient faults. The first contribution of this paper is a fault-tolerant delay-insensitive redundant check code named DIRC. Using \{DIRC\} in 4-phase 1-of-n \{QDI\} pipelines, all 1-bit and some multi-bit transient faults are tolerated. The \{DIRC\} and basic pipeline stages are mutually exchangeable. Arbitrary basic stages can be replaced by \{DIRC\} ones to strengthen fault-tolerance. This feature permits designers to use \{DIRC\} flexibly according to the practical design requirement. The second contribution is a redundant technique protecting the acknowledge wires (RPA). Experimental results indicate that \{DIRC\} pipelines have moderate area and speed overheads. Compared with unprotected basic pipelines, the average speed decrease of \{DIRC\} pipelines is less than 50%, with the 128-bit 1-of-2 \{DIRC\} pipeline only 28% slower. In severe environments with multi-bit transient faults, the fault-tolerance capability of \{DIRC\} pipelines increases thousands-fold.
    Original languageEnglish
    Title of host publicationMicroprocessors and Microsystems
    PublisherElsevier BV
    Pages826-842
    Number of pages17
    Volume38
    Edition8 : Pt.A
    ISBN (Print)0141-9331
    DOIs
    Publication statusPublished - 2014
    Event2013 edition of the Euromicro Conference on Digital System Design (DSD 2013) - Microprocessors and Microsystems
    Duration: 1 Jan 1824 → …

    Conference

    Conference2013 edition of the Euromicro Conference on Digital System Design (DSD 2013)
    CityMicroprocessors and Microsystems
    Period1/01/24 → …

    Keywords

    • Asynchronous circuits; Quasi-delay-insensitive (QDI) interconnects; Fault tolerance; Transient fault; 1-of-n; Error-correcting code

    Fingerprint

    Dive into the research topics of 'Protecting QDI interconnects from transient faults using delay-insensitive redundant check codes'. Together they form a unique fingerprint.

    Cite this