
Understanding the Propagation of Hard Errors to Software and its Implications for Resilient System Design


Bookmark or cite this item: http://hdl.handle.net/2142/11298

Files in this item

File: Understanding t ... esilient System Design.pdf (271KB)
Description: (no description provided)
Format: PDF
Title: Understanding the Propagation of Hard Errors to Software and its Implications for Resilient System Design
Author(s): Li, Man-Lap; Ramachandran, Pradeep; Adve, Sarita V.; Adve, Vikram S.; Zhou, YuanYuan
Subject(s): computer science
Abstract: With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field faults. To be broadly deployable, the hardware reliability solution must incur low overheads, precluding the use of expensive redundancy. We explore a co-designed hardware-software solution that treats most hardware faults as software bugs and leverages common mechanisms for hardware and software reliability, thereby amortizing some of the overhead. Fundamental to such a solution is a characterization of how hardware faults in different microarchitectural structures of a modern processor propagate through the application and OS. This paper aims to provide such a characterization, to identify low-cost detection methods that intercept fault propagation, and to provide guidelines for a complete co-designed reliability solution. We focus on hard faults because they are increasingly important and have different system implications than the much-studied transients. We achieve our goals through fault injection experiments with a microarchitecture-level full-system timing simulator.
Issue Date: 2007-02
Genre: Technical Report
Type: Text
URI: http://hdl.handle.net/2142/11298
Other Identifier(s): UIUCDCS-R-2007-2822
Rights Information: You are granted permission for the non-commercial reproduction, distribution, display, and performance of this technical report in any format, BUT this permission is only for a period of 45 (forty-five) days from the most recent time that you verified that this technical report is still available from the University of Illinois at Urbana-Champaign Computer Science Department under terms that include this permission. All other rights are reserved by the author(s).
Date Available in IDEALS: 2009-04-21
 


