Files in this item

rmk.pdf (application/pdf, 361kB)
body of report

CRHC-06-12 REPORT DOCUMENTATION PAGE.pdf (application/pdf, 22kB)
report documentation page (cover is missing)


Title: An Operating-System-Level Framework for Providing Application-Aware Reliability
Author(s): Wang, Long; Kalbarczyk, Zbigniew; Gu, Weining; Iyer, Ravi
Subject(s): Operating system; Application aware; Application specific; Hang detection; Application transparent
Abstract: Operating systems enable collecting and extracting rich information on application execution characteristics, including program counter traces, memory access patterns, and operating-system-generated signals. This information can be exploited to design highly efficient, application-aware reliability mechanisms that are transparent to applications. This paper describes the Reliability MicroKernel framework (RMK), a loadable kernel module for providing application-aware reliability and dynamically configuring reliability mechanisms installed in RMK. The RMK prototype is implemented in Linux and supports detection of application/OS failures and transparent application checkpointing. Experiment results show that the OS hang detection and application hang detection, which exploit characteristics of application and system behavior, can achieve 100% coverage and low false positive rates. Moreover, the performance overhead of RMK and the detection/checkpointing mechanisms is small (0.6% for application hang detection and 0.1% for transparent application checkpointing in the experiments).
Issue Date: 2006-09
Publisher: Coordinated Science Laboratory, University of Illinois at Urbana-Champaign
Series/Report: Coordinated Science Laboratory Report no. UILU-ENG-06-2218, CRHC-06-12
Genre: Technical Report
Sponsor: National Science Foundation / CNS-04-06351, CNS-05-24695, and ACI-0121658 ITR/AP; Gigascale Systems Research Center/MARCO
Date Available in IDEALS: 2018-03-30
