Files in this item



3314741.pdf (application/pdf, 3MB), restricted to U of Illinois


Title: A Fault Tolerance Protocol for Fast Recovery
Author(s): Chakravorty, Sayantan
Doctoral Committee Chair(s): Kale, Laxmikant V.
Department / Program: Computer Science
Discipline: Computer Science
Degree Granting Institution: University of Illinois at Urbana-Champaign
Subject(s): Computer Science
Abstract: This thesis presents research aimed at developing a fault-tolerant protocol that is relevant to parallel computing and provides fast restarts. We propose combining the ideas of message logging and object-based virtualization. We leverage two facts: message-logging protocols do not require all processors to roll back when one processor crashes, and object-based virtualization allows work to be moved from one processor to another. We develop a message logging protocol that operates in conjunction with object-based virtualization. We evaluate and study the implementation of our protocol in the Charm++/AMPI run-time. We use benchmarks and real-world applications to investigate and improve the performance of different aspects of our protocol. We also modify the load balancing framework of the Charm++ run-time to work with the message logging protocol. We show that, in the presence of faults, an application using our fault tolerance protocol takes less time to complete than one using a traditional checkpoint-based protocol.
Issue Date: 2008
Description: 207 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2008.
Other Identifier(s): (MiAaPQ)AAI3314741
Date Available in IDEALS: 2015-09-25
Date Deposited: 2008
