Files in this item

FilesDescriptionFormat

application/pdf

application/pdfFaraz_Faghri.pdf (589kB)
(no description provided)PDF

Description

Title:Performance guarantees for deadline-driven MapReduce jobs under failure
Author(s):Faghri, Faraz
Advisor(s):Beck, Carolyn L.
Department / Program:Industrial & Enterprise Systems Engineering
Discipline:Industrial Engineering
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:M.S.
Genre:Thesis
Subject(s):Performance of systems
Service Level Objectives
Fault tolerance
Cloud computing
Hadoop
MapReduce
Abstract:Increasingly, large systems and data centers are being built in a 'scale out' manner, i.e. using large numbers of commodity hardware components, instead of traditional 'scale up' using expensive, specialized equipment. However, large numbers of commodity components imply higher rates of failure across such systems. Such failures can cause applications to miss their deadlines for task completion. For this reason, cloud service providers and cloud applications must anticipate failures and engineer their services accordingly. In this thesis, we first analyze the availability of a commodity data center designed for MapReduce applications. MapReduce is increasingly used in industry for efficient large scale data processing tasks including personal advertising, spam detection, as well as data mining. We show how MapReduce software level fault tolerance can be used to achieve the same availability as scale up data centers. Second, we extend existing job schedulers for deadline-driven jobs to handle machine and software failures and satisfy the service level objectives.
Issue Date:2013-08-22
URI:http://hdl.handle.net/2142/45663
Rights Information:Copyright 2013 Faraz Faghri
Date Available in IDEALS:2013-08-22
2015-08-22
Date Deposited:2013-08


This item appears in the following Collection(s)

Item Statistics