Files in this item



application/pdfscalDistFS.pdf (548kB)
An efficient and scalable distributed file system for (cloud computing).PDF


Title:A Scalable Distributed File System for Cloud Computing
Author(s):Fesehaye, Debessay; Malik, Rahul; Nahrstedt, Klara
Subject(s):Cloud architecture, resource allocation, distributed name-node
Abstract:Large scale distributed systems such as cloud computing applications are becoming very common. These applications come with increasing challenges on how to transfer and where to store and compute data. The most prevalent distributed file systems to deal with these challenges is the Hadoop File System (HDFS) which is a variant of the Google File System (GFS). However HDFS has two potential problems. The first one is that it depends on a single name node to manage almost all operations of every data block in the file system. As a result it can be a bottleneck resource and a single point of failure. The second potential problem with HDFS is that it depends on TCP to transfer data. As has been cited in many studies TCP takes many rounds before it can send at the full capacity of the links in the cloud. This results in low link utilization and longer download times. To overcome these problems of HDFS we present a new distributed file system. Our scheme uses a light weight front end server to connect all requests with many name nodes. This helps distribute load of a single name node to many name nodes. Our second contribution is to use an efficient protocol to send and route data. Our protocol can achieve full link utilization and hence decreased download times. Based on simulation our protocol can outperform HDFS and hence GFS.
Issue Date:2010-03-16
Citation Info:ScienceCloud2010
Genre:Technical Report
Conference Proceeding (whole)
Publication Status:published or submitted for publication
Peer Reviewed:not peer reviewed
Date Available in IDEALS:2010-03-16

This item appears in the following Collection(s)

Item Statistics