Files in this item



application/pdfWasef TR.pdf (501kB)
(no description provided)PDF


Title:Leveraging Metadata in NoSQL Storage Systems
Author(s):Alkhaldi, Ala'; Gupta, Indranil; Raghavan, Vaijayanth; Ghosh, Mainak
Subject(s):Metadata, NoSQL, Data Provenance
Abstract:NoSQL systems have grown in popularity for storing big data because these systems offer high availability, i.e., operations with high throughput and low latency. However, metadata in these systems are handled today in ad-hoc ways. We present Wasef, a system that treats metadata in a NoSQL database system, as first-class citizens. Metadata may include information such as: operational history for portions of a database table (e.g., columns), placement information for ranges of keys, and operational logs for data items (keyvalue pairs). Wasef allows the NoSQL system to store and query this metadata efficiently.We integrateWasef into Apache Cassandra, one of the most popular key-value stores. We then implement three important uses cases in Cassandra: dropping columns in a flexible manner, verifying data durability during migrational operations such as node decommissioning, and maintaining data provenance. Our experimental evaluation uses AWS EC2 instances and YCSB workloads. Our results show that Wasef: i) scales well with the size of the data and the metadata; ii) affects throughput minimally by only 9%, and iii) affects operational latencies by only 3%.
Issue Date:2015
Genre:Technical Report
Date Available in IDEALS:2015-02-20

This item appears in the following Collection(s)

Item Statistics