The Data De-Duplication In Cloud Storage Management
Main Article Content
Abstract
With the explosive growth in data volume, the I/O bottleneck has become an increasingly daunting
challenge for big data analytics in the Cloud. Recent studies have shown that moderate to high data redundancy
clearly exists in primary storage systems in the Cloud. Moreover, directly applying data de-duplication to primary
storage systems in the Cloud will likely cause space contention in memory and data fragmentation on disks. Based
on these observations, this project propose a performance-oriented I/O de-duplication, called POD, rather than a
capacity oriented I/O de-duplication, exemplified by I Dedup, To improve the I/O performance of primary storage
systems in the Cloud without sacrificing capacity savings of the latter,
Article Details
All articles published in NVEO are licensed under Copyright Creative Commons Attribution-NonCommercial 4.0 International License.