Distributed Storage Evaluation on a Three-Wide Inter-Data Center Deployment Conference Paper uri icon

abstract

  • The demand for cloud storage is exploding as an ever increasing number of enterprises and consumers are storing and processing their data in the cloud. Hence, distributed object storage solutions (e.g., QFS, Swift, HDFS) are becoming very critical components of any cloud infrastructure. These systems are able to offer good reliability by distributing redundant information across a large number of commodity servers, making it possible to achieve 10 nines and beyond with relative ease. One drawback of these systems is that they are usually designed for deployment within a single data center, where node-to-node latencies are small. Geo-replication (i.e., distributing redundant information across data centers) for most open-source storage systems is, to the best of our knowledge, accomplished by asynchronously mirroring a given deployment. Given that geo-replication is critical for ensuring very high degrees of reliability (e.g., for achieving 16 nines), in this work we evaluate how these storage systems perform when they are directly deployed in a WAN setting. To this end, three popular distributed object stores, namely Quantcast-QFS, Swift and Tahoe-LAFS, are considered and tested in a three-wide data center environment and our findings are reported. 2013 IEEE.

name of conference

  • 2013 IEEE International Conference on Big Data

published proceedings

  • 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA

author list (cited authors)

  • Chen, Y., Daniels, S., Hadjieleftheriou, M., Liu, P., Tian, C., & Vaishampayan, V.

citation count

  • 5

complete list of authors

  • Chen, Yih-Farn||Daniels, Scott||Hadjieleftheriou, Marios||Liu, Pingkai||Tian, Chao||Vaishampayan, Vinay

publication date

  • October 2013