KVRangeDB: Range Queries for a Hash-based Key-Value Device Academic Article

abstract

  • Key-value (KV) software has proven useful to a wide variety of applications, including analytics, time-series databases, and distributed file systems. To satisfy the requirements of diverse workloads, KV stores have been carefully tailored to best match the performance characteristics of underlying solid-state block devices. Emerging KV storage devices are a promising technology for both simplifying the KV software stack and improving the performance of persistent storage-based applications. However, while providing fast, predictable put and get operations, existing KV storage devices do not natively support range queries, which are critical to all three types of applications described above. In this article, we present KVRangeDB, a software layer that enables processing range queries for existing hash-based KV solid-state disks (KVSSDs). As an effort to adapt to the performance characteristics of emerging KVSSDs, KVRangeDB implements a log-structured merge tree key index that reduces compaction I/O, merges keys when possible, and provides separate caches for indexes and values. We evaluated KVRangeDB under a set of representative workloads and compared its performance with two existing database solutions: a RocksDB variant ported to work with the KVSSD, and WiscKey, a key-value database that is carefully tuned for conventional block devices. On filesystem aging workloads, KVRangeDB outperforms WiscKey by 23.7× in terms of throughput and reduces CPU usage and external write amplification by 14.3× and 9.8×, respectively.
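
The core idea in the abstract, layering an ordered key index over a device that only offers hashed put and get, can be illustrated with a minimal sketch. This is not the authors' implementation: `HashKVDevice` and `RangeLayer` are hypothetical names, the device is simulated with an in-memory dictionary, and a sorted in-memory list searched with `bisect` stands in for the log-structured merge tree key index that KVRangeDB uses.

```python
import bisect


class HashKVDevice:
    """Hypothetical stand-in for a hash-based KV device: point operations only."""

    def __init__(self):
        self._slots = {}  # a dict models hashed placement; no key ordering is exposed

    def put(self, key: bytes, value: bytes) -> None:
        self._slots[key] = value

    def get(self, key: bytes):
        return self._slots.get(key)

    def delete(self, key: bytes) -> None:
        self._slots.pop(key, None)


class RangeLayer:
    """Software layer that adds range scans by keeping its own sorted key index.

    Assumption: a sorted list replaces the LSM-tree key index described in the
    abstract; values stay on the device and only keys are indexed.
    """

    def __init__(self, device: HashKVDevice):
        self._device = device
        self._keys = []  # sorted list of all live keys

    def put(self, key: bytes, value: bytes) -> None:
        self._device.put(key, value)
        i = bisect.bisect_left(self._keys, key)
        # Insert the key into the index only if it is not already present.
        if i == len(self._keys) or self._keys[i] != key:
            self._keys.insert(i, key)

    def get(self, key: bytes):
        # Point lookups bypass the index and go straight to the device.
        return self._device.get(key)

    def delete(self, key: bytes) -> None:
        i = bisect.bisect_left(self._keys, key)
        if i < len(self._keys) and self._keys[i] == key:
            del self._keys[i]
            self._device.delete(key)

    def scan(self, start: bytes, end: bytes):
        """Yield (key, value) pairs with start <= key < end, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, end)
        for key in self._keys[lo:hi]:
            yield key, self._device.get(key)


db = RangeLayer(HashKVDevice())
for k in (b"b", b"a", b"d", b"c"):
    db.put(k, b"value-" + k)
result = list(db.scan(b"b", b"d"))
```

In this sketch a scan resolves the key range in the index and then issues one device get per key, which is why a real design would care about caching index blocks and values separately, as the abstract notes.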

published proceedings

  • ACM TRANSACTIONS ON STORAGE

author list (cited authors)

  • Qin, M., Zheng, Q., Lee, J., Settlemyer, B., Wen, F., Reddy, N., & Gratz, P.

citation count

  • 0

complete list of authors

  • Qin, Mian; Zheng, Qing; Lee, Jason; Settlemyer, Bradley; Wen, Fei; Reddy, Narasimha; Gratz, Paul

publication date

  • August 2023