FROZENHOT Cache: Rethinking Cache Management for Modern Hardware

被引:9
作者
Qiu, Ziyue [1 ,2 ,4 ]
Yang, Juncheng
Zhang, Juncheng [1 ]
Li, Cheng [1 ,3 ]
Ma, Xiaosong [5 ]
Chen, Qi [2 ]
Yang, Mao [2 ]
Xu, Yinlong [1 ,3 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res, Hefei, Peoples R China
[3] Anhui Prov Key Lab High Performance Comp, Hefei, Peoples R China
[4] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[5] HBKU, Qatar Comp Res Inst, Doha, Qatar
来源
PROCEEDINGS OF THE EIGHTEENTH EUROPEAN CONFERENCE ON COMPUTER SYSTEMS, EUROSYS 2023 | 2023年
基金
中国国家自然科学基金;
关键词
Caching; Storage; Concurrency Algorithm; Performance Scalability;
D O I
10.1145/3552326.3587446
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Caching is crucial for accelerating data access, employed as a ubiquitous design in modern systems at many parts of computer systems. With increasing core count, and shrinking latency gap between cache and modern storage devices, hit-path scalability becomes increasingly critical. However, existing production in-memory caches often use list-based management with promotion on each cache hit, which requires extensive locking and poses a significant overhead for scaling beyond a fewcores. Moreover, existing techniques for improving scalability either (1) only focus on the indexing structure and do not improve cache management scalability, or (2) sacrifice efficiency or miss-path scalability. Inspired by highly skewed data popularity and short-term hotspot stability in cache workloads, we propose Frozen-Hot, a generic approach to improve the scalability of list-based caches. FROZENHOT partitions the cache space into two parts: a frozen cache and a dynamic cache. The frozen cache serves requests for hot objects with minimal latency by eliminating promotion and locking, while the latter leverages the existing cache design to achieve workload adaptivity. We built FROZENHOT as a library that can be easily integrated into existing systems. We demonstrate its performance by enabling FROZENHOT in two production systems: HHVM and RocksDB using under 100 lines of code. Evaluated using production traces from MSR and Twitter, FROZENHOT improves the throughput of three baseline cache algorithms by up to 551%. Compared to stock RocksDB, FROZENHOT-enhanced RocksDB shows a higher throughput on all YCSB workloads with up to 90% increase, as well as reduced tail latency.
引用
收藏
页码:557 / 573
页数:17
相关论文
共 81 条
[1]  
[Anonymous], 2022, AWS Introduces Storage-Optimized I4i Instances for IO-Heavy Workloads. I4i/
[2]  
[Anonymous], OBJECT ORIENTED PROG
[3]   Concurrent Updates with RCU: Search Tree as an Example [J].
Arbel, Maya ;
Attiya, Hagit .
PROCEEDINGS OF THE 2014 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING (PODC'14), 2014, :196-205
[4]  
Atikoglu Berk, 2012, Performance Evaluation Review, V40, P53, DOI 10.1145/2318857.2254766
[5]  
Badam Anirudh, 2009, NSDI, V9, P123
[6]  
bcache, About us
[7]  
Beckmann N, 2018, PROCEEDINGS OF THE 15TH USENIX SYMPOSIUM ON NETWORKED SYSTEMS DESIGN AND IMPLEMENTATION (NSDI'18), P389
[8]  
Berezecki Mateusz, 2011, 2011 INT GREEN COMP, P1
[9]  
Berg B, 2020, PROCEEDINGS OF THE 14TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDI '20), P769
[10]  
Bronson Nathan, 2013, Proceedings of USENIX ATC '13: 2013 USENIX Annual Technical Conference. ATC '13, P49