A Latency-Optimized and Energy-Efficient Write Scheme in NVM-Based Main Memory

被引:7
作者
Guo, Yuncheng [1 ]
Hua, Yu [1 ]
Zuo, Pengfei [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Nonvolatile memory; Phase change materials; Random access memory; Energy consumption; Resistance; Memory management; Scalability; Compression; encoder; frequent pattern; nonvolatile memory (NVM); PHASE-CHANGE MEMORY; RANDOM-ACCESS MEMORY; ARCHITECTURE; COMPRESSION;
D O I
10.1109/TCAD.2018.2883981
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Nonvolatile memory (NVM) technologies are promising candidates as the next-generation main memory due to high scalability and low energy consumption. However, the performance bottlenecks, such as high write latency and low cell endurance, still exist in NVMs. To address these problems, frequent pattern compression schemes have been widely used, which however suffer from the lack of flexibility and adaptability. In order to overcome these shortcomings, we propose a well-adaptive NVM write scheme, called dynamic frequent pattern compression (DFPC), to significantly reduce the amount of write units and extend the lifetime. Instead of only using static frequent patterns in existing FPC schemes, which are predefined and not always efficient for all applications, the idea behind DFPC is to exploit the characteristics of data distribution in execution to obtain dynamic patterns, which often appear in the real-world applications. To further improve the compression ratio, we exploit the value locality in a cache line to extend the granularity of dynamic patterns. Hence, DFPC can encode the contents of cache lines with more kinds of frequent data patterns. Moreover, to further support efficient write and read operations in the context of multilevel/triple-level cell NVMs, we need to extend the DFPC to improve performance in terms of the access latency and energy consumption. We hence propose a latency-optimized and energy-efficient compression write scheme to encode the compressed data with low energy and latency states, i.e., enhanced DFPC, thus reducing the latency and energy consumption. We implement DFPC in GEM5 with NVMain and execute the applications from SPEC CPU2006 to evaluate our scheme. Experimental results demonstrate the efficacy and efficiency of DFPC. We have released the source codes for public use at Github https://github.com/dfpcscheme/DFPCScheme.
引用
收藏
页码:62 / 74
页数:13
相关论文
共 52 条
[1]  
Alameldeen Alaa R, 2004, 1500 U WISC MAD DEP
[2]   HyComp: A Hybrid Cache Compression Method for Selection of Data-Type-Specific Compression Methods [J].
Arelakis, Angelos ;
Dahlgren, Fredrik ;
Stenstrom, Per .
PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48), 2015, :38-49
[3]   Silent Shredder: Zero-Cost Shredding for Secure Non-Volatile Main Memory Controllers [J].
Awad, Amro ;
Manadhata, Pratyusa ;
Haber, Stuart ;
Solihin, Yan ;
Horne, William .
ACM SIGPLAN NOTICES, 2016, 51 (04) :263-276
[4]   A Bipolar-Selected Phase Change Memory Featuring Multi-Level Cell Storage [J].
Bedeschi, Ferdinando ;
Fackenthal, Rich ;
Resta, Claudio ;
Donze, Enzo Michele ;
Jagasivamani, Meenatchi ;
Buda, Egidio Cassiodoro ;
Pellizzer, Fabio ;
Chow, David W. ;
Cabrini, Alessandro ;
Calvi, Giacomo Matteo Angelo ;
Faravelli, Roberto ;
Fantini, Andrea ;
Torelli, Guido ;
Mills, Duane ;
Gastaldi, Roberto ;
Casagrande, Giulio .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2009, 44 (01) :217-227
[5]  
Binkert Nathan, 2011, Computer Architecture News, V39, P1, DOI 10.1145/2024716.2024718
[6]  
Dgien DB, 2014, IEEE INT SYMP NANO, P51, DOI 10.1109/NANOARCH.2014.6880482
[7]   Delta-Compressed Caching for Overcoming the Write Bandwidth Limitation of Hybrid Main Memory [J].
Du, Yu ;
Zhou, Miao ;
Childers, Bruce ;
Melhem, Rami ;
Mosse, Daniel .
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2013, 9 (04)
[8]   A robust main-memory compression scheme [J].
Ekman, M ;
Stenstrom, P .
32ND INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2005, :74-85
[9]  
Guo YC, 2018, DES AUT TEST EUROPE, P1622, DOI 10.23919/DATE.2018.8342274
[10]  
Hay A, 2011, INT SYMP MICROARCH, P186