ParaBit: Processing Parallel Bitwise Operations in NAND Flash Memory based SSDs

被引:13
作者
Gao, Congming [1 ]
Xin, Xin [2 ]
Lu, Youyou [1 ]
Zhang, Youtao [3 ]
Yang, Jun [2 ]
Shu, Jiwu [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, BNRist, Beijing, Peoples R China
[2] Univ Pittsburgh, Dept Elect & Comp Engn, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
来源
PROCEEDINGS OF 54TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, MICRO 2021 | 2021年
基金
中国国家自然科学基金; 美国国家科学基金会; 中国博士后科学基金;
关键词
flash memory; in-storage computing; near data processing; bitwise operation;
D O I
10.1145/3466752.3480078
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Processing-in-memory (PIM) and in-storage-computing (ISC) architectures have been constructed to implement computation inside memory and near storage, respectively. While effectively mitigating the overhead of data movement from memory and storage to the processor, due to the limited bandwidth of existing systems, these architectures still suffer from the large data movement overhead between storage and memory, in particular, if the amount of required data is large. It has become a major constraint for further improving the computation efficiency in PIM and ISC architectures. In this paper, we propose ParaBit, a scheme that enables Parallel Bitwise operations in NAND flash storage where data reside. By adjusting the latching circuit control and the sequence of sensing operations, ParaBit enables in-flash bitwise operation with no or little extra hardware, which effectively reduces the overhead of data movement between storage and memory. We exploit the massive parallelism in NAND flash based SSDs to mitigate the long latency of flash operations. Our experimental results show that the proposed ParaBit design achieves significant performance improvements over the state-of-the-art PIM and ISC architectures.
引用
收藏
页码:59 / 70
页数:12
相关论文
共 60 条
[1]  
Agrawal Nitin, 2008, ATC USENIX
[2]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[3]   Data Reorganization in Memory Using 3D-stacked DRAM [J].
Akin, Berkin ;
Franchetti, Franz ;
Hoe, James C. .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :131-143
[4]  
[Anonymous], 2004, CF
[5]  
[Anonymous], 2011, ADMS
[6]  
Asghari-Moghaddam H, 2016, INT SYMP MICROARCH
[7]   'Memristive' switches enable 'stateful' logic operations via material implication [J].
Borghetti, Julien ;
Snider, Gregory S. ;
Kuekes, Philip J. ;
Yang, J. Joshua ;
Stewart, Duncan R. ;
Williams, R. Stanley .
NATURE, 2010, 464 (7290) :873-876
[8]  
Bruce J, 2000, 2000 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2000), VOLS 1-3, PROCEEDINGS, P2061, DOI 10.1109/IROS.2000.895274
[9]   Error Characterization, Mitigation, and Recovery in Flash-Memory-Based Solid-State Drives [J].
Cai, Yu ;
Ghose, Saugata ;
Haratsch, Erich F. ;
Luo, Yixin ;
Mutlu, Onur .
PROCEEDINGS OF THE IEEE, 2017, 105 (09) :1666-1704
[10]  
Chee-Yong Chan, 1998, SIGMOD Record, V27, P355, DOI 10.1145/276305.276336