GraphSAR: A Sparsity-Aware Processing-in-Memory Architecture for Large-scale Graph Processing on ReRAMs

被引:43
作者
Dai, Guohao [1 ]
Huang, Tianhao [2 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
Wawrzynek, John [3 ]
机构
[1] Tsinghua Univ, Dept EE, BNRist, Beijing, Peoples R China
[2] MIT, Cambridge, MA 02139 USA
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
来源
24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019) | 2019年
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1145/3287624.3287637
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large-scale graph processing has drawn great attention in recent years. The emerging metal-oxide resistive random access memory (ReRAM) and ReRAM crossbars have shown huge potential in accelerating graph processing. However, the sparse feature of natural graphs hinders the performance of graph processing on ReRANIs. Previous work of graph processing on ReRAMs stored and computed edges separately, leading to high energy consumption and long latency of transferring data. In this paper, we present GraphSAR, a sparsity-aware processing-in-memory large-scale graph processing accelerator on ReRAMs. Computations over edges are performed in the memory, eliminating overheads of transferring edges. Moreover, graphs are divided considering the sparsity. Subgraphs with low densities are further divided into smaller ones to minimize the waste of memory space. According to our extensive experimental results, GraphSAR achieves 4.43x energy reduction and 1.85x speedup (8.19x lower energy-delay product, EDP) against previous graph processing architecture on ReRAMs (GraphR [1]).
引用
收藏
页码:120 / 126
页数:7
相关论文
共 50 条
[31]   Optimizing Differential Computation for Large-Scale Graph Processing [J].
Sahu, Siddhartha ;
Salihoglu, Semih .
PROCEEDINGS OF THE 7TH ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS, GRADES 2024 AND NETWORK DATA ANALYTICS, NDA 2024, GRADES-NDA 2024, 2024,
[32]   Reconfigurable Processing-in-Memory Architecture for Data Intensive Applications [J].
Bavikadi, Sathwika ;
Sutradhar, Purab Ranjan ;
Ganguly, Amlan ;
Dinakarrao, Sai Manoj Pudukotai .
PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, VLSID 2024 AND 23RD INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS, ES 2024, 2024, :222-227
[33]   Block rank sparsity-aware DOA estimation with large-scale arrays in the presence of unknown mutual coupling [J].
Meng, Dandan ;
Wang, Xianpeng ;
Huang, Mengxing ;
Cao, Chunjie ;
Zhang, Kun .
DIGITAL SIGNAL PROCESSING, 2019, 94 :96-104
[34]   Survey of external memory large-scale graph processing on a multi-core system [J].
Huang, Jianqiang ;
Qin, Wei ;
Wang, Xiaoying ;
Chen, Wenguang .
JOURNAL OF SUPERCOMPUTING, 2020, 76 (01) :549-579
[35]   Survey of external memory large-scale graph processing on a multi-core system [J].
Jianqiang Huang ;
Wei Qin ;
Xiaoying Wang ;
Wenguang Chen .
The Journal of Supercomputing, 2020, 76 :549-579
[36]   Exploiting Parallelism for Convolutional Connections in Processing-In-Memory Architecture [J].
Wang, Yi ;
Zhang, Mingxu ;
Yang, Jing .
PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
[37]   Study on Processing-in-Memory Technology based on Dataflow Architecture [J].
Choi, Kyu Hyun ;
Hwang, Taeho .
2022 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2022,
[38]   A Customized Processing-in-Memory Architecture for Biological Sequence Alignment [J].
Akbari, Nasrin ;
Modarressi, Mehdi ;
Daneshtalab, Masoud ;
Loni, Mohammad .
2018 IEEE 29TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2018, :158-165
[39]   LargeGraph: An Efficient Dependency-Aware GPU-Accelerated Large-Scale Graph Processing [J].
Zhang, Yu ;
Peng, Da ;
Liao, Xiaofei ;
Jin, Hai ;
Liu, Haikun ;
Gu, Lin ;
He, Bingsheng .
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2021, 18 (04)
[40]   Implementation of a Low-Overhead Processing-in-Memory Architecture [J].
Jang, Young-Jong ;
Kim, Byung-Soo ;
Kim, Dong-Sun ;
Hwang, Tae-ho .
2016 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2016, :185-186