ShuntFlowPlus: An Efficient and Scalable Dataflow Accelerator Architecture for Stream Applications

被引:0
|
作者
Gong, Shijun [1 ,2 ]
Li, Jiajun [1 ,2 ]
Lu, Wenyan [1 ,2 ]
Yan, Guihai [1 ,2 ]
Li, Xiaowei [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Zhongguancun 6 Kexueyuan South Rd, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Zhongguancun 6 Kexueyuan South Rd, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Streaming processing; sliding-window aggregations; dataflow; buffer sharing;
D O I
10.1145/3453164
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Streaming processing is an important and growing class of applications for analyzing continuous streams in real time. In such applications, sliding-window aggregation (SWAG) is a widely used approach, and generalpurpose processors cannot efficiently handle SWAG because of the specific computation patterns. This article proposes an efficient dataflow accelerator architecture for ubiquitous SWAGs, called ShuntFlowPlus. ShuntFlowPlus supports two main categories of SWAGs that are widely used in streaming processing. Meanwhile, we propose a shunt rule to enable ShuntFlowPlus to efficiently handle SWAGs with arbitrary parameters. Furthermore, we propose a novel realization scheme of SWAG kernels based on buffer sharing to maximize buffer utilization. As a case study, we implemented ShuntFlowPlus on an Altera Arria 10 AX115N FPGA board at 150 MHz and compared it to previous approaches. The experimental results show that ShuntFlowPlus provides a tremendous throughput and latency advantage over CPU and GPU implementations on both reducelike and index-like SWAGs. Compare to ShuntFlow, 41% of buffer resources are saved.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] ShuntFlow: An Efficient and Scalable Dataflow Accelerator Architecture for Streaming Applications
    Gong, Shijun
    Li, Jiajun
    Lu, Wenyan
    Yan, Guihai
    Li, Xiaowei
    PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,
  • [2] An efficient dataflow accelerator for scientific applications
    Ye, Xiaochun
    Tan, Xu
    Wu, Meng
    Feng, Yujing
    Wang, Da
    Zhang, Hao
    Pei, Songwen
    Fan, Dongrui
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 112 (112): : 580 - 588
  • [3] Scalable and Parameterized Architecture for Efficient Stream Mining
    Zhang, Li
    Li, Dawei
    Zou, Xuecheng
    Hu, Yu
    Xu, Xiaowei
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2018, E101A (01): : 219 - 231
  • [4] An Energy-Efficient Accelerator Architecture with Serial Accumulation Dataflow for Deep CNNs
    Ahmadi, Mehdi
    Vakili, Shervin
    Langlois, J. M. Pierre
    2020 18TH IEEE INTERNATIONAL NEW CIRCUITS AND SYSTEMS CONFERENCE (NEWCAS'20), 2020, : 214 - 217
  • [5] A General-Purpose Many-Accelerator Architecture Based on Dataflow Graph Clustering of Applications
    陈鹏
    张磊
    韩银和
    陈云霁
    Journal of Computer Science & Technology, 2014, 29 (02) : 239 - 246
  • [6] A General-Purpose Many-Accelerator Architecture Based on Dataflow Graph Clustering of Applications
    Chen, Peng
    Zhang, Lei
    Han, Yin-He
    Chen, Yun-Ji
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2014, 29 (02) : 239 - 246
  • [7] A General-Purpose Many-Accelerator Architecture Based on Dataflow Graph Clustering of Applications
    Peng Chen
    Lei Zhang
    Yin-He Han
    Yun-Ji Chen
    Journal of Computer Science and Technology, 2014, 29 : 239 - 246
  • [8] A Scalable Pipelined Dataflow Accelerator for Object Region Proposals on FPGA Platform
    Fu, Wenzhi
    Yang, Jianlei
    Dai, Pengcheng
    Chen, Yiran
    Zhao, Weisheng
    2018 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT 2018), 2018, : 349 - 352
  • [9] A Scalable Dataflow Accelerator for Real Time Onboard Hyperspectral Image Classification
    Wang, Shaojun
    Niu, Xinyu
    Ma, Ning
    Luk, Wayne
    Leong, Philip
    Peng, Yu
    APPLIED RECONFIGURABLE COMPUTING, ARC 2016, 2016, : 105 - 116
  • [10] Energy and Bandwidth Efficient Sparse Programmable Dataflow Accelerator
    Schneider, Felix
    Karagounis, Michael
    Choubey, Bhaskar
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (09) : 4092 - 4105