Efficient Implementation of 2-D Convolution on DRRA and DiMArch Architectures

被引:2
作者
Pudi, Dhilleswararao [1 ]
Ryansh, Rajeev [1 ]
Boppu, Srinivas [1 ]
Yang, Yu [2 ]
Hemani, Ahmed [2 ]
机构
[1] Indian Inst Technol, Bhubaneswar, Odisha, India
[2] Royal Inst Technol KTH, Kista, Sweden
来源
THE PROCEEDINGS OF THE 13TH INTERNATIONAL SYMPOSIUM ON HIGHLY EFFICIENT ACCELERATORS AND RECONFIGURABLE TECHNOLOGIES, HEART 2023 | 2023年
关键词
Convolution; Coarse Grain Reconfigurable Architectures; Dynamically Reconfigurable Resource Array; Distributed Memory Architecture; Machine Learning; Convolutional Neural Networks; FPGA;
D O I
10.1145/3597031.3597049
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Convolution has been widely employed in image processing and computer vision applications such as picture augmentation, smoothing, and structure extraction. In addition, convolution operations are the most prevalent computing patterns in machine learning domains. Convolutions, for example, are used in a substantial chunk of state-of-the-art convolutional neural network operations. Therefore, effectively mapping convolution operations onto hardware architectures is crucial for achieving superior performance while accelerating convolutional neural networks. In this paper, we proposed various algorithms to efficiently map the 2-D convolution operation onto a dynamically reconfigurable resource array and distributed memory architecture. Furthermore, we have discussed the mapping of 2-D convolution on the target architecture for an input matrix of arbitrary size, as well as the generalization of the proposed approaches for multi-column DRRA architectures.
引用
收藏
页码:86 / 92
页数:7
相关论文
共 24 条
[1]   Reconfigurable pipelined 2-D convolvers for fast digital signal processing [J].
Bosi, B ;
Bois, G ;
Savaria, Y .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 1999, 7 (03) :299-308
[2]  
Chen K, 2016, 2016 IEEE INTERNATIONAL SYMPOSIUM ON DEFECT AND FAULT TOLERANCE IN VLSI AND NANOTECHNOLOGY SYSTEMS (DFT), P31, DOI 10.1109/DFT.2016.7684065
[3]   A new approach for design of an efficient FPGA-based reconfigurable convolver for image processing [J].
Dehghani, Abbas ;
Kavari, Ali ;
Kalbasi, Mahdi ;
RahimiZadeh, Keyvan .
JOURNAL OF SUPERCOMPUTING, 2022, 78 (02) :2597-2615
[4]  
Jafri Syed Mohammad Asad Hassan, 2017, 2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI). Proceedings, P25, DOI 10.1109/ISVLSI.2017.15
[5]   A Fine-Grained Pipelined 2-D Convolver for High-Performance Applications [J].
Kalbasi, Mahdi ;
Nikmehr, Hooman .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (01) :146-150
[6]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[7]   Mapping Parallel FFT Algorithm onto SmartCell Coarse-Grained Reconfigurable Architecture [J].
Liang, Cao ;
Huang, Xinming .
IEICE TRANSACTIONS ON ELECTRONICS, 2010, E93C (03) :407-415
[8]  
Licciardo GD, 2016, COMPUT SCI ELECTR, P180, DOI 10.1109/CEEC.2016.7835910
[9]  
Malik O., 2011, Proceedings of the 24th International Conference on VLSI Design: concurrently with the 10th International Conference on Embedded Systems Design, P153, DOI 10.1109/VLSID.2011.54
[10]  
Nelson Anthony Edward, 2000, Implementation of image processing algorithms on FPGA hardware