A 50.4 GOPs/W FPGA-Based MobileNetV2 Accelerator using the Double-Layer MAC and DSP Efficiency Enhancement

被引:5
作者
Li, Jixuan [1 ]
Chen, Jiabao [1 ]
Un, Ka-Fai [1 ]
Yu, Wei-Han [1 ]
Mak, Pui-In [1 ]
Martins, Rui P. [1 ,2 ]
机构
[1] Univ Macau, Macau, Peoples R China
[2] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
来源
IEEE ASIAN SOLID-STATE CIRCUITS CONFERENCE (A-SSCC 2021) | 2021年
关键词
D O I
10.1109/A-SSCC53895.2021.9634838
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
引用
收藏
页数:3
相关论文
共 8 条
[1]   Optimizing Hardware Accelerated General Matrix-Matrix Multiplication for CNNs on FPGAs [J].
Ahmad, Afzal ;
Pasha, Muhammad Adeel .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2020, 67 (11) :2692-2696
[2]   A CNN Accelerator on FPGA Using Depthwise Separable Convolution [J].
Bai, Lin ;
Zhao, Yiming ;
Huang, Xinming .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2018, 65 (10) :1415-1419
[3]   Xception: Deep Learning with Depthwise Separable Convolutions [J].
Chollet, Francois .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1800-1807
[4]   A High Throughput MobileNetV2 FPGA implementation based on a Flexible Architecture for Depthwise Separable Convolution [J].
Knapheide, Justin ;
Stabernack, Benno ;
Kuhnke, Maximilian .
2020 30TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2020, :277-283
[5]   MobileNetV2: Inverted Residuals and Linear Bottlenecks [J].
Sandler, Mark ;
Howard, Andrew ;
Zhu, Menglong ;
Zhmoginov, Andrey ;
Chen, Liang-Chieh .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4510-4520
[6]  
Sun W., 2020, IEEE Geosci. Remote. Sens. Lett., V19, P1
[7]   WRA: A 2.2-to-6.3 TOPS Highly Unified Dynamically Reconfigurable Accelerator Using a Novel Winograd Decomposition Algorithm for Convolutional Neural Networks [J].
Yang, Chen ;
Wang, Yizhou ;
Wang, Xiaoli ;
Geng, Li .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (09) :3480-3493
[8]   ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices [J].
Zhang, Xiangyu ;
Zhou, Xinyu ;
Lin, Mengxiao ;
Sun, Ran .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6848-6856