An FPGA-Based Approach for Compressing and Accelerating Depthwise Separable Convolution

Cited by: 0
Authors
Yang, Ruiheng [1 ]
Chen, Zhikun [1 ]
Hu, Lingtong [1 ]
Cui, Xihang [1 ]
Guo, Yunfei [1 ]
Affiliations
[1] Hangzhou Dianzi Univ, Sch Automation, Sch Artificial Intelligence, Hangzhou 310018, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Convolution; Optimization; Throughput; Resource management; Quantization (signal); Parallel processing; Hardware acceleration; CLIP-Q; DSC; FPGA; hardware accelerator; CNN;
DOI
10.1109/LSP.2024.3425286
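Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification codes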
0808 ; 0809 ;
Abstract
The rapid progress of deep learning has increased the parameter counts and computational requirements of convolutional neural networks (CNNs), making them difficult to deploy on resource-constrained hardware platforms. Depthwise separable convolution (DSC) is one method used to tackle this issue, but it still retains numerous redundant parameters. Meanwhile, the compression learning by in-parallel pruning-quantization (CLIP-Q) method is an efficient approach to network compression, but it includes no optimization specific to DSC. This study proposes a method named DSC-CLIP-Q, which is derived from CLIP-Q and specifically addresses the parameter distribution characteristics of DSC. Furthermore, a highly energy-efficient, reconfigurable hardware accelerator is developed for this method. Additional storage optimizations tailored to the hardware features of DSC-CLIP-Q are introduced, together with a reconfigurable processing element (PE) array designed for the convolutional characteristics of DSC. The experimental results indicate that the proposed DSC accelerator attains high throughput and energy efficiency while also improving network accuracy.
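As a rough illustration of why DSC is used to shrink CNNs, the sketch below compares weight counts for a standard convolution and a depthwise separable one. It uses the textbook counting formulas with bias terms ignored; the function names and the layer sizes are hypothetical and are not taken from the paper.

```python
def standard_conv_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weights in a standard kernel x kernel convolution (bias ignored)."""
    return kernel * kernel * c_in * c_out


def dsc_params(kernel: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise separable convolution (bias ignored)."""
    depthwise = kernel * kernel * c_in  # one kernel x kernel filter per input channel
    pointwise = c_in * c_out            # 1x1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    # Hypothetical layer: 3x3 kernel, 32 input channels, 64 output channels.
    k, c_in, c_out = 3, 32, 64
    std = standard_conv_params(k, c_in, c_out)
    dsc = dsc_params(k, c_in, c_out)
    print(f"standard: {std}, DSC: {dsc}, ratio: {dsc / std:.4f}")
```

The ratio equals 1/c_out + 1/kernel^2, so for 3x3 kernels DSC needs roughly one ninth of the weights when the output channel count is large. The abstract's point is that redundancy remains even after this reduction.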
Pages: 2590-2594
Number of pages: 5