Live Demonstration: FPGA-based CNN Accelerator with Filter-Wise-Optimized Bit Precision

被引:0
|
作者
Nakata, Kengo [1 ]
Maki, Asuka [1 ]
Miyashita, Daisuke [1 ]
Tachibana, Fumihiko [1 ]
Suzuki, Tomoya [1 ]
Deguchi, Jun [1 ]
机构
[1] Toshiba Memory Corp, Kawasaki, Kanagawa, Japan
来源
2019 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS) | 2019年
关键词
deep learning; convolutional neural network; quantization; variable bit width; FPGA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
To enhance the efficiency for inference of deep convolutional neural network without noticeable degradation of the recognition accuracy, we have proposed a filter-wise optimized quantization with variable bit precision. In addition, we have proposed the hardware architecture that fully supports the quantized variable bit precision. By using this hardware architecture, the execution time is reduced proportionally to the reduced bit precision by the filter-wise quantization. In the demonstration, we present image classification experiments on an FPGA where the proposed architecture is implemented with less execution time than the fixed 16-bit precision model.
引用
收藏
页数:1
相关论文
共 23 条
  • [1] FPGA-based CNN Processor with Filter-Wise-Optimized Bit Precision
    Maki, Asuka
    Miyashita, Daisuke
    Nakata, Kengo
    Tachibana, Fumihiko
    Suzuki, Tomoya
    Deguchi, Jun
    2018 IEEE ASIAN SOLID-STATE CIRCUITS CONFERENCE (A-SSCC): PROCEEDINGS OF TECHNICAL PAPERS, 2018, : 47 - 50
  • [2] Optimized FPGA-based Deep Learning Accelerator for Sparse CNN using High Bandwidth Memory
    Jiang, Chao
    Ojika, David
    Patel, Bhavesh
    Lam, Herman
    2021 IEEE 29TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM 2021), 2021, : 157 - 164
  • [3] SparkNoC: An energy-efficiency FPGA-based accelerator using optimized lightweight CNN for edge computing
    Xia, Ming
    Huang, Zunkai
    Tian, Li
    Wang, Hui
    Chang, Victor
    Zhu, Yongxin
    Feng, Songlin
    JOURNAL OF SYSTEMS ARCHITECTURE, 2021, 115
  • [4] Implementation of Data-optimized FPGA-based Accelerator for Convolutional Neural Network
    Cho, Mannhee
    Kim, Youngmin
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [5] An FPGA-Based Reconfigurable CNN Training Accelerator Using Decomposable Winograd
    Wang, Hui
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    2023 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, ISVLSI, 2023, : 175 - 180
  • [6] An FPGA-Based Reconfigurable Accelerator for Low-Bit DNN Training
    Shao, Haikuo
    Lu, Jinming
    Lin, Jun
    Wang, Zhongfeng
    2021 IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2021), 2021, : 254 - 259
  • [7] Optimizing FPGA-Based CNN Accelerator Using Differentiable Neural Architecture Search
    Fan, Hongxiang
    Ferianc, Martin
    Liu, Shuanglong
    Que, Zhiqiang
    Niu, Xinyu
    Luk, Wayne
    2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 465 - 468
  • [8] Advantages and limitations of fully on-chip CNN FPGA-based hardware accelerator
    Dinelli, Gianmarco
    Meoni, Gabriele
    Rapuano, Emilio
    Fanucci, Luca
    2020 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2020,
  • [9] An Efficient FPGA-Based Accelerator for Perceptual Weighting Filter in Speech Coding
    Singh, Dilip
    Chandel, Rajeevan
    IETE TECHNICAL REVIEW, 2024, 41 (04) : 441 - 453
  • [10] Optimizing FPGA-based CNN accelerator for energy efficiency with an extended Roofline model
    Ayat, Sayed Omid
    Khalil-Hani, Mohamed
    Ab Rahman, Ab Al-Hadi
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2018, 26 (02) : 919 - 935