DIF-LUT: A Simple Yet Scalable Approximation for Non-linear Activation Function on FPGA

Cited by: 0
Authors
Liu, Yang [1]
He, Xiaoming [1]
Yu, Jun [1]
Wang, Kun [1]
Affiliations
[1] Fudan Univ, State Key Lab ASIC & Syst, Shanghai, Peoples R China
Source
2023 33RD INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, FPL | 2023
Keywords
Non-linear Approximation; Activation Function; Neural Network; Look-up Table; FPGA; OPU;
DOI
10.1109/FPL60245.2023.00055
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Non-linear activation functions play an essential role in the generalization ability of neural networks (NNs). However, deploying their intricate mathematical operations on hardware platforms such as Field-Programmable Gate Arrays (FPGAs) is a great challenge. Prior works based on piecewise functions or look-up tables (LUTs) either involve complex manual operations or neglect hardware overhead. To this end, this paper proposes a simple yet scalable and effective approximation called DIF-LUT, which is applicable to various non-linear functions. Specifically, the proposed method achieves accurate approximation by using piecewise linear matching to roughly fit the function derivative and a range-addressable LUT to offset the remaining difference. Moreover, self-adaptive mechanisms automatically minimize hardware cost for different accuracy requirements. Experiments show that, compared to state-of-the-art methods, DIF-LUT costs 43.68% fewer LUTs and 70.8% fewer flip-flops (FFs) and uses no digital signal processor (DSP), while achieving 2.7x approximation accuracy at 554.1 MHz on Xilinx Zynq UltraScale+.
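The two-stage idea described in the abstract (a coarse piecewise-linear fit plus a table that stores the remaining difference) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It fits chords to the function itself rather than matching the derivative, uses a uniformly addressed table rather than a range-addressable one, works in floating point, and has no self-adaptive sizing. All names and sizes (`N_SEGMENTS`, `N_ENTRIES`, the input range of -8 to 8, the helper functions) are assumptions chosen for illustration only.

```python
import math

# All constants below are hypothetical choices, not values from the paper.
X_MIN, X_MAX = -8.0, 8.0   # assumed input range
N_SEGMENTS = 8             # number of coarse linear segments
N_ENTRIES = 256            # number of difference-table entries


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def bin_index(x, n_bins):
    """Map x to one of n_bins equal-width bins over [X_MIN, X_MAX]."""
    k = int((x - X_MIN) / (X_MAX - X_MIN) * n_bins)
    return min(max(k, 0), n_bins - 1)


def build_segments(f, n_segments):
    """Coarse fit: one chord (start x, start y, slope) per segment."""
    width = (X_MAX - X_MIN) / n_segments
    segs = []
    for i in range(n_segments):
        x0 = X_MIN + i * width
        slope = (f(x0 + width) - f(x0)) / width
        segs.append((x0, f(x0), slope))
    return segs


def pwl(segs, x):
    """Evaluate the coarse piecewise-linear fit at x."""
    x0, y0, slope = segs[bin_index(x, len(segs))]
    return y0 + slope * (x - x0)


def build_diff_table(f, segs, n_entries):
    """Store f(x) - pwl(x), sampled at the centre of each table bin."""
    step = (X_MAX - X_MIN) / n_entries
    table = []
    for k in range(n_entries):
        xc = X_MIN + (k + 0.5) * step
        table.append(f(xc) - pwl(segs, xc))
    return table


def approx(segs, table, x):
    """Coarse linear value plus the stored difference for x's bin."""
    x = min(max(x, X_MIN), X_MAX)  # saturate out-of-range inputs
    return pwl(segs, x) + table[bin_index(x, len(table))]


def max_abs_error(fn, f, n=4000):
    """Largest |fn(x) - f(x)| over n evenly spaced points in the range."""
    step = (X_MAX - X_MIN) / n
    return max(abs(fn(X_MIN + i * step) - f(X_MIN + i * step)) for i in range(n))


segs = build_segments(sigmoid, N_SEGMENTS)
table = build_diff_table(sigmoid, segs, N_ENTRIES)
err_pwl = max_abs_error(lambda x: pwl(segs, x), sigmoid)
err_dif = max_abs_error(lambda x: approx(segs, table, x), sigmoid)
print("piecewise-linear only, max error:", err_pwl)
print("with difference table, max error:", err_dif)
```

In this sketch the table only has to hold small residuals rather than full function values, which is the general reason a difference table can use narrower entries than a direct look-up table. The actual savings reported in the abstract depend on the authors' fixed-point design, which is not reproduced here.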
Pages: 322-326
Page count: 5