Autonomous Binarized Focal Loss Enhanced Model Compression Design Using Tensor Train Decomposition

被引:0
作者
Liu, Mingshuo [1 ]
Luo, Shiyi [1 ]
Han, Kevin [1 ]
DeMara, Ronald F. [2 ]
Bai, Yu [1 ]
机构
[1] Calif State Univ Fullerton, Coll Engn & Comp Sci, Elect & Comp Engn Dept, 800 N State Coll Blvd, Fullerton, CA 92831 USA
[2] Univ Cent Florida, Coll Engn & Comp Sci, Dept Elect & Comp Engn, 4000 Cent Florida Blvd, Orlando, FL 32816 USA
关键词
tensor decomposition; focal loss; embedded hardware;
D O I
10.3390/mi13101738
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Deep learning methods have exhibited the great capacity to process object detection tasks, offering a practical and viable approach in many applications. When researchers have advanced deep learning models to improve their performance, the model derived from the algorithmic improvement may itself require complementary increases in computational and power demands. Recently, model compression and pruning techniques have received more attention to promote the wide employment of the DNN model. Although these techniques have achieved a remarkable performance, the class imbalance issue during the mode compression process does not vanish. This paper exploits the Autonomous Binarized Focal Loss Enhanced Model Compression (ABFLMC) model to address the issue. Additionally, our proposed ABFLMC can automatically receive the dynamic difficulty term during the training process to improve performance and reduce complexity. A novel hardware architecture is proposed to accelerate inference. Our experimental results show that the ABFLMC can achieve higher accuracy, faster speed, and smaller model size.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Bochkovskiy A., 2020, ARXIV 200410934
  • [2] Cai Y., 2020, arXiv
  • [3] Comon P, 2009, Arxiv, DOI arXiv:0905.0454
  • [4] Dai JF, 2016, ADV NEUR IN, V29
  • [5] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [6] A multilinear singular value decomposition
    De Lathauwer, L
    De Moor, B
    Vandewalle, J
    [J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2000, 21 (04) : 1253 - 1278
  • [7] TIE: Energy-efficient Tensor Train-based Inference Engine for Deep Neural Network
    Deng, Chunhua
    Sun, Fangxuan
    Qian, Xuehai
    Lin, Jun
    Wang, Zhongfeng
    Yuan, Bo
    [J]. PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 264 - 277
  • [8] REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
    Ding, Caiwen
    Wang, Shuo
    Liu, Ning
    Xu, Kaidi
    Wang, Yanzhi
    Liang, Yun
    [J]. PROCEEDINGS OF THE 2019 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'19), 2019, : 33 - 42
  • [9] CenterNet: Keypoint Triplets for Object Detection
    Duan, Kaiwen
    Bai, Song
    Xie, Lingxi
    Qi, Honggang
    Huang, Qingming
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6568 - 6577
  • [10] Ermis B., 2014, arXiv