Autonomous Binarized Focal Loss Enhanced Model Compression Design Using Tensor Train Decomposition

被引：0

作者：

Liu, Mingshuo ^{[1
]}

Luo, Shiyi ^{[1
]}

Han, Kevin ^{[1
]}

DeMara, Ronald F. ^{[2
]}

Bai, Yu ^{[1
]}

机构：

[1] Calif State Univ Fullerton, Coll Engn & Comp Sci, Elect & Comp Engn Dept, 800 N State Coll Blvd, Fullerton, CA 92831 USA

[2] Univ Cent Florida, Coll Engn & Comp Sci, Dept Elect & Comp Engn, 4000 Cent Florida Blvd, Orlando, FL 32816 USA

来源：

MICROMACHINES | 2022年 / 13卷 / 10期

关键词：

tensor decomposition; focal loss; embedded hardware;

D O I：

10.3390/mi13101738

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Deep learning methods have exhibited the great capacity to process object detection tasks, offering a practical and viable approach in many applications. When researchers have advanced deep learning models to improve their performance, the model derived from the algorithmic improvement may itself require complementary increases in computational and power demands. Recently, model compression and pruning techniques have received more attention to promote the wide employment of the DNN model. Although these techniques have achieved a remarkable performance, the class imbalance issue during the mode compression process does not vanish. This paper exploits the Autonomous Binarized Focal Loss Enhanced Model Compression (ABFLMC) model to address the issue. Additionally, our proposed ABFLMC can automatically receive the dynamic difficulty term during the training process to improve performance and reduce complexity. A novel hardware architecture is proposed to accelerate inference. Our experimental results show that the ABFLMC can achieve higher accuracy, faster speed, and smaller model size.

引用

页数：14

共 50 条

[1] Bochkovskiy A., 2020, ARXIV 200410934
[2] Cai Y., 2020, arXiv
[3] Comon P, 2009, Arxiv, DOI arXiv:0905.0454
[4] Dai JF, 2016, ADV NEUR IN, V29
[5] Histograms of oriented gradients for human detection
Dalal, N
Triggs, B
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
[6] A multilinear singular value decomposition
De Lathauwer, L
De Moor, B
Vandewalle, J
[J]. SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2000, 21 (04) : 1253 - 1278
[7] TIE: Energy-efficient Tensor Train-based Inference Engine for Deep Neural Network
Deng, Chunhua
Sun, Fangxuan
Qian, Xuehai
Lin, Jun
Wang, Zhongfeng
Yuan, Bo
[J]. PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 264 - 277
[8] REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Ding, Caiwen
Wang, Shuo
Liu, Ning
Xu, Kaidi
Wang, Yanzhi
Liang, Yun
[J]. PROCEEDINGS OF THE 2019 ACM/SIGDA INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE GATE ARRAYS (FPGA'19), 2019, : 33 - 42
[9] CenterNet: Keypoint Triplets for Object Detection
Duan, Kaiwen
Bai, Song
Xie, Lingxi
Qi, Honggang
Huang, Qingming
Tian, Qi
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6568 - 6577
[10] Ermis B., 2014, arXiv

← 1 2 3 4 5 →