TempDiff: Temporal Difference-Based Feature Map-Level Sparsity Induction in CNNs with <4% Memory Overhead

Cited by: 3
Authors
De Alwis, Udari [1 ]
Alioto, Massimo [1 ]
Affiliations
[1] Natl Univ Singapore, ECE Dept, Singapore, Singapore
Source
2021 IEEE 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS) | 2021
Funding
National Research Foundation, Singapore;
Keywords
Object detection; deep neural networks; computational efficiency; Internet of Things; inference;
DOI
10.1109/AICAS51828.2021.9458463
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The diffusion of vision sensor nodes in a wide range of applications has given rise to higher computational demand at the edge of the Internet of Things (IoT). Indeed, in-node video sense-making has become essential in the form of high-level tasks such as object detection for visual monitoring, mitigating data deluge from the wireless network to the cloud storage level. In such applications, deep neural networks are well known to be a prime choice, in view of their performance and flexibility. However, such properties come at the cost of high computational requirements at inference time, which directly hamper power efficiency, lifetime and cost of self-powered edge devices. In this paper, a computationally-efficient inference technique is introduced to perform the ubiquitously required task of bounding box-based object detection. The proposed method leverages the correlation among frames in the temporal dimension, uniquely requires only minor memory overhead for intermediate feature map storage and minor architectural changes, and does not require any retraining for immediate deployment in existing vision frameworks. The proposed method achieves 18.3% (35.8%) computation reduction at 3.3% (3.2%) memory overhead, and 3.8% (6.8%) accuracy drop in the YOLOv1 (VGG16) and SSD (VGG16) neural networks under the CAMEL dataset.
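The abstract does not spell out the mechanism, so the following is only an illustrative sketch of the general idea behind temporal-difference sparsity, not the authors' implementation. It assumes a single linear 1D convolution, where the output for the current frame can be obtained from the stored previous output plus the convolution of the thresholded frame-to-frame difference. The function names, the toy signals, and the threshold value are all hypothetical.

```python
def conv1d(signal, kernel):
    """Valid-mode 1D correlation of a list of floats with a kernel."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]


def sparsify_delta(current, previous, threshold):
    """Temporal difference between two frames; small changes are set to zero."""
    return [d if abs(d) > threshold else 0.0
            for d in (c - p for c, p in zip(current, previous))]


def incremental_conv1d(prev_output, delta, kernel):
    """Update the stored previous output using only the (sparse) difference.

    Relies on linearity: conv(prev + delta) == conv(prev) + conv(delta).
    A sparsity-aware engine could skip the zero entries of delta; this toy
    version still iterates over them.
    """
    update = conv1d(delta, kernel)
    return [o + u for o, u in zip(prev_output, update)]


# Hypothetical toy data: two consecutive "frames" differing in one position.
prev_frame = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
curr_frame = [1.0, 2.0, 3.5, 4.0, 5.0, 6.0]
kernel = [1.0, 0.0, -1.0]

prev_out = conv1d(prev_frame, kernel)          # stored from the previous frame
delta = sparsify_delta(curr_frame, prev_frame, threshold=0.1)
sparsity = delta.count(0.0) / len(delta)       # fraction of zero entries
curr_out = incremental_conv1d(prev_out, delta, kernel)
exact_out = conv1d(curr_frame, kernel)         # reference: full recomputation
```

In this toy case the incremental result equals the full recomputation because no non-zero difference is discarded. In a real CNN, thresholding away small differences and the non-linear activations between layers would make the result approximate, which is consistent with the accuracy drop reported above, and the stored previous feature maps are where the memory overhead comes from.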
Pages: 4