A 64.1mW Accurate Real-Time Visual Object Tracking Processor With Spatial Early Stopping on Siamese Network

被引:7
作者
Kim, Soyeon [1 ]
Kim, Sangjin [1 ]
Kim, Sangyeob [1 ]
Han, Donghyeon [1 ]
Yoo, Hoi-Jun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
关键词
Feature extraction; Indexes; Kernel; Computer architecture; Real-time systems; Convolution; Search problems; Deep neural network; low-power accelerators; siamese network; visual object tracking; visual attention;
D O I
10.1109/TCSII.2021.3067351
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A low power real-time visual object tracking (VOT) processor using the siamese network (SiamNet) is proposed for mobile devices. Two key features enable a real-time VOT with low power consumption on mobile devices. First, correlation-based spatial early stopping (CSES) is proposed to reduce the computational workload. CSES reduces similar to 56.8% of the overall computation of the SiamNet by gradually eliminating the background. Second, the dual mode reuse core (DMRC) is proposed for supporting both the convolution layer and the cross-correlation layer with high core utilization. Finally, the proposed VOT processor is implemented in 28 nm CMOS technology and occupies 0.42 mm(2). The proposed processor achieves 0.587 for the success rate and 0.778 for the precision in the OTB-100 dataset with SiamRPN++-AlexNet. Compared to previous VOT processors, the proposed processor shows state-of-the-art performance while showing lower power consumption. The proposed processor achieves 64.1 mW peak power and 58.2 mW tracking power consumption at 32.1 frame-per-second (fps) real-time VOT on mobile devices.
引用
收藏
页码:1675 / 1679
页数:5
相关论文
共 13 条
[1]  
Agarwal N., 2019, P FRONT COMP THEOR T, P629, DOI [10.1007/978-981-13-3648-5_76, DOI 10.1007/978-981-13-3648-5_76]
[2]   Integrating Stereo Vision with a CNN Tracker for a Person-Following Robot [J].
Chen, Bao Xin ;
Sahdev, Raghavender ;
Tsotsos, John K. .
COMPUTER VISION SYSTEMS, ICVS 2017, 2017, 10528 :300-313
[3]  
Chen BaoXin., 2019, Fast visual object tracking with rotated bounding boxes
[4]   Human Detection and Tracking for Video Surveillance: A Cognitive Science Approach [J].
Gajjar, Vandit ;
Gurnani, Ayesha ;
Khandhediya, Yash .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2805-2809
[5]  
Han D, 2019, SYMP VLSI CIRCUITS, pC304, DOI [10.23919/VLSIC.2019.8778006, 10.23919/vlsic.2019.8778006]
[6]   A Low-Power Deep Neural Network Online Learning Processor for Real-Time Object Tracking Application [J].
Han, Donghyeon ;
Lee, Jinsu ;
Lee, Jinmook ;
Yoo, Hoi-Jun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2019, 66 (05) :1794-1804
[7]   Convolutional Channel Features-Based Person Identification for Person Following Robots [J].
Koide, Kenji ;
Miura, Jun .
INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15, 2019, 867 :186-198
[8]   On-Road Pedestrian Tracking Across Multiple Driving Recorders [J].
Lee, Kuan-Hui ;
Hwang, Jenq-Neng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (09) :1429-1438
[9]   Online-Learning-Based Human Tracking Across Non-Overlapping Cameras [J].
Lee, Young-Gun ;
Tang, Zheng ;
Hwang, Jenq-Neng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (10) :2870-2883
[10]   SiamRPN plus plus : Evolution of Siamese Visual Tracking with Very Deep Networks [J].
Li, Bo ;
Wu, Wei ;
Wang, Qiang ;
Zhang, Fangyi ;
Xing, Junliang ;
Yan, Junjie .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4277-4286