Discriminative Siamese Tracker Based on Multi-Channel-Aware and Adaptive Hierarchical Deep Features

Cited by: 1
Authors
Zhang, Huanlong [1 ]
Duan, Rui [1 ]
Zheng, Anping [1 ]
Zhang, Jie [1 ]
Li, Linwei [1 ]
Wang, Fengxian [1 ]
Affiliations
[1] Zhengzhou Univ Light Ind, Sch Elect & Informat Engn, Zhengzhou 450002, Peoples R China
Source
SYMMETRY-BASEL | 2021 / Vol. 13 / Iss. 12
Funding
National Natural Science Foundation of China;
Keywords
target features; siamese trackers; multi-channel aware; adaptive hierarchical features; visual tracking; CORRELATION FILTER TRACKER; OBJECT TRACKING; VISUAL TRACKING;
D O I
10.3390/sym13122329
CLC Classification
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Subject Classification
07; 0710; 09;
Abstract
Most existing Siamese trackers use a pre-trained convolutional neural network to extract target features. However, because pre-trained deep features discriminate weakly between target and background, a Siamese tracker's performance can degrade significantly when it faces similar distractor objects or changes in target appearance. This paper proposes a multi-channel-aware and adaptive hierarchical deep features module to enhance the tracker's discriminative ability. Firstly, the multi-channel-aware deep features module derives importance values for the feature channels from both the target's detail and overall information, identifying the more important channels. Secondly, the adaptive hierarchical deep features module determines the importance of each feature layer from the response value of each frame, so that hierarchical features can be integrated to represent the target and better adapt to appearance changes. Finally, the two proposed modules are integrated into the Siamese framework for target tracking. The Siamese network used in this paper is a symmetric neural network with two input branches that share the same weights, an architecture widely used in visual tracking. Experiments on several benchmarks show that the proposed Siamese tracker improves on the baseline tracker by several points.
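The two modules described above can be sketched in simplified form: channel importance computed from detail and overall statistics of a feature map, and a per-frame layer weighting driven by response peaks. The statistics used here (per-channel mean and standard deviation, peak-normalized layer weights) are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np

def channel_importance(feat):
    """Per-channel importance from overall (mean) and detail (std) statistics.

    feat: array of shape (C, H, W). Returns weights of shape (C,) that sum
    to 1. Illustrative scheme only; the paper's actual importance measure
    may differ.
    """
    overall = feat.mean(axis=(1, 2))       # overall (global) information
    detail = feat.std(axis=(1, 2))         # detail (local variation) information
    score = overall + detail
    score = score - score.min()            # shift to non-negative
    return score / (score.sum() + 1e-12)   # normalize to channel weights

def reweight_channels(feat):
    """Emphasize the more important feature channels."""
    w = channel_importance(feat)
    return feat * w[:, None, None]

def fuse_hierarchical(resp_shallow, resp_deep):
    """Adaptively combine response maps from two feature layers.

    The layer whose response map has the higher peak in the current frame
    receives the larger weight, so the fusion adapts frame by frame.
    """
    p1, p2 = resp_shallow.max(), resp_deep.max()
    alpha = p1 / (p1 + p2 + 1e-12)
    return alpha * resp_shallow + (1.0 - alpha) * resp_deep
```

In a full tracker, `reweight_channels` would be applied to the backbone features of both Siamese branches (which share weights), and `fuse_hierarchical` would combine the cross-correlation responses of shallow and deep layers before locating the target.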
Pages: 21