Video Object Detection with MeanShift Tracking

被引:1
作者
Zhang, Shuai [1 ,3 ]
Liu, Wei [2 ]
Fu, Haijie [1 ]
Yue, Xiaodong [1 ,3 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Tongji Univ, Coll Elect & Informat Engn, Shanghai 200444, Peoples R China
[3] Shanghai Univ, Artificial Intelligence Inst, Shanghai, Peoples R China
来源
ROUGH SETS, IJCRS 2022 | 2022年 / 13633卷
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Video object detection; Deep neural networks; MeanShift; ADAPTIVE MEAN-SHIFT; SCALE;
D O I
10.1007/978-3-031-21244-4_17
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video object detection, a basic task in the computer vision, is rapidly evolving and widely used in various real-world applications. Recently, with the success of deep learning, deep video object detection has become an important research direction. Although existing deep video object detection methods have achieved excellent results compared with those of traditional methods, they ignore the motion laws of objects and are hard to improve the detection performance of the fast moving objects suffering from deteriorated problems such as the motion blur, video defocus, object occlusion and rare poses. To address this limitation, we add the object trajectory information into the process of the video object detection and devise a novel deep video object detection method which utilizes the MeanShift algorithm to guide the deep neural networks to enhance the video object detection performance. The experiments on ImageNet VID dataset validate that the proposed method can improve the recognition performance of fast moving objects with taking into account the motion laws of objects.
引用
收藏
页码:224 / 237
页数:14
相关论文
共 37 条
[1]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[2]   Memory Enhanced Global-Local Aggregation for Video Object Detection [J].
Chen, Yihong ;
Cao, Yue ;
Hu, Han ;
Wang, Liwei .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :10334-10343
[3]   Kernel-based object tracking [J].
Comaniciu, D ;
Ramesh, V ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (05) :564-577
[4]  
Comaniciu D, 2000, PROC CVPR IEEE, P142, DOI 10.1109/CVPR.2000.854761
[5]   Short-term anchor linking and long-term self-guided attention for video object detection [J].
Cores, Daniel ;
Brea, Victor M. ;
Mucientes, Manuel .
IMAGE AND VISION COMPUTING, 2021, 110
[6]  
Dai JF, 2016, ADV NEUR IN, V29
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]   Relation Distillation Networks for Video Object Detection [J].
Deng, Jiajun ;
Pan, Yingwei ;
Yao, Ting ;
Zhou, Wengang ;
Li, Houqiang ;
Mei, Tao .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7022-7031
[9]  
Fu CY, 2017, PREPRINT
[10]  
FUKUNAGA K, 1975, IEEE T INFORM THEORY, V21, P32, DOI 10.1109/TIT.1975.1055330