Online Scene Text Tracking with Spatial-Temporal Relation

Cited by: 1

Authors:

Xiu, Yan [1]

Zhou, Hong-Yang [1]

Tian, Shu [1]

Yin, Xu-Cheng [1]

Affiliation:

[1] Univ Sci & Technol Beijing, Beijing, Peoples R China

Source:

IMAGE AND GRAPHICS (ICIG 2021), PT III | 2021 / Vol. 12890

Funding:

National Natural Science Foundation of China;

Keywords:

Spatial-temporal relation; Scene text tracking; Multiple object tracking; VIDEO;

DOI:

10.1007/978-3-030-87361-5_50

Chinese Library Classification (CLC) number:

TP31 [Computer Software];

Subject classification codes:

081202 ; 0835 ;

Abstract:

Scene texts in video vary in color, size, and format and are easily confused with the background, which poses significant challenges for video scene text tracking. As a result, trajectories are often fragmented. Most tracking methods focus on matching appearance features and temporal information across frames, treating each text as a separate object. However, the relations among all texts are also important cues. In this paper, we propose a novel online video scene text tracking approach with a spatial-temporal relation module that utilizes multiple cues, i.e., appearance, geometry, and temporal information. The spatial-temporal relation module enhances appearance features by modeling the relations among texts in the same frame, which can avoid the influence of bad detection results and helps track text stably and consistently. With the spatial-temporal relation module, we obtain more tracked texts and more complete trajectories on IC15.

Pages: 610-622

Number of pages: 13

50 items in total

[1] Online spatial-temporal data fusion for robust adaptive tracking
Chen, Jixu
Ji, Qiang
2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3326 - +
[2] Spatial-Temporal Relation Networks for Multi-Object Tracking
Xu, Jiarui
Cao, Yue
Zhang, Zheng
Hu, Han
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3987 - 3997
[3] Online object tracking based on CNN with spatial-temporal saliency guided sampling
Zhang, Peng
Zhuo, Tao
Huang, Wei
Chen, Kangli
Kankanhalli, Mohan
NEUROCOMPUTING, 2017, 257 : 115 - 127
[4] Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Cong, Yuren
Liao, Wentong
Ackermann, Hanno
Rosenhahn, Bodo
Yang, Michael Ying
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 16352 - 16362
[5] Video Scene Graph Generation with Spatial-Temporal Knowledge
Pu, Tao
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
[6] Online Learning of Spatial-Temporal Convolution Response for Robust Real-Time Tracking
Zhou, Jinglin
Wang, Rong
Ding, Jianwei
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1821 - 1826
[7] Online learning and joint optimization of combined spatial-temporal models for robust visual tracking
Zhou, Tao
Bhaskar, Harish
Liu, Fanghui
Yang, Jie
Cai, Ping
NEUROCOMPUTING, 2017, 226 : 221 - 237
[8] Spatial-Temporal Context-Aware Tracking
Han, Yuqi
Deng, Chenwei
Zhao, Boya
Zhao, Baojun
IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) : 500 - 504
[9] Mining Spatial-Temporal Similarity for Visual Tracking
Zhang, Yu
Gao, Xingyu
Chen, Zhenyu
Zhong, Huicai
Xie, Hongtao
Yan, Chenggang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8107 - 8119
[10] Tracking the Evolving Spatial-Temporal Gene Networks
Gong, Weikang
Wan, Lin
IFAC PAPERSONLINE, 2015, 48 (28) : 1365 - 1368