Online Scene Text Tracking with Spatial-Temporal Relation

被引:1
|
作者
Xiu, Yan [1 ]
Zhou, Hong-Yang [1 ]
Tian, Shu [1 ]
Yin, Xu-Cheng [1 ]
机构
[1] Univ Sci & Technol Beijing, Beijing, Peoples R China
来源
IMAGE AND GRAPHICS (ICIG 2021), PT III | 2021年 / 12890卷
基金
中国国家自然科学基金;
关键词
Spatial-temporal relation; Scene text tracking; Multiple object tracking; VIDEO;
D O I
10.1007/978-3-030-87361-5_50
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Scene texts in video are not fixed in color, size, format and are easily confused with the background, which imposes significant challenges in video scene text tracking. The trajectories are often be fragmented caused by these. Most tracking methods focus on the matching of the appearance features and the temporal information across frames, treating each text as a separate object. However, the relations among all texts are also important cues. In this paper, we propose a novel online video scene text tracking approach with the spatial-temporal relation module utilizing multiple cues, i.e. appearance, geometry and temporal. The spatial-temporal relation module enhances appearance features by modeling the relations between texts with each other in the same frame, which can avoid the influence of bad detection results, and track text stably and consistently. We achieved more tracked texts and more complete trajectories on IC15 with the spatial-temporal relation module.
引用
收藏
页码:610 / 622
页数:13
相关论文
共 50 条
  • [1] Online spatial-temporal data fusion for robust adaptive tracking
    Chen, Jixu
    Ji, Qiang
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3326 - +
  • [2] Spatial-Temporal Relation Networks for Multi-Object Tracking
    Xu, Jiarui
    Cao, Yue
    Zhang, Zheng
    Hu, Han
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3987 - 3997
  • [3] Online object tracking based on CNN with spatial-temporal saliency guided sampling
    Zhang, Peng
    Zhuo, Tao
    Huang, Wei
    Chen, Kangli
    Kankanhalli, Mohan
    NEUROCOMPUTING, 2017, 257 : 115 - 127
  • [4] Spatial-Temporal Transformer for Dynamic Scene Graph Generation
    Cong, Yuren
    Liao, Wentong
    Ackermann, Hanno
    Rosenhahn, Bodo
    Yang, Michael Ying
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 16352 - 16362
  • [5] Video Scene Graph Generation with Spatial-Temporal Knowledge
    Pu, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
  • [6] Online Learning of Spatial-Temporal Convolution Response for Robust Real-Time Tracking
    Zhou, Jinglin
    Wang, Rong
    Ding, Jianwei
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1821 - 1826
  • [7] Online learning and joint optimization of combined spatial-temporal models for robust visual tracking
    Zhou, Tao
    Bhaskar, Harish
    Liu, Fanghui
    Yang, Jie
    Cai, Ping
    NEUROCOMPUTING, 2017, 226 : 221 - 237
  • [8] Spatial-Temporal Context-Aware Tracking
    Han, Yuqi
    Deng, Chenwei
    Zhao, Boya
    Zhao, Baojun
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (03) : 500 - 504
  • [9] Mining Spatial-Temporal Similarity for Visual Tracking
    Zhang, Yu
    Gao, Xingyu
    Chen, Zhenyu
    Zhong, Huicai
    Xie, Hongtao
    Yan, Chenggang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8107 - 8119
  • [10] Tracking the Evolving Spatial-Temporal Gene Networks
    Gong, Weikang
    Wan, Lin
    IFAC PAPERSONLINE, 2015, 48 (28): : 1365 - 1368