Video text extraction using temporal feature vectors

被引:8
作者
Tang, XO [1 ]
Luo, B [1 ]
Gao, XB [1 ]
Pissaloux, E [1 ]
Zhang, HJ [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Informat Engn, Shatin, Hong Kong, Peoples R China
来源
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS | 2002年
关键词
D O I
10.1109/ICME.2002.1035724
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A new caption text extraction algorithm that takes full advantage of the temporal information in a video sequence is developed. By detecting the (dis)appearance of caption text in a video stream, we first identify video segment that contains the same caption text. Then using the gray-level vector traced across the segment as the feature vector for a pixel point, we can clearly separate a caption pixel from a background pixel for the entire segment.
引用
收藏
页码:85 / 88
页数:4
相关论文
共 15 条
  • [1] AGNIHOTRI L, 1999, WORKSH CONTENT BASED
  • [2] GAO X, 2000, P INT DAT ENG AUT LE, P425
  • [3] Garcia C, 2000, INT CONF ACOUST SPEE, P2326, DOI 10.1109/ICASSP.2000.859306
  • [4] Automatic text location in images and video frames
    Jain, AK
    Yu, B
    [J]. PATTERN RECOGNITION, 1998, 31 (12) : 2055 - 2076
  • [5] AUTOMATIC EXTRACTION OF CHARACTERS IN COMPLEX SCENE IMAGES
    LEE, CM
    KANKANHALLI, A
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1995, 9 (01) : 67 - 82
  • [6] Automatic text detection and tracking in digital video
    Li, HP
    Doermann, D
    Kia, O
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (01) : 147 - 156
  • [7] Automatic text recognition in digital videos
    Lienhart, R
    Stuber, F
    [J]. IMAGE AND VIDEO PROCESSING IV, 1996, 2666 : 180 - 188
  • [8] RECOGNIZING CHARACTERS IN SCENE IMAGES
    OHYA, J
    SHIO, A
    AKAMATSU, S
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1994, 16 (02) : 214 - 220
  • [9] Video OCR: indexing digital news libraries by recognition of superimposed captions
    Sato, T
    Kanade, T
    Hughes, EK
    Smith, MA
    Satoh, S
    [J]. MULTIMEDIA SYSTEMS, 1999, 7 (05) : 385 - 395
  • [10] Shim JC, 1998, INT C PATT RECOG, P618, DOI 10.1109/ICPR.1998.711219