Caption analysis and recognition for building video indexing systems

被引：13

作者：

Chang, F ^{[1
]}

Chen, GC

Lin, CC

Lin, WH

机构：

[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan

[2] Natl Taipei Univ technol, Dept Elect Engn, Taipei, Taiwan

来源：

MULTIMEDIA SYSTEMS | 2005年 / 10卷 / 04期

关键词：

background removal; caption tracking; character recognition; support vector machines; prototype classification;

D O I：

10.1007/s00530-004-0159-y

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose several methods for analyzing and recognizing Chinese video captions, which constitute a very useful information source for video content. Image binarization, performed by combining a global threshold method and a window-based method, is used to obtain clearer images of characters, and a caption-tracking scheme is used to locate caption regions and detect caption changes. The separation of characters from possibly complex backgrounds is achieved by using size and color constraints and by cross examination of multiframe images. To segment individual characters, we use a dynamic split-and-merge strategy. Finally, we propose a character recognition process using a prototype classification method, supplemented by a disambiguation process using support vector machines, to improve recognition outcomes. This is followed by a postprocess that integrates multiple recognition results. The overall accuracy rate for the entire process applied to test video films is 94.11%.

引用

页码：344 / 355

页数：12

共 31 条

[1]

[Anonymous], 1991, NEAREST NEIGHB NORMS

[2]

Antani S, 2000, INT C PATT RECOG, P831, DOI 10.1109/ICPR.2000.905537

[3] Techniques and systems for image and video retrieval [J].

Aslandogan, YA ;

Yu, CT .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1999, 11 (01) :56-63

[4] LIBSVM: A Library for Support Vector Machines [J].

Chang, Chih-Chung ;

Lin, Chih-Jen .

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)

[5] A linear-time component-labeling algorithm using contour tracing technique [J].

Chang, F ;

Chen, CJ ;

Lu, CJ .

COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 93 (02) :206-220

[6]

CHANG F, 2001, INT J DOC ANAL RECOG, V4, P46

[7]

CHANG F, 1999, 5 INT C DOC AN REC B

[8]

Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482

[9]

Doermann D, 2003, PROC INT CONF DOC, P606

[10]

Hua XS, 2002, 2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, P397

← 1 2 3 4 →