A Survey on Visual Content-Based Video Indexing and Retrieval

被引:352
作者
Hu, Weiming [1 ]
Xie, Nianhua [1 ]
Li, Li [1 ]
Zeng, Xianglin [1 ]
Maybank, Stephen [2 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100190, Peoples R China
[2] Univ London Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS | 2011年 / 41卷 / 06期
基金
中国国家自然科学基金;
关键词
Feature extraction; video annotation; video browsing; video retrieval; video structure analysis; SHOT-BOUNDARY DETECTION; KEY-FRAME-EXTRACTION; MULTIMEDIA INFORMATION-RETRIEVAL; OF-THE-ART; UNIFIED FRAMEWORK; COMPRESSED VIDEO; CONCEPT ONTOLOGY; SCENE DETECTION; EVENT DETECTION; SOCCER VIDEO;
D O I
10.1109/TSMCC.2011.2109710
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video indexing and retrieval have a wide spectrum of promising applications, motivating the interest of researchers worldwide. This paper offers a tutorial and an overview of the landscape of general strategies in visual content-based video indexing and retrieval, focusing on methods for video structure analysis, including shot boundary detection, key frame extraction and scene segmentation, extraction of features including static key frame features, object features and motion features, video data mining, video annotation, video retrieval including query interfaces, similarity measure and relevance feedback, and video browsing. Finally, we analyze future research directions.
引用
收藏
页码:797 / 819
页数:23
相关论文
共 277 条
[71]   Movie scene segmentation using background information [J].
Chen, Liang-Hua ;
Lai, Yu-Chun ;
Liao, Hong-Yuan Mark .
PATTERN RECOGNITION, 2008, 41 (03) :1056-1065
[72]   A new methodology to predict energy bandgaps in GaxIn1-xAsyP1-y compounds by ANFIS theories [J].
Chen, SL ;
Fann, DA .
OPTOELECTRONIC MATERIALS AND DEVICES II, 2000, 4078 :544-550
[73]   A Human-Centered Multiple Instance Learning Framework for Semantic Video Retrieval [J].
Chen, Xin ;
Zhang, Chengcui ;
Chen, Shu-Ching ;
Rubin, Stuart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2009, 39 (02) :228-233
[74]   A knowledge-based approach to video content classification [J].
Chen, Y ;
Wong, EK .
STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 :292-300
[75]   Summarization of visual content in instructional videos [J].
Choudary, Chekuri ;
Liu, Tiecheng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2007, 9 (07) :1443-1455
[76]  
Christel MG, 2004, 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS, P1032
[77]  
CHRISTEL MG, 2008, P 2 ACM TRECVID VID, P35
[78]  
Christel MG, 2006, LECT NOTES COMPUT SC, V4071, P21
[79]   Supervised and unsupervised classification post-processing for visual video summaries [J].
Ciocca, Gianluigi ;
Schettini, Raimondo .
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2006, 52 (02) :630-638
[80]   Discriminative techniques for keyframe selection [J].
Cooper, M ;
Foote, J .
2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, :502-505