A Novel Video Summarization Based on Mining the Story-Structure and Semantic Relations Among Concept Entities

被引:82
作者
Chen, Bo-Wei [1 ]
Wang, Jia-Ching [1 ]
Wang, Jhing-Fa [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 701, Taiwan
关键词
Concept expansion tree; graph entropy; graph mining; structural video contents; video browsing; video indexing; video summarization; RETRIEVAL; ATTENTION; FRAMEWORK; MODEL;
D O I
10.1109/TMM.2008.2009703
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization techniques have been proposed for years to offer people comprehensive understanding of the whole story in the video. Roughly speaking, existing approaches can be classified into the two types: one is static storyboard, and the other is dynamic skimming. However, despite that these traditional methods give brief summaries for users, they still do not provide with a concept-organized and systematic view. In this paper, we present a structural video content browsing system and a novel summarization method by utilizing the four kinds of entities: who, what, where, and when to establish the framework of the video contents. With the assistance of the above-mentioned indexed information, the structure of the story can be built up according to the characters, the things, the places, and the time. Therefore, users can not only browse the video efficiently but also focus on what they are interested in via the browsing interface. In order to construct the fundamental system, we employ maximum entropy criterion to integrate visual and text features extracted from video frames and speech transcripts, generating high-level concept entities. A novel concept expansion method is introduced to explore the associations among these entities. After constructing the relational graph, we exploit graph entropy model to detect meaningful shots and relations, which serve as the indices for users. The results demonstrate that our system can achieve better performance and information coverage.
引用
收藏
页码:295 / 312
页数:18
相关论文
共 54 条
[1]  
Aner A, 2002, IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, pA237
[2]  
[Anonymous], 2000, Computational geometry: algorithms and applications
[3]  
Argillander J, 2005, INT CONF ACOUST SPEE, P153
[4]  
Ba Tu Truong, 2000, Proceedings ACM Multimedia 2000, P219, DOI 10.1145/354384.354481
[5]  
BAGESHREE S, 2007, P 2007 C DIG LIB VAN, P127
[6]  
Bagga A, 2002, INT C PATT RECOG, P818, DOI 10.1109/ICPR.2002.1048428
[7]  
Berger AL, 1996, COMPUT LINGUIST, V22, P39
[8]   Information theory-based shot cut/fade detection and video summarization [J].
Cerneková, Z ;
Pitas, I ;
Nikou, C .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (01) :82-91
[9]  
Christel M.G., 2008, Proceedings of the International Conference on Content-based Image and Video Retrieval (CIVR '08), P447
[10]  
Christel MG, 2001, INT CONF ACOUST SPEE, P1409, DOI 10.1109/ICASSP.2001.941193