Multi-View Video Summarization

被引:130
作者
Fu, Yanwei [1 ]
Guo, Yanwen [2 ,3 ]
Zhu, Yanshu [1 ]
Liu, Feng [4 ]
Song, Chuanming [1 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[2] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[3] Nanjing Univ, Jiangyin Informat Technol Res Inst, Nanjing 210093, Peoples R China
[4] Univ Wisconsin, Dept Comp Sci, Madison, WI 53562 USA
基金
美国国家科学基金会;
关键词
Multi-objective optimization; multi-view video; random walks; spatio-temporal graph; video summarization; RETRIEVAL; FRAMEWORK; IMAGE;
D O I
10.1109/TMM.2010.2052025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous video summarization studies focused on monocular videos, and the results would not be good if they were applied to multi-view videos directly, due to problems such as the redundancy in multiple views. In this paper, we present a method for summarizing multi-view videos. We construct a spatio-temporal shot graph and formulate the summarization problem as a graph labeling task. The spatio-temporal shot graph is derived from a hypergraph, which encodes the correlations with different attributes among multi-view video shots in hyperedges. We then partition the shot graph and identify clusters of event-centered shots with similar contents via random walks. The summarization result is generated through solving a multi-objective optimization problem based on shot importance evaluated using a Gaussian entropy fusion scheme. Different summarization objectives, such as minimum summary length and maximum information coverage, can be accomplished in the framework. Moreover, multi-level summarization can be achieved easily by configuring the optimization parameters. We also propose the multi-view storyboard and event board for presenting multi-view summaries. The storyboard naturally reflects correlations among multi-view summarized shots that describe the same important event. The event-board serially assembles event-centered multi-view shots in temporal order. Single video summary which facilitates quick browsing of the summarized multi-view video can be easily generated based on the event board representation.
引用
收藏
页码:717 / 729
页数:13
相关论文
共 53 条
[1]  
[Anonymous], P SIGGRAPH
[2]  
[Anonymous], P BRIT MACH VIS C
[3]  
[Anonymous], 2003, P 11 ACM INT C MULTI, DOI DOI 10.1145/957013.957094
[4]  
[Anonymous], 2007, Numerical Recipes
[5]   Semantic annotation of soccer videos: automatic highlights identification [J].
Assfalg, E ;
Bertini, M ;
Colombo, C ;
Del Bimbo, A ;
Nunziati, W .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2003, 92 (2-3) :285-305
[6]  
Babaguchi N, 2000, 2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, P1519, DOI 10.1109/ICME.2000.871056
[7]  
BAILER W, 2008, P ICIP, P29
[8]  
BENMOKHTAR R, 2007, P 10 INT C INF FUS, P1
[9]  
Berge Claude, 1989, Combinatorics of finite sets
[10]  
Christodoulou MI, 2008, ANN RHEUM DIS, V67, pA12