Automatic video summarization by graph modeling

被引:73
作者
Ngo, CW [1 ]
Ma, YF [1 ]
Zhang, HJ [1 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China
来源
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS | 2003年
关键词
D O I
10.1109/ICCV.2003.1238320
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a unified approach for summarization based on the analysis of video structures and video highlights. Our approach emphasizes both the content balance and perceptual quality of a summary. Normalized cut algorithm is employed to globally and optimally partition a video into clusters. A motion attention model based on human perception is employed to compute the perceptual quality of shots and clusters. The clusters, together with the computed attention values, form a temporal graph similar to Markov chain that inherently describes the evolution and perceptual importance of video clusters. In our application, the flow of a temporal graph is utilized to group similar clusters into scenes, while the attention values are used as guidelines to select appropriate sub-shots in scenes for summarization.
引用
收藏
页码:104 / 109
页数:6
相关论文
共 17 条
[11]  
Ma Y. F., 2002, INT C IM PROC
[12]   Video partitioning by temporal slice coherency [J].
Ngo, CW ;
Pong, TC ;
Chin, RT .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (08) :941-953
[13]  
NGO CW, 2002, INT J COMPUTER VISIO
[14]  
Orriols X., 2001, INT C COMP VIS
[15]   Normalized cuts and image segmentation [J].
Shi, JB ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :888-905
[16]  
Smith M. A., 1997, INT C COMP VIS PATT
[17]  
VASCONCELOS N, 1998, INT C CVPR