Automatic video summarization by graph modeling

Cited by: 73

Authors:

Ngo, CW [1]

Ma, YF [1]

Zhang, HJ [1]

Affiliations:

[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Hong Kong, Peoples R China

Source:

NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS | 2003

DOI:

10.1109/ICCV.2003.1238320

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a unified approach for summarization based on the analysis of video structures and video highlights. Our approach emphasizes both the content balance and the perceptual quality of a summary. The normalized cut algorithm is employed to globally and optimally partition a video into clusters. A motion attention model based on human perception is employed to compute the perceptual quality of shots and clusters. The clusters, together with the computed attention values, form a temporal graph, similar to a Markov chain, that inherently describes the evolution and perceptual importance of video clusters. In our application, the flow of the temporal graph is utilized to group similar clusters into scenes, while the attention values are used as guidelines to select appropriate sub-shots in scenes for summarization.
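The clustering step named in the abstract can be illustrated with a minimal sketch of a two-way normalized cut, using the standard spectral relaxation of Shi and Malik. This is not the authors' implementation: the shot feature vectors, the Gaussian similarity, the `sigma` value, and the function names `gaussian_similarity` and `ncut_bipartition` are all assumptions made for illustration.

```python
import numpy as np


def gaussian_similarity(features, sigma=2.0):
    """Build a dense similarity matrix from per-shot feature vectors.

    The Gaussian kernel and sigma are illustrative assumptions,
    not the similarity measure used in the paper.
    """
    diff = features[:, None, :] - features[None, :, :]
    sq_dist = (diff ** 2).sum(axis=-1)
    return np.exp(-sq_dist / (2.0 * sigma ** 2))


def ncut_bipartition(W):
    """Split graph nodes into two groups via the relaxed normalized cut.

    Solves (D - W) y = lambda * D y through the symmetric normalized
    Laplacian and thresholds the second-smallest eigenvector at zero.
    """
    d = W.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    # Symmetric normalized Laplacian: I - D^{-1/2} W D^{-1/2}
    L_sym = np.eye(len(W)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    _, eigvecs = np.linalg.eigh(L_sym)  # eigenvalues in ascending order
    # Map the second eigenvector back to the generalized eigenproblem.
    y = d_inv_sqrt * eigvecs[:, 1]
    return y > 0


# Hypothetical 2-D features for six shots forming two visually distinct groups.
features = np.array([
    [0.0, 0.1], [0.1, 0.0], [0.05, 0.05],
    [5.0, 5.1], [5.1, 5.0], [4.9, 5.0],
])
W = gaussian_similarity(features)
labels = ncut_bipartition(W)
```

Applying the bipartition recursively to each resulting group would yield more than two clusters; the stopping criterion is a further design choice that the abstract does not specify.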

Pages: 104-109

Page count: 6
