Scene Detection in Videos Using Shot Clustering and Sequence Alignment

被引:110
作者
Chasanis, Vasileios T. [1 ]
Likas, Aristidis C. [1 ]
Galatsanos, Nikolaos P. [2 ]
机构
[1] Univ Ioannina, Dept Comp Sci, GR-45110 Ioannina, Greece
[2] Univ Patras, Dept Elect & Comp Engn, Rion 26500, Greece
关键词
Global k-means; key-frames; scene detection; sequence alignment; SEGMENTATION;
D O I
10.1109/TMM.2008.2008924
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video indexing requires the efficient segmentation of video into scenes. The video is first segmented into shots and a set of key-frames is extracted for each shot. Typical scene detection algorithms incorporate time distance in a shot similarity metric. In the method we propose, to overcome the difficulty of having prior knowledge of the scene duration, the shots are clustered into groups based only on their visual similarity and a label is assigned to each shot according to the group that it belongs to. Then, a sequence alignment algorithm is applied to detect when the pattern of shot labels changes, providing the final scene segmentation result. In this way shot similarity is computed based only on visual features, while ordering of shots is taken into account during sequence alignment. To cluster the shots into groups we propose an improved spectral clustering method that both estimates the number of clusters and employs the fast global k-means algorithm in the clustering stage after the eigenvector computation of the similarity matrix. The same spectral clustering method is applied to extract the key-frames of each shot and numerical experiments indicate that the content of each shot is efficiently summarized using the method we propose herein. Experiments on TV-series and movies also indicate that the proposed scene detection method accurately detects most of the scene boundaries while preserving a good tradeoff between recall and precision.
引用
收藏
页码:89 / 100
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 2001, Proceedings of Neural Information Processing Systems (NIPS)
[2]  
[Anonymous], 1999, Visual Information Retrieval
[3]  
[Anonymous], 1993, Multimedia Systems, DOI DOI 10.1007/BF01210504
[4]  
Chang HS, 1999, IEEE T CIRC SYST VID, V9, P1269, DOI 10.1109/76.809161
[5]   An innovative algorithm for key frame extraction in video summarization [J].
Gianluigi, Ciocca ;
Raimondo, Schettini .
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2006, 1 (01) :69-88
[6]   Automated high-level movie segmentation for advanced video-retrieval systems [J].
Hanjalic, A ;
Lagendijk, RL ;
Biemond, J .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (04) :580-588
[7]  
Jones NC., 2004, An introduction to bioinformatics algorithms
[8]  
Likas A, 2003, PATTERN RECOGN, V36, P451, DOI 10.1016/S0031-3203(02)00060-2
[9]   Shot reconstruction degree: a novel criterion for key frame selection [J].
Liu, TY ;
Zhang, XD ;
Feng, H ;
Lo, KT .
PATTERN RECOGNITION LETTERS, 2004, 25 (12) :1451-1457
[10]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+