An innovative algorithm for key frame extraction in video summarization

Cited by: 110
Authors
Gianluigi, Ciocca [1]
Raimondo, Schettini [1]
Affiliations
[1] Univ Milano Bicocca, Dipartimento Informat Sistemist & Comunicaz DISCo, I-20126 Milan, Italy
Keywords
Video summarization; Visual summary evaluation; Dynamic key frames extraction; Frame content description;
DOI
10.1007/s11554-006-0001-1
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104; 0812; 0835; 1405;
Abstract
Video summarization, aimed at reducing the amount of data that must be examined in order to retrieve the desired information from a video, is an essential task in video analysis and indexing applications. We propose an innovative approach for the selection of representative (key) frames of a video sequence for video summarization. By analyzing the differences between two consecutive frames of a video sequence, the algorithm determines the complexity of the sequence in terms of changes in the visual content expressed by different frame descriptors. The algorithm, which avoids the complexity of existing methods based, for example, on clustering or optimization strategies, dynamically and rapidly selects a variable number of key frames within each sequence. The key frames are extracted by detecting curvature points within the curve of the cumulative frame differences. Another advantage is that the algorithm can extract the key frames on the fly: curvature points can be determined while computing the frame differences, and the key frames can be extracted as soon as a second high-curvature point has been detected. We compare the performance of this algorithm with that of other key frame extraction algorithms based on different approaches. The summaries obtained have been objectively evaluated by three quality measures: the Fidelity measure, the Shot Reconstruction Degree measure and the Compression Ratio measure.
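The core idea in the abstract (accumulate frame differences, find high-curvature points on the resulting curve, then pick key frames) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not say how curvature is measured or which frame is chosen, so everything specific here is an assumption: the angle-based corner test, the window `w`, the threshold `max_angle`, treating the shot's first and last frames as boundary points, and taking the midpoint between consecutive points as the key frame. All function names are hypothetical. The sketch also runs in batch over a finished list of differences rather than on the fly.

```python
import math


def cumulative_differences(diffs):
    """Running sum of the per-frame difference values."""
    curve, total = [], 0.0
    for d in diffs:
        total += d
        curve.append(total)
    return curve


def angle_at(curve, i, w):
    """Angle in degrees at point i between the chords to points i-w and i+w."""
    ax, ay = -w, curve[i - w] - curve[i]
    bx, by = w, curve[i + w] - curve[i]
    cos = (ax * bx + ay * by) / (math.hypot(ax, ay) * math.hypot(bx, by))
    return math.degrees(math.acos(max(-1.0, min(1.0, cos))))


def high_curvature_points(curve, w=2, max_angle=150.0):
    """Indices whose angle is sharp (<= max_angle) and a local minimum.

    Assumed criterion: the abstract does not specify the corner detector.
    """
    angles = {i: angle_at(curve, i, w) for i in range(w, len(curve) - w)}
    points = []
    for i, a in angles.items():
        if a > max_angle:
            continue
        left = [angles[j] for j in range(i - w, i) if j in angles]
        right = [angles[j] for j in range(i + 1, i + w + 1) if j in angles]
        # Strict on the left so that a plateau of equal angles yields one point.
        if all(a < n for n in left) and all(a <= n for n in right):
            points.append(i)
    return points


def key_frames(diffs, w=2, max_angle=150.0):
    """Midpoints between consecutive boundary/high-curvature points (assumed rule)."""
    curve = cumulative_differences(diffs)
    if not curve:
        return []
    bounds = [0] + high_curvature_points(curve, w, max_angle) + [len(curve) - 1]
    return [(a + b) // 2 for a, b in zip(bounds, bounds[1:])]


# Synthetic shot: static content, a burst of change, then static again.
diffs = [0] * 6 + [10] * 6 + [0] * 6
print(key_frames(diffs))  # [2, 8, 14]
```

On this synthetic input the cumulative curve bends where the change starts and stops, so one key frame falls in each of the three segments. The angle test depends on the relative scale of the frame index and the difference values, so real descriptors would need `max_angle` and `w` tuned or the curve normalized first.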
Pages: 69-88
Page count: 20
Related papers
38 in total
  • [1] Content-based representation and retrieval of visual media: A state-of-the-art review
    Aigrain, P
    Zhang, HJ
    Petkovic, D
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 1996, 3 (03) : 179 - 202
  • [2] [Anonymous], 1992, DIGITAL IMAGE PROCES
  • [3] A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video
    Antani, S
    Kasturi, R
    Jain, R
    [J]. PATTERN RECOGNITION, 2002, 35 (04) : 945 - 965
  • [4] Arman F., 1993, Proceedings ACM Multimedia 93, P267, DOI 10.1145/166266.166297
  • [5] Efficient key-frame extraction and video analysis
    Calic, J
    Izquierdo, E
    [J]. INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, PROCEEDINGS, 2002, : 28 - 33
  • [6] Chang HS, 1999, IEEE T CIRC SYST VID, V9, P1269, DOI 10.1109/76.809161
  • [7] Chetverikov D., 1999, 23 WORKSH AUSTR PATT, V27-28 May 1999, P175
  • [8] Quicklook2: An integrated multimedia system
    Ciocca, G
    Gagliardi, I
    Schettini, R
    [J]. JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 2001, 12 (01) : 81 - 103
  • [9] Factoring wavelet transforms into lifting steps
    Daubechies, I
    Sweldens, W
    [J]. JOURNAL OF FOURIER ANALYSIS AND APPLICATIONS, 1998, 4 (03) : 247 - 269
  • [10] Davenport Glorianna, 1994, IEEE MULTIMEDIA, V1, P73