Summarizing video sequence using a graph-based hierarchical approach

Cited by: 24
Authors
Belo, Luciana dos Santos [1]
Caetano, Carlos Antonio, Jr. [1]
do Patrocinio, Zenilton Kleber Goncalves, Jr. [1]
Ferzoli Guimaraes, Silvio Jamil [1]
Affiliations
[1] Pontificia Univ Catolica Minas Gerais, Dept Comp Sci, Audio Visual Informat Proc Lab, Belo Horizonte, MG, Brazil
Keywords
Graph-based hierarchical video summarization; Covering; Global descriptors; Observation scales; REPRESENTATION; SCENE
DOI
10.1016/j.neucom.2015.08.057
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Video summarization simplifies video content in order to compact the information it carries. The problem can be cast as a clustering problem in which a few frames are selected to represent the video content saliently. In this work, we use a graph-based hierarchical clustering method to compute a video summary. The proposed approach, called HSUMM, adopts a hierarchical clustering method to generate a weight map from the frame similarity graph, from which the clusters (the connected components of the graph) can easily be inferred. This strategy also allows a similarity measure between clusters to be applied during graph partitioning, instead of considering only the similarity between isolated frames. We also provide a unified framework for video summarization based on minimum spanning trees and weight maps, in which HSUMM can be seen as an instance that uses a minimum spanning tree of frames and a weight map based on hierarchical observation scales computed over that tree. Furthermore, we propose a new evaluation measure, called Covering, that assesses the diversity of opinions among users when they produce summaries of the same video. During tests, different strategies for identifying the summary size and for selecting keyframes were analyzed. Experimental results provide quantitative and qualitative comparisons between the new approach and other popular algorithms from the literature, showing that the new algorithm is robust. In terms of F-measure, HSUMM outperforms the compared methods regardless of the visual feature used. (C) 2015 Elsevier B.V. All rights reserved.
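To make the clustering view in the abstract concrete, here is a minimal Python sketch of summarization over a minimum spanning tree of frames. It is not HSUMM itself: the abstract does not specify the hierarchical observation-scale weight map, so this sketch swaps in the simplest MST-based partition (cutting the heaviest tree edges). The feature vectors, the Euclidean distance, the centroid-nearest keyframe rule, and the names `distance`, `minimum_spanning_tree`, and `summarize` are all illustrative assumptions.

```python
import math


def distance(a, b):
    """Euclidean distance between two frame feature vectors (assumed dissimilarity)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def minimum_spanning_tree(features):
    """Prim's algorithm on the complete frame graph; returns (weight, u, v) edges."""
    n = len(features)
    if n == 0:
        return []
    in_tree = [False] * n
    best = [math.inf] * n  # cheapest known edge linking each frame to the tree
    parent = [-1] * n
    best[0] = 0.0
    edges = []
    for _ in range(n):
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if parent[u] != -1:
            edges.append((best[u], parent[u], u))
        for v in range(n):
            if not in_tree[v]:
                d = distance(features[u], features[v])
                if d < best[v]:
                    best[v] = d
                    parent[v] = u
    return edges


def summarize(features, num_keyframes):
    """Cut the heaviest MST edges, then pick one keyframe per connected component."""
    n = len(features)
    mst = sorted(minimum_spanning_tree(features))
    # Keeping the lightest edges drops the (num_keyframes - 1) heaviest ones.
    kept = mst[: max(0, n - num_keyframes)]

    # Union-find over the kept edges yields the connected components (clusters).
    root = list(range(n))

    def find(i):
        while root[i] != i:
            root[i] = root[root[i]]
            i = root[i]
        return i

    for _, u, v in kept:
        root[find(u)] = find(v)

    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)

    # Assumed selection rule: the frame closest to its cluster centroid.
    keyframes = []
    for members in clusters.values():
        dim = len(features[members[0]])
        centroid = [sum(features[i][d] for i in members) / len(members) for d in range(dim)]
        keyframes.append(min(members, key=lambda i: distance(features[i], centroid)))
    return sorted(keyframes)


# Toy "video": seven frames described by hypothetical 2-D descriptors.
frames = [[0.0, 0.0], [0.1, 0.0], [0.2, 0.0],
          [5.0, 5.0], [5.1, 5.0], [5.2, 5.0],
          [9.0, 0.0]]
print(summarize(frames, 3))
```

Because this partition only cuts individual heavy edges, it compares isolated frames; the abstract's point is that a hierarchical weight map lets the partition compare whole clusters instead, which this sketch does not attempt.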
Pages: 1001-1016
Page count: 16