Temporal segmentation and keyframe selection methods for user-generated video search-based annotation

被引:11
作者
Gonzalez-Diaz, Ivan [1 ]
Martinez-Cortes, Tomas [1 ]
Gallardo-Antolin, Ascension [1 ]
Diaz-de-Maria, Fernando
机构
[1] Univ Carlos III Madrid, Dept Signal Theory & Commun, Madrid 28911, Spain
关键词
User Generated Video; Video annotation; Video temporal segmentation; Camera motion analysis; Keyframe selection; HIDDEN MARKOV MODEL; RETRIEVAL;
D O I
10.1016/j.eswa.2014.08.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a temporal segmentation and a keyframe selection method for User-Generated Video (UGV). Since UGV is rarely structured in shots and usually user's interest are revealed through camera movements, a UGV temporal segmentation system has been proposed that generates a video partition based on a camera motion classification. Motion-related mid-level features have been suggested to feed a Hierarchical Hidden Markov Model (HHMM) that produces a user-meaningful UGV temporal segmentation. Moreover, a keyframe selection method has been proposed that picks a keyframe for fixed-content camera motion patterns such as zoom, still, or shake and a set of keyframes for varying-content translation patterns. The proposed video segmentation approach has been compared to a state-of-the-art algorithm, achieving 8% performance improvement in a segmentation-based evaluation. Furthermore, a complete search-based UGV annotation system has been developed to assess the influence of the proposed algorithms on an end-user task. To that purpose, two UGV datasets have been developed and made available online. Specifically, the relevance of the considered camera motion types has been analyzed for these two datasets, and some guidelines are given to achieve the desired performance-complexity tradeoff. The keyframe selection algorithm for varying-content translation patterns has also been assessed, revealing a notable contribution to the performance of the global UGV annotation system. Finally, it has been shown that the UGV segmentation algorithm also produces improved annotation results with respect to a fixed-rate keyframe selection baseline or a traditional method relying on frame-level visual features. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:488 / 502
页数:15
相关论文
共 60 条
[1]   Camera Motion-Based Analysis of User Generated Video [J].
Abdollahian, Golnaz ;
Taskiran, Cuneyt M. ;
Pizlo, Zygmunt ;
Delp, Edward J. .
IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (01) :28-41
[2]  
[Anonymous], INT WORKSH CONT BAS
[3]  
[Anonymous], 1966, SOVIET PHYS DOKL
[4]  
[Anonymous], INT C CONS EL ICCE 2
[5]  
[Anonymous], INT WORKSH CONT BAS
[6]  
[Anonymous], P 1 INT WORKSH INT M
[7]  
[Anonymous], P 1 BANG ANN COMP C
[8]  
[Anonymous], P INT C CONT BAS IM
[9]  
[Anonymous], P 1 ACM INT C MULT R
[10]  
[Anonymous], INT WORKSH CONT BAS