3D KEY-FRAME EXTRACTION METHOD BASED ON VISUAL SALIENCY

被引:0
作者
Ferreira, Lino [1 ]
Assuncao, Pedro [1 ]
da Silva Cruz, Luis A. [2 ]
机构
[1] Inst Politecn Leiria, ESTG, Inst Telecomunicacoes, Leiria, Portugal
[2] Univ Coimbra, DEEC, Inst Telecomunicacoes, Coimbra, Portugal
来源
2016 INTERNATIONAL CONFERENCE ON 3D IMAGING (IC3D) | 2016年
关键词
Video summaries; 3D key-frames; visual saliency maps; MODEL; DEPTH;
D O I
暂无
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
This paper presents a method for key-frame extraction from 3D video using visual saliency to weight the 3D content according to a user attention model. Key-frames are found in temporal segments of arbitrary length (i.e., 3D scenes) using a dynamic programming algorithm which minimises the dissimilarity between the reconstructed and the original temporal segment. The dissimilarity measure is based on a combination of frame difference and visual relevance estimated through visual saliency maps. These maps result from attention modeling, taking into account spatial, temporal and depth features of the 3D video content. The results, evaluated using the Shot Reconstruction Degree and the Fidelity measure, show that the proposed method outperforms those obtained from uniform sampling and attention curve methods. This method may be useful for fast browsing of 3D video repositories.
引用
收藏
页数:7
相关论文
共 23 条
[1]  
[Anonymous], P 4 INT S 3D DAT PRO
[2]  
[Anonymous], 2007, PROC IEEE C COMPUT V, DOI 10.1109/CVPR.2007.383267
[3]  
[Anonymous], JTC1SC29WG11 ISOIEC
[4]   THE ANALOGY BETWEEN STEREO DEPTH AND BRIGHTNESS [J].
BROOKES, A ;
STEVENS, KA .
PERCEPTION, 1989, 18 (05) :601-614
[5]  
Chang HS, 1999, IEEE T CIRC SYST VID, V9, P1269, DOI 10.1109/76.809161
[6]   Efficient summarization of stereoscopic video sequences [J].
Doulamis, ND ;
Doulamis, AD ;
Avrithis, YS ;
Ntalianis, KS ;
Kollias, SD .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2000, 10 (04) :501-517
[7]   Visual Conspicuity Index: Spatial Dissimilarity, Distance, and Central Bias [J].
Duan, Lijuan ;
Wu, Chunpeng ;
Miao, Jun ;
Bovik, Alan C. .
IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (11) :690-693
[8]  
Dufaux F., 2013, Emerging_Technologies_for_3D_Video:_ Creation,_Coding,_Transmission_and_Rendering
[9]   Feature aggregation based visual attention model for video summarization [J].
Ejaz, Naveed ;
Mehmood, Irfan ;
Baik, Sung Wook .
COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (03) :993-1005
[10]   A generic framework for optimal 2D/3D key-frame extraction driven by aggregated saliency maps [J].
Ferreira, Lino ;
da Silva Cruz, Luis A. ;
Assuncao, Pedro .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 39 :98-110