Keyframe extraction using Pearson correlation coefficient and color moments

被引:0
作者
Reddy Mounika Bommisetty
Om Prakash
Ashish Khare
机构
[1] University of Allahabad,Department of Electronics and Communication
[2] Inferigence Quotient LLC,undefined
来源
Multimedia Systems | 2020年 / 26卷
关键词
Pearson correlation coefficient (PCC); Color moments; Shot detection; Keyframe extraction;
D O I
暂无
中图分类号
学科分类号
摘要
Keyframe extraction plays a significant role in wide variety of real-time video processing applications such as video summarization, video management and retrieval, etc. A keyframe captures the whole content of its shot and does not contain any redundant information. The keyframe extraction algorithms are facing challenges due to different visual characteristics in videos of different categories. Therefore, a single feature is not enough to capture visual characteristics of a variety of videos. In order to tackle this problem, we propose an approach of keyframe extraction that uses hybridization of features. In the present article, we propose a novel shot detection-based keyframe extraction algorithm based on combination of two features: one is Pearson correlation coefficient (PCC) and other is color moments (CM). The linear transformation invariance property of PCC facilitates the proposed algorithm to work well under varying lighting conditions. On the other hand, the scale and rotation invariance properties of color moments are beneficial for representation of complex objects that may be present in different poses and orientations. These sustained reasons support the combination of these two features, which brings significant benefits for keyframe extraction in the proposed method. The proposed method detects shot boundaries by employing combo feature set (PCC and CM). From each shot, the frame with highest mean and standard deviation is selected as keyframe. Furthermore, another important contribution is that we developed a new dataset by collecting the videos of different categories such as movies, news, serials, animations and personal interviews and made it available online. The proposed method is experimented on three datasets: two publicly available datasets and one dataset developed by us. The performance of the proposed method on these datasets has been evaluated on the basis of different evaluation parameters: figure of merit, detection percentage, accuracy, and missing factor. Principal advantage of proposed work lies in the fact that it is capable to detect both the abrupt and gradual shot transitions. In real-time videos, it is common to have abrupt and small transitions. The experimental results show the superior performance of the proposed method over the other state-of-the-art methods.
引用
收藏
页码:267 / 299
页数:32
相关论文
共 124 条
[1]  
Khare M(2017)Object tracking using combination of Daubechies complex wavelet transform and Zernike moment Multimed Tools Appl 76 1247-1290
[2]  
Srivastava RK(2018)Human detection in complex real scenes based on combination of biorthogonal wavelet transform and Zernike moments Optik Int J Light Electron Opt 1 1267-1281
[3]  
Khare A(2018)Summarization of videos by analyzing affective state of the user through crowdsource Cognit Syst Res 1 917-930
[4]  
Prakash O(2018)An intelligent recommendation system using gaze and emotion detection Multimed Tools Appl 2018 1-20
[5]  
Gwak J(2016)Integration of moment invariants and uniform local binary patterns for human activity recognition in video sequences Multimed Tools Appl 75 17303-17332
[6]  
Khare M(2014)Vehicle identification in traffic surveillance-complex wavelet transform based approach J Sci Technol 52 29-38
[7]  
Khare A(2014)Single change detection-based moving object segmentation by using Daubechies complex wavelet transform IET Image Proc. 8 334-344
[8]  
Jeon M(2014)A perceptual scheme for fully automatic video shot boundary detection Signal process Image Commun 29 410-423
[9]  
Singhal A(2012)A model-based shot boundary detection technique using frame transition parameters IEEE Trans Multimed 14 223-233
[10]  
Kumar P(2014)Event detection and summarization in soccer videos using Bayesian network and copula IEEE Trans Circ Syst Video Technol 24 291-304