Video Summarization Based on Mutual Information and Entropy Sliding Window Method

被引:5
作者
Li, WenLin [1 ]
Qi, Deyu [2 ]
Zhang, ChangJian [1 ]
Guo, Jing [2 ]
Yao, JiaJun [2 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[2] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
entropy; video summarization; key frame extraction; video analysis; gesture videos; feature extraction; SHOT BOUNDARY DETECTION; KEY-FRAME SELECTION; KEYFRAME EXTRACTION;
D O I
10.3390/e22111285
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
This paper proposes a video summarization algorithm called the Mutual Information and Entropy based adaptive Sliding Window (MIESW) method, which is specifically for the static summary of gesture videos. Considering that gesture videos usually have uncertain transition postures and unclear movement boundaries or inexplicable frames, we propose a three-step method where the first step involves browsing a video, the second step applies the MIESW method to select candidate key frames, and the third step removes most redundant key frames. In detail, the first step is to convert the video into a sequence of frames and adjust the size of the frames. In the second step, a key frame extraction algorithm named MIESW is executed. The inter-frame mutual information value is used as a metric to adaptively adjust the size of the sliding window to group similar content of the video. Then, based on the entropy value of the frame and the average mutual information value of the frame group, the threshold method is applied to optimize the grouping, and the key frames are extracted. In the third step, speeded up robust features (SURF) analysis is performed to eliminate redundant frames in these candidate key frames. The calculation of Precision, Recall, and Fmeasure are optimized from the perspective of practicality and feasibility. Experiments demonstrate that key frames extracted using our method provide high-quality video summaries and basically cover the main content of the gesture video.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 31 条
[1]   Hierarchical Keyframe-based Video Summarization Using QR-Decomposition and Modified k-Means Clustering [J].
Amiri, Ali ;
Fathy, Mahmood .
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
[2]  
[Anonymous], 2018, P 32 AAAI C ART INT
[3]  
Chasanis VT, 2014, INT CONF SIGN PROCES, P1133, DOI 10.1109/ICOSP.2014.7015179
[4]   Adaptive key frame extraction for video summarization using an aggregation mechanism [J].
Ejaz, Naveed ;
Bin Tariq, Tayyab ;
Baik, Sung Wook .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2012, 23 (07) :1031-1040
[5]   VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method [J].
Fontes de Avila, Sandra Eliza ;
Brandao Lopes, Ana Paula ;
da Luz, Antonio, Jr. ;
Araujo, Arnaldo de Albuquerque .
PATTERN RECOGNITION LETTERS, 2011, 32 (01) :56-68
[6]   Shot-based video retrieval with optical flow tensor and HMMs [J].
Gao, Xinbo ;
Li, Xuelong ;
Fen, Jun ;
Tao, Dacheng .
PATTERN RECOGNITION LETTERS, 2009, 30 (02) :140-147
[7]   MSKVS: Adaptive mean shift-based keyframe extraction for video summarization and a new objective verification approach [J].
Hannane, Rachida ;
Elboushaki, Abdessamad ;
Afdel, Karim .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 55 :179-200
[8]   An efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram [J].
Hannane, Rachida ;
Elboushaki, Abdessamad ;
Afdel, Karim ;
Naghabhushan, P. ;
Javed, Mohammed .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2016, 5 (02) :89-104
[9]  
He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]
[10]   A Survey on Visual Content-Based Video Indexing and Retrieval [J].
Hu, Weiming ;
Xie, Nianhua ;
Li, Li ;
Zeng, Xianglin ;
Maybank, Stephen .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2011, 41 (06) :797-819