Detecting shot boundary with sparse coding for video summarization

被引:26
作者
Li, Jiatong [1 ]
Yao, Ting [2 ]
Ling, Qiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;
D O I
10.1016/j.neucom.2017.04.065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 78
页数:13
相关论文
共 50 条
  • [31] Unleashing the Power of Contrastive Learning for Zero-Shot Video Summarization
    Pang, Zongshang
    Nakashima, Yuta
    Otani, Mayu
    Nagahara, Hajime
    JOURNAL OF IMAGING, 2024, 10 (09)
  • [32] Video Summarization Based on Mutual Information and Entropy Sliding Window Method
    Li, WenLin
    Qi, Deyu
    Zhang, ChangJian
    Guo, Jing
    Yao, JiaJun
    ENTROPY, 2020, 22 (11) : 1 - 16
  • [33] Echocardiogram video summarization
    Ebadollahi, S
    Chang, SF
    Wu, H
    Takoma, S
    MEDICAL IMAGING 2001: ULTRASONIC IMAGING AND SIGNAL PROCESSING, 2001, 4325 : 492 - 501
  • [34] Hierarchical video summarization
    Ratakonda, K
    Sezan, MI
    Crinon, R
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1531 - 1541
  • [35] An improved algorithm of video shot boundary detection
    Zhang Jianfeng
    Wei Zhiqiang
    Jiang Shuming
    Li Jian
    Xu Shijie
    Wang Shuai
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1258 - 1261
  • [36] Multi-scale deep feature fusion based sparse dictionary selection for video summarization
    Wu, Xiao
    Ma, Mingyang
    Wan, Shuai
    Han, Xiuxiu
    Mei, Shaohui
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [37] Sparse coded handcrafted and deep features for colon capsule video summarization
    Mohammed, Ahmed
    Yildirim, Sule
    Pedersen, Marius
    Hovde, Oistein
    Cheikh, Faouzi
    2017 IEEE 30TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2017, : 728 - 733
  • [38] Simple Method for Detecting Visual Saliencies based on Dictionary Learning and Sparse Coding
    Leal, Nallig
    Moreno, Silvia
    Zurek, Eduardo
    2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2019,
  • [39] Sparse Coding based Frequency Adaptive Loop Filtering for Video Coding
    Schneider, Jens
    Blaeser, Max
    Wien, Mathias
    PROCEEDINGS OF THE 23TH ACM WORKSHOP ON PACKET VIDEO (PV'18), 2018, : 48 - 53
  • [40] Interactive Exploration of Surveillance Video through Action Shot Summarization and Trajectory Visualization
    Meghdadi, Amir H.
    Irani, Pourang
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) : 2119 - 2128