Detecting shot boundary with sparse coding for video summarization

被引:26
|
作者
Li, Jiatong [1 ]
Yao, Ting [2 ]
Ling, Qiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;
D O I
10.1016/j.neucom.2017.04.065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 78
页数:13
相关论文
共 50 条
  • [21] Nonlinear Block Sparse Dictionary Selection for Video Summarization
    Ma M.
    Mei S.
    Wan S.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2019, 53 (05): : 142 - 148
  • [22] Patch Based Video Summarization With Block Sparse Representation
    Mei, Shaohui
    Ma, Mingyang
    Wan, Shuai
    Hou, Junhui
    Wang, Zhiyong
    Feng, David Dagan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 732 - 747
  • [23] Color video denoising using epitome and sparse coding
    Lee, Hwea Yee
    Hoo, Wai Lam
    Chan, Chee Seng
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (02) : 751 - 759
  • [24] Video summarization via block sparse dictionary selection
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Hou, Junhui
    Wang, Zhiyong
    Feng, David Dagan
    NEUROCOMPUTING, 2020, 378 : 197 - 209
  • [25] Shot Boundary Detection in Video Retrieval
    Wu, Zhonglan
    Xu, Pin
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2014, : 86 - 89
  • [26] Video Shot Boundary Detection: A Review
    SenGupta, Ananya
    Thounaojam, Dalton Meitei
    Singh, Kh. Manglem
    Roy, Sudipta
    2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
  • [27] Video Shot Boundary Detection: A Review
    Pal, Gautam
    Rudrapaul, Dwijen
    Acharjee, Suvojit
    Ray, Ruben
    Chakraborty, Sayan
    Dey, Nilanjan
    EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 : 119 - 127
  • [28] Similarity Based Block Sparse Subset Selection for Video Summarization
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Wang, Zhiyong
    Feng, David Dagan
    Bennamoun, Mohammed
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) : 3967 - 3980
  • [29] Graph-based structural difference analysis for video summarization
    Chai, Chunlei
    Lu, Guoliang
    Wang, Ruyun
    Lyu, Chen
    Lyu, Lei
    Zhang, Peng
    Liu, Hong
    INFORMATION SCIENCES, 2021, 577 : 483 - 509
  • [30] Video shot-boundary detection: issues, challenges and solutions
    Kar, T.
    Kanungo, P.
    Mohanty, Sachi Nandan
    Groppe, Sven
    Groppe, Jinghua
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)