Detecting shot boundary with sparse coding for video summarization

被引:26
作者
Li, Jiatong [1 ]
Yao, Ting [2 ]
Ling, Qiang [1 ]
Mei, Tao [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;
D O I
10.1016/j.neucom.2017.04.065
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:66 / 78
页数:13
相关论文
共 50 条
  • [41] Video Shot Boundary Detection Based on MB Coding Mode and SIFT Features on H.264/AVC
    Zhang, Qingming
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2014, : 299 - 302
  • [42] Vehicle Recognition for Surveillance Video Using Sparse Coding
    Zeng, Shirong
    Niu, Xin
    Dou, Yong
    PATTERN RECOGNITION (CCPR 2014), PT II, 2014, 484 : 228 - 234
  • [43] Low bit-rate SNR scalable video coding based on overcomplete dictionary learning and sparse representation
    Irannejad, Maziar
    Mahdavi-Nasab, Homayoun
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2020, 31 (02) : 465 - 489
  • [44] Action based Video Summarization
    Raksha, H.
    Namitha, G.
    Sejal, N.
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 457 - 462
  • [45] Efficient Bronchoscopic Video Summarization
    Byrnes, Patrick D.
    Higgins, William Evan
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2019, 66 (03) : 848 - 863
  • [46] Forward-Backward Nonlinear Sparse Dictionary Selection based Video Summarization
    Ma, Mingyang
    Mei, Shaohui
    Wan, Shuai
    Wang, Zhiyong
    Feng, David Dagan
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [47] Adaptive Multiview Graph Difference Analysis for Video Summarization
    Ma, Caixia
    Lyu, Lei
    Lu, Guoliang
    Lyu, Chen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8795 - 8808
  • [48] A Novel Metric for Efficient Video Shot Boundary Detection
    Sun, Juan
    Wan, Yi
    2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 45 - 48
  • [49] Dichotomic Decision Cascading for Video Shot Boundary Detection
    Guder, Mennan
    Cicekli, Nihan Kesim
    2013 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2013, : 227 - 230
  • [50] L2,0 CONSTRAINED SPARSE DICTIONARY SELECTION FOR VIDEO SUMMARIZATION
    Mei, Shaohui
    Guan, Genliang
    Wang, Zhiyong
    He, Mingyi
    Hua, Xian-Sheng
    Feng, David Dagan
    2014 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2014,