Detecting shot boundary with sparse coding for video summarization

被引：26

作者：

Li, Jiatong ^{[1
]}

Yao, Ting ^{[2
]}

Ling, Qiang ^{[1
]}

Mei, Tao ^{[2
]}

机构：

[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Anhui, Peoples R China

[2] Microsoft Res Asia, Beijing 100080, Peoples R China

来源：

NEUROCOMPUTING | 2017年 / 266卷

关键词：

Video summarization; Shot boundary detection; Keyframe selection; Sparse coding; Dictionary learning; KEY FRAME EXTRACTION; ALGORITHM;

D O I：

10.1016/j.neucom.2017.04.065

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Keyframe selection is a common way to summarize video contents. However, delimiting shot boundaries to extract a representative keyframe from each shot is not trivial as most shot boundary techniques are heuristic and sensitive to the types of video transitions. This paper proposes a new shot boundary detection algorithm, that learns a dictionary from the given video using sparse coding and updates atoms in the dictionary, following the philosophy that different shots cannot be reconstructed using the learned dictionary. Technically, our algorithm conducts the learning by simultaneously minimizing the reconstruction loss, restricting the sparsity of the reconstruction matrix, and preserving the structure across patches and frames. Once shot boundaries are determined, one representative keyframe is selected from each shot and then a video summary is constructed by concatenating the representative keyframes through a post process. On two standard video datasets across various genres, i.e., VSUMM and YouTube datasets, our method is shown to be powerful for video summarization with superior performance over several state-of-the-art techniques. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：66 / 78

页数：13

共 50 条

[21] Nonlinear Block Sparse Dictionary Selection for Video Summarization
Ma M.
Mei S.
Wan S.
Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2019, 53 (05): : 142 - 148
[22] Patch Based Video Summarization With Block Sparse Representation
Mei, Shaohui
Ma, Mingyang
Wan, Shuai
Hou, Junhui
Wang, Zhiyong
Feng, David Dagan
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 732 - 747
[23] Color video denoising using epitome and sparse coding
Lee, Hwea Yee
Hoo, Wai Lam
Chan, Chee Seng
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (02) : 751 - 759
[24] Video summarization via block sparse dictionary selection
Ma, Mingyang
Mei, Shaohui
Wan, Shuai
Hou, Junhui
Wang, Zhiyong
Feng, David Dagan
NEUROCOMPUTING, 2020, 378 : 197 - 209
[25] Shot Boundary Detection in Video Retrieval
Wu, Zhonglan
Xu, Pin
2013 IEEE 4TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2014, : 86 - 89
[26] Video Shot Boundary Detection: A Review
SenGupta, Ananya
Thounaojam, Dalton Meitei
Singh, Kh. Manglem
Roy, Sudipta
2015 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES, 2015,
[27] Video Shot Boundary Detection: A Review
Pal, Gautam
Rudrapaul, Dwijen
Acharjee, Suvojit
Ray, Ruben
Chakraborty, Sayan
Dey, Nilanjan
EMERGING ICT FOR BRIDGING THE FUTURE, VOL 2, 2015, 338 : 119 - 127
[28] Similarity Based Block Sparse Subset Selection for Video Summarization
Ma, Mingyang
Mei, Shaohui
Wan, Shuai
Wang, Zhiyong
Feng, David Dagan
Bennamoun, Mohammed
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) : 3967 - 3980
[29] Graph-based structural difference analysis for video summarization
Chai, Chunlei
Lu, Guoliang
Wang, Ruyun
Lyu, Chen
Lyu, Lei
Zhang, Peng
Liu, Hong
INFORMATION SCIENCES, 2021, 577 : 483 - 509
[30] Video shot-boundary detection: issues, challenges and solutions
Kar, T.
Kanungo, P.
Mohanty, Sachi Nandan
Groppe, Sven
Groppe, Jinghua
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)

← 1 2 3 4 5 →