CNN-Based Shot Boundary Detection and Video Annotation

被引：0

作者：

Tong, Wenjing ^{[1
]}

Song, Li ^{[1
,2
]}

Yang, Xiaokang ^{[1
,2
]}

Qu, Hui ^{[1
]}

Xie, Rong ^{[1
,2
]}

机构：

[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200030, Peoples R China

[2] Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China

来源：

2015 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB) | 2015年

关键词：

Retrieval and indexing; shot boundary detection; deep learning; convolutional neural networks; video coding and processing; FRAMEWORK;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the explosive growth of video data, content-based video analysis and management technologies such as indexing, browsing and retrieval have drawn much attention. Video shot boundary detection (SBD) is usually the first and important step for those technologies. Great efforts have been made to improve the accuracy of SBD algorithms. However, most works are based on signal rather than interpretable features of frames. In this paper, we propose a novel video shot boundary detection framework based on interpretable TAGs learned by Convolutional Neural Networks (CNNs). Firstly, we adopt a candidate segment selection to predict the positions of shot boundaries and discard most non-boundary frames. This preprocessing method can help to improve both accuracy and speed of the SBD algorithm. Then, cut transition and gradual transition detections which are based on the interpretable TAGs are conducted to identify the shot boundaries in the candidate segments. Afterwards, we synthesize the features of frames in a shot and get semantic labels for the shot. Experiments on TRECVID 2001 test data show that the proposed scheme can achieve a better performance compared with the state-of-the-art schemes. Besides, the semantic labels obtained by the framework can be used to depict the content of a shot.

引用

页数：5

共 50 条

[1] A Video Shot Boundary Detection Approach based on CNN Feature
Liang, Rui
Zhu, Qingxin
Wei, Honglei
Liao, Shujiao
2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 489 - 494
[2] A CNN-based misleading video detection model
Xiaojun Li
Xvhao Xiao
Jia Li
Changhua Hu
Junping Yao
Shaochen Li
Scientific Reports, 12
[3] A CNN-based misleading video detection model
Li, Xiaojun
Xiao, Xvhao
Li, Jia
Hu, Changhua
Yao, Junping
Li, Shaochen
SCIENTIFIC REPORTS, 2022, 12 (01)
[4] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
Zahra Kamranian
Ahmad Reza Naghsh Nilchi
Hamid Sadeghian
Federico Tombari
Nassir Navab
Neural Computing and Applications, 2020, 32 : 4073 - 4091
[5] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
Kamranian, Zahra
Nilchi, Ahmad Reza Naghsh
Sadeghian, Hamid
Tombari, Federico
Navab, Nassir
NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4073 - 4091
[6] CNN-Based Traffic Volume Video Detection Method
Chen, Tao
Li, Xuchuan
Guo, Congshuai
Fan, Linkun
CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 2435 - 2445
[7] CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization
Al Nahian, Mohaiminul
Iftekhar, A. S. M.
Islam, Mohammad Tariqul
Rahman, S. M. Mahbubur
Hatzinakos, Dimitrios
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 24 - 29
[8] Video Compression With CNN-Based Postprocessing
Zhang, Fan
Ma, Di
Feng, Chen
Bull, David R.
IEEE MULTIMEDIA, 2021, 28 (04) : 74 - 83
[9] Training Strategies and Data Augmentations in CNN-based DeepFake Video Detection
Bondi, Luca
Cannas, Edoardo Daniele
Bestagini, Paolo
Tubaro, Stefano
2020 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2020,
[10] High speed road boundary detection with CNN-based dynamic programming
Kim, H
Hong, S
Oh, T
Lee, J
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 806 - 813

← 1 2 3 4 5 →