CNN-Based Shot Boundary Detection and Video Annotation

被引:0
|
作者
Tong, Wenjing [1 ]
Song, Li [1 ,2 ]
Yang, Xiaokang [1 ,2 ]
Qu, Hui [1 ]
Xie, Rong [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai 200030, Peoples R China
[2] Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
关键词
Retrieval and indexing; shot boundary detection; deep learning; convolutional neural networks; video coding and processing; FRAMEWORK;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the explosive growth of video data, content-based video analysis and management technologies such as indexing, browsing and retrieval have drawn much attention. Video shot boundary detection (SBD) is usually the first and important step for those technologies. Great efforts have been made to improve the accuracy of SBD algorithms. However, most works are based on signal rather than interpretable features of frames. In this paper, we propose a novel video shot boundary detection framework based on interpretable TAGs learned by Convolutional Neural Networks (CNNs). Firstly, we adopt a candidate segment selection to predict the positions of shot boundaries and discard most non-boundary frames. This preprocessing method can help to improve both accuracy and speed of the SBD algorithm. Then, cut transition and gradual transition detections which are based on the interpretable TAGs are conducted to identify the shot boundaries in the candidate segments. Afterwards, we synthesize the features of frames in a shot and get semantic labels for the shot. Experiments on TRECVID 2001 test data show that the proposed scheme can achieve a better performance compared with the state-of-the-art schemes. Besides, the semantic labels obtained by the framework can be used to depict the content of a shot.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] A Video Shot Boundary Detection Approach based on CNN Feature
    Liang, Rui
    Zhu, Qingxin
    Wei, Honglei
    Liao, Shujiao
    2017 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2017, : 489 - 494
  • [2] A CNN-based misleading video detection model
    Xiaojun Li
    Xvhao Xiao
    Jia Li
    Changhua Hu
    Junping Yao
    Shaochen Li
    Scientific Reports, 12
  • [3] A CNN-based misleading video detection model
    Li, Xiaojun
    Xiao, Xvhao
    Li, Jia
    Hu, Changhua
    Yao, Junping
    Li, Shaochen
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [4] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
    Zahra Kamranian
    Ahmad Reza Naghsh Nilchi
    Hamid Sadeghian
    Federico Tombari
    Nassir Navab
    Neural Computing and Applications, 2020, 32 : 4073 - 4091
  • [5] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
    Kamranian, Zahra
    Nilchi, Ahmad Reza Naghsh
    Sadeghian, Hamid
    Tombari, Federico
    Navab, Nassir
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4073 - 4091
  • [6] CNN-Based Traffic Volume Video Detection Method
    Chen, Tao
    Li, Xuchuan
    Guo, Congshuai
    Fan, Linkun
    CICTP 2020: TRANSPORTATION EVOLUTION IMPACTING FUTURE MOBILITY, 2020, : 2435 - 2445
  • [7] CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization
    Al Nahian, Mohaiminul
    Iftekhar, A. S. M.
    Islam, Mohammad Tariqul
    Rahman, S. M. Mahbubur
    Hatzinakos, Dimitrios
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 24 - 29
  • [8] Video Compression With CNN-Based Postprocessing
    Zhang, Fan
    Ma, Di
    Feng, Chen
    Bull, David R.
    IEEE MULTIMEDIA, 2021, 28 (04) : 74 - 83
  • [9] Training Strategies and Data Augmentations in CNN-based DeepFake Video Detection
    Bondi, Luca
    Cannas, Edoardo Daniele
    Bestagini, Paolo
    Tubaro, Stefano
    2020 IEEE INTERNATIONAL WORKSHOP ON INFORMATION FORENSICS AND SECURITY (WIFS), 2020,
  • [10] High speed road boundary detection with CNN-based dynamic programming
    Kim, H
    Hong, S
    Oh, T
    Lee, J
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002, PROCEEDING, 2002, 2532 : 806 - 813