Video Summarization Using Deep Neural Networks: A Survey

被引:98
|
作者
Apostolidis, Evlampios [1 ,2 ]
Adamantidou, Eleni [1 ]
Metsai, Alexandros, I [1 ]
Mezaris, Vasileios [1 ]
Patras, Ioannis [2 ]
机构
[1] Ctr Res & Technol Hellas, Informat Technol Inst, GR-57001 Thessaloniki, Greece
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4N5, England
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
Training data; Deep learning; Taxonomy; Systematics; Recurrent neural networks; VIdeo sequences; Neural networks; Deep neural networks; evaluation protocols; summarization datasets; supervised learning; unsupervised learning; video summarization;
D O I
10.1109/JPROC.2021.3117472
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Video summarization technologies aim to create a concise and complete synopsis by selecting the most informative parts of the video content. Several approaches have been developed over the last couple of decades, and the current state of the art is represented by methods that rely on modern deep neural network architectures. This work focuses on the recent advances in the area and provides a comprehensive survey of the existing deep-learning-based methods for generic video summarization. After presenting the motivation behind the development of technologies for video summarization, we formulate the video summarization task and discuss the main characteristics of a typical deep-learning-based analysis pipeline. Then, we suggest a taxonomy of the existing algorithms and provide a systematic review of the relevant literature that shows the evolution of the deep-learning-based video summarization technologies and leads to suggestions for future developments. We then report on protocols for the objective evaluation of video summarization algorithms, and we compare the performance of several deep-learning-based approaches. Based on the outcomes of these comparisons, as well as some documented considerations about the amount of annotated data and the suitability of evaluation protocols, we indicate potential future research directions.
引用
收藏
页码:1838 / 1863
页数:26
相关论文
共 50 条
  • [1] Unsupervised video summarization using deep Non-Local video summarization networks
    Zang, Sha-Sha
    Yu, Hui
    Song, Yan
    Zeng, Ru
    NEUROCOMPUTING, 2023, 519 : 26 - 35
  • [2] Summarization-based Video Caption via Deep Neural Networks
    Li, Guang
    Ma, Shubo
    Han, Yahong
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1191 - 1194
  • [3] Foveated convolutional neural networks for video summarization
    Jiaxin Wu
    Sheng-hua Zhong
    Zheng Ma
    Stephen J. Heinen
    Jianmin Jiang
    Multimedia Tools and Applications, 2018, 77 : 29245 - 29267
  • [4] Foveated convolutional neural networks for video summarization
    Wu, Jiaxin
    Zhong, Sheng-hua
    Ma, Zheng
    Heinen, Stephen J.
    Jiang, Jianmin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (22) : 29245 - 29267
  • [5] Information Graphic Summarization using a Collection of Multimodal Deep Neural Networks
    Kim, Edward
    Onweller, Connor
    McCoy, Kathleen E.
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10188 - 10195
  • [6] Deep hierarchical LSTM networks with attention for video summarization
    Lin, Jingxu
    Zhong, Sheng-hua
    Fares, Ahmed
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 97
  • [7] Unsupervised Video Summarization with Independently Recurrent Neural Networks
    Yaliniz, Gokhan
    Ikizler-Cinbis, Nazli
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [8] Video Summarization using Deep Convolutional Neural Networks and Mutual Probability-based K-Nearest Neighbour
    La, Jimson
    Ananth, J. P.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023, 35 (08) : 1251 - 1267
  • [9] VIDEO ERROR CONCEALMENT USING DEEP NEURAL NETWORKS
    Sankisa, Arun
    Punjabi, Arjun
    Katsaggelos, Aggelos K.
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 380 - 384
  • [10] Video Deblocking Using Multipath Deep Neural Networks
    Chou, Ping-Peng
    Leou, Jin-Jang
    Communications in Computer and Information Science, 2024, 2075 CCIS : 28 - 39