A multi-stage deep adversarial network for video summarization with knowledge distillation

被引:0
|
作者
M. U. Sreeja
Binsu C. Kovoor
机构
[1] Cochin University of Science and Technology,Division of Information Technology
关键词
GAN; Static summaries; Dynamic summaries; Knowledge distillation; Adversarial learning; Key frame; Key segment;
D O I
暂无
中图分类号
学科分类号
摘要
Video summarization is defined as the process of automatically identifying and extracting the relevant contents from a video that can best represent the contents of the video. The proposed model implements a video summarization framework based on generative adversarial network (GAN) for feature extraction and knowledge distillation for key frame or segment selection. The ideal characteristics of a video summary is diversity and representativeness. The primary stage of the proposed model based on adversarial learning ensures that the extracted features contain diverse and representative elements from the video. The generator is a convolutional recurrent autoencoder that learns the hidden representation of the video through the reconstruction loss. The generator model is followed by a discriminator that aims at improving the efficiency of the generator model by trying to discriminate between the original and reconstructed video samples. The adversarial network is followed by a knowledge distillation phase which acts as a key frame or segment selector by employing a simple network whose input data is retrieved from the preceding GAN model. Comprehensive evaluations conducted on public and custom datasets substantiate the relevance of GANs and knowledge distillation phase for video summarization. Quantitative and qualitative evaluations further prove that the proposed model produces remarkable results with summaries that are diverse, representative and concise.
引用
收藏
页码:9823 / 9838
页数:15
相关论文
共 50 条
  • [41] Unsupervised video summarization with adversarial graph-based attention network
    Gunuganti, Jeshmitha
    Yeh, Zhi-Ting
    Wang, Jenq-Haur
    Norouzi, Mehdi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [42] MTMS: Multi-teacher Multi-stage Knowledge Distillation for Reasoning-Based Machine Reading Comprehension
    Zhao, Zhuo
    Xie, Zhiwen
    Zhou, Guangyou
    Huang, Jimmy Xiangji
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1995 - 2005
  • [43] Deep Semantic and Attentive Network for Unsupervised Video Summarization
    Zhong, Sheng-Hua
    Lin, Jingxu
    Lu, Jianglin
    Fares, Ahmed
    Ren, Tongwei
    ACM Transactions on Multimedia Computing, Communications and Applications, 2022, 18 (02)
  • [45] Adversarial Encoder-Multi-Task-Decoder for Multi-Stage Processes
    Mendes, Andre
    Togelius, Julian
    Coelho, Leandro dos Santos
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 763 - 770
  • [46] Stochastic reconstruction of porous media based on attention mechanisms and multi-stage generative adversarial network
    Ting Zhang
    Peng Zhu
    Fangfang Lu
    Computational Geosciences, 2023, 27 : 515 - 536
  • [47] Stochastic reconstruction of porous media based on attention mechanisms and multi-stage generative adversarial network
    Zhang, Ting
    Zhu, Peng
    Lu, Fangfang
    COMPUTATIONAL GEOSCIENCES, 2023, 27 (03) : 515 - 536
  • [48] Lightweight intelligent fault diagnosis method based on a multi-stage pruning distillation interleaving network
    Ren, Linlin
    Li, Xiaoming
    Ma, Hongbo
    Zhang, Guowei
    Huang, Song
    Chen, Ke
    Wang, Xiaoqing
    Yue, Weijie
    ADVANCES IN MECHANICAL ENGINEERING, 2024, 16 (09)
  • [49] Compressing deep graph convolution network with multi-staged knowledge distillation
    Kim, Junghun
    Jung, Jinhong
    Kang, U.
    PLOS ONE, 2021, 16 (08):
  • [50] Multi-Level Spatiotemporal Network for Video Summarization
    Yao, Ming
    Bai, Yu
    Du, Wei
    Zhang, Xuejun
    Quan, Heng
    Cai, Fuli
    Kang, Hongwei
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,