Unsupervised Video Summarization via Attention-Driven Adversarial Learning

被引:45
|
作者
Apostolidis, Evlampios [1 ,2 ]
Adamantidou, Eleni [1 ]
Metsai, Alexandros, I [1 ]
Mezaris, Vasileios [1 ]
Patras, Ioannis [2 ]
机构
[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi, Greece
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
来源
MULTIMEDIA MODELING (MMM 2020), PT I | 2020年 / 11961卷
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
Video summarization; Unsupervised learning; Attention mechanism; Adversarial learning;
D O I
10.1007/978-3-030-37731-1_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new video summarization approach that integrates an attention mechanism to identify the significant parts of the video, and is trained unsupervisingly via generative adversarial learning. Starting from the SUM-GAN model, we first develop an improved version of it (called SUM-GAN-sl) that has a significantly reduced number of learned parameters, performs incremental training of the model's components, and applies a stepwise label-based strategy for updating the adversarial part. Subsequently, we introduce an attention mechanism to SUM-GAN-sl in two ways: (i) by integrating an attention layer within the variational auto-encoder (VAE) of the architecture (SUM-GAN-VAAE), and (ii) by replacing the VAE with a deterministic attention auto-encoder (SUM-GAN-AAE). Experimental evaluation on two datasets (SumMe and TVSum) documents the contribution of the attention auto-encoder to faster and more stable training of the model, resulting in a significant performance improvement with respect to the original model and demonstrating the competitiveness of the proposed SUM-GAN-AAE against the state of the art (Software publicly available at: https://github.com/e-apostolidis/SUM-GAN-AAE).
引用
收藏
页码:492 / 504
页数:13
相关论文
共 50 条
  • [1] Unsupervised video summarization with adversarial graph-based attention network
    Gunuganti, Jeshmitha
    Yeh, Zhi-Ting
    Wang, Jenq-Haur
    Norouzi, Mehdi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
  • [2] Unsupervised Video Summarization with Adversarial LSTM Networks
    Mahasseni, Behrooz
    Lam, Michael
    Todorovic, Sinisa
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2982 - 2991
  • [3] Integrate the Temporal Scheme for Unsupervised Video Summarization via Attention Mechanism
    Bang, Vo Quoc
    Viet, Vo Hoai
    IEEE ACCESS, 2025, 13 : 38147 - 38162
  • [4] ADVERSARIAL UNSUPERVISED VIDEO SUMMARIZATION AUGMENTED WITH DICTIONARY LOSS
    Kaseris, Michail
    Mademlis, Ioannis
    Pitas, Ioannis
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2683 - 2687
  • [5] Attentive and Adversarial Learning for Video Summarization
    Fu, Tsu-Jui
    Tai, Shao-Heng
    Chen, Hwann-Tzong
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1579 - 1587
  • [6] Unsupervised Video Summarization via Relation-Aware Assignment Learning
    Gao, Junyu
    Yang, Xiaoshan
    Zhang, Yingying
    Xu, Changsheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3203 - 3214
  • [7] Recurrent generative adversarial networks for unsupervised WCE video summarization
    Lan, Libin
    Ye, Chunxiao
    KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [8] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
    He, Xufeng
    Hua, Yang
    Song, Tao
    Zhang, Zongpu
    Xue, Zhengui
    Ma, Ruhui
    Robertson, Neil
    Guan, Haibing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
  • [9] Attention-Driven Loss for Anomaly Detection in Video Surveillance
    Zhou, Joey Tianyi
    Zhang, Le
    Fang, Zhiwen
    Du, Jiawei
    Peng, Xi
    Xiao, Yang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4639 - 4647
  • [10] Discriminative Feature Learning for Unsupervised Video Summarization
    Jung, Yunjae
    Cho, Donghyeon
    Kim, Dahun
    Woo, Sanghyun
    Kweon, In So
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8537 - 8544