Unsupervised Video Summarization via Attention-Driven Adversarial Learning

被引:45
作者
Apostolidis, Evlampios [1 ,2 ]
Adamantidou, Eleni [1 ]
Metsai, Alexandros, I [1 ]
Mezaris, Vasileios [1 ]
Patras, Ioannis [2 ]
机构
[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi, Greece
[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England
来源
MULTIMEDIA MODELING (MMM 2020), PT I | 2020年 / 11961卷
基金
欧盟地平线“2020”; 英国工程与自然科学研究理事会;
关键词
Video summarization; Unsupervised learning; Attention mechanism; Adversarial learning;
D O I
10.1007/978-3-030-37731-1_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a new video summarization approach that integrates an attention mechanism to identify the significant parts of the video, and is trained unsupervisingly via generative adversarial learning. Starting from the SUM-GAN model, we first develop an improved version of it (called SUM-GAN-sl) that has a significantly reduced number of learned parameters, performs incremental training of the model's components, and applies a stepwise label-based strategy for updating the adversarial part. Subsequently, we introduce an attention mechanism to SUM-GAN-sl in two ways: (i) by integrating an attention layer within the variational auto-encoder (VAE) of the architecture (SUM-GAN-VAAE), and (ii) by replacing the VAE with a deterministic attention auto-encoder (SUM-GAN-AAE). Experimental evaluation on two datasets (SumMe and TVSum) documents the contribution of the attention auto-encoder to faster and more stable training of the model, resulting in a significant performance improvement with respect to the original model and demonstrating the competitiveness of the proposed SUM-GAN-AAE against the state of the art (Software publicly available at: https://github.com/e-apostolidis/SUM-GAN-AAE).
引用
收藏
页码:492 / 504
页数:13
相关论文
共 50 条
[21]   A Novel Attention-Driven Framework for Unsupervised Pedestrian Re-identification with Clustering Optimization [J].
Wang, Xuan ;
Sun, Zhaojie ;
Chehri, Abdellah ;
Jeon, Gwanggil ;
Song, Yongchao .
PATTERN RECOGNITION, 2024, 146
[22]   LEARNING HIERARCHICAL SELF-ATTENTION FOR VIDEO SUMMARIZATION [J].
Liu, Yen-Ting ;
Li, Yu-Jhe ;
Yang, Fu-En ;
Chen, Shang-Fu ;
Wang, Yu-Chiang Frank .
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, :3377-3381
[23]   Meta Learning for Task-Driven Video Summarization [J].
Li, Xuelong ;
Li, Hongli ;
Dong, Yongsheng .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (07) :5778-5786
[24]   Attention-Driven Transfer Learning Model for Improved IoT Intrusion Detection [J].
Abdelhamid, Salma ;
Hegazy, Islam ;
Aref, Mostafa ;
Roushdy, Mohamed .
BIG DATA AND COGNITIVE COMPUTING, 2024, 8 (09)
[25]   Automatic video summarization driven by a spatio-temporal attention model [J].
Barland, R. ;
Saadane, A. .
HUMAN VISION AND ELECTRONIC IMAGING XIII, 2008, 6806
[26]   An Unsupervised Video Summarization Method Based on Multimodal Representation [J].
Lei, Zhuo ;
Yu, Qiang ;
Shou, Lidan ;
Li, Shengquan ;
Mao, Yunqing .
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 :171-180
[27]   A Study on the Use of Attention for Explaining Video Summarization [J].
Apostolidis, Evlampios ;
Mezaris, Vasileios ;
Patras, Ioannis .
PROCEEDINGS OF THE 2ND WORKSHOP ON USER-CENTRIC NARRATIVE SUMMARIZATION OF LONG VIDEOS, NARSUM 2023, 2023, :41-49
[28]   Explaining video summarization based on the focus of attention [J].
Apostolidis, Evlampios ;
Balaouras, Georgios ;
Mezaris, Vasileios ;
Patras, Ioannis .
2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, :146-150
[29]   Enhancing BVR Air Combat Agent Development With Attention-Driven Reinforcement Learning [J].
Kuroswiski, Andre R. ;
Wu, Annie S. ;
Passaro, Angelo .
IEEE ACCESS, 2025, 13 :70446-70463
[30]   Adaptive Attention-Driven Few-Shot Learning for Robust Fault Diagnosis [J].
Wang, Zhe ;
Ding, Yi ;
Han, Te ;
Xu, Qiang ;
Yan, Hong ;
Xie, Min .
IEEE SENSORS JOURNAL, 2024, 24 (16) :26034-26043