Unsupervised Video Summarization via Attention-Driven Adversarial Learning

被引：45

作者：

Apostolidis, Evlampios ^{[1
,2
]}

Adamantidou, Eleni ^{[1
]}

Metsai, Alexandros, I ^{[1
]}

Mezaris, Vasileios ^{[1
]}

Patras, Ioannis ^{[2
]}

机构：

[1] Ctr Res & Technol Hellas, Informat Technol Inst, Thermi, Greece

[2] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England

来源：

MULTIMEDIA MODELING (MMM 2020), PT I | 2020年 / 11961卷

基金：

英国工程与自然科学研究理事会; 欧盟地平线“2020”;

关键词：

Video summarization; Unsupervised learning; Attention mechanism; Adversarial learning;

D O I：

10.1007/978-3-030-37731-1_40

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a new video summarization approach that integrates an attention mechanism to identify the significant parts of the video, and is trained unsupervisingly via generative adversarial learning. Starting from the SUM-GAN model, we first develop an improved version of it (called SUM-GAN-sl) that has a significantly reduced number of learned parameters, performs incremental training of the model's components, and applies a stepwise label-based strategy for updating the adversarial part. Subsequently, we introduce an attention mechanism to SUM-GAN-sl in two ways: (i) by integrating an attention layer within the variational auto-encoder (VAE) of the architecture (SUM-GAN-VAAE), and (ii) by replacing the VAE with a deterministic attention auto-encoder (SUM-GAN-AAE). Experimental evaluation on two datasets (SumMe and TVSum) documents the contribution of the attention auto-encoder to faster and more stable training of the model, resulting in a significant performance improvement with respect to the original model and demonstrating the competitiveness of the proposed SUM-GAN-AAE against the state of the art (Software publicly available at: https://github.com/e-apostolidis/SUM-GAN-AAE).

引用

页码：492 / 504

页数：13

共 50 条

[1] Unsupervised video summarization with adversarial graph-based attention network
Gunuganti, Jeshmitha
Yeh, Zhi-Ting
Wang, Jenq-Haur
Norouzi, Mehdi
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 102
[2] Unsupervised Video Summarization with Adversarial LSTM Networks
Mahasseni, Behrooz
Lam, Michael
Todorovic, Sinisa
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2982 - 2991
[3] Integrate the Temporal Scheme for Unsupervised Video Summarization via Attention Mechanism
Bang, Vo Quoc
Viet, Vo Hoai
IEEE ACCESS, 2025, 13 : 38147 - 38162
[4] ADVERSARIAL UNSUPERVISED VIDEO SUMMARIZATION AUGMENTED WITH DICTIONARY LOSS
Kaseris, Michail
Mademlis, Ioannis
Pitas, Ioannis
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2683 - 2687
[5] Attentive and Adversarial Learning for Video Summarization
Fu, Tsu-Jui
Tai, Shao-Heng
Chen, Hwann-Tzong
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1579 - 1587
[6] Unsupervised Video Summarization via Relation-Aware Assignment Learning
Gao, Junyu
Yang, Xiaoshan
Zhang, Yingying
Xu, Changsheng
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3203 - 3214
[7] Recurrent generative adversarial networks for unsupervised WCE video summarization
Lan, Libin
Ye, Chunxiao
KNOWLEDGE-BASED SYSTEMS, 2021, 222
[8] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
He, Xufeng
Hua, Yang
Song, Tao
Zhang, Zongpu
Xue, Zhengui
Ma, Ruhui
Robertson, Neil
Guan, Haibing
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
[9] Attention-Driven Loss for Anomaly Detection in Video Surveillance
Zhou, Joey Tianyi
Zhang, Le
Fang, Zhiwen
Du, Jiawei
Peng, Xi
Xiao, Yang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (12) : 4639 - 4647
[10] Discriminative Feature Learning for Unsupervised Video Summarization
Jung, Yunjae
Cho, Donghyeon
Kim, Dahun
Woo, Sanghyun
Kweon, In So
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8537 - 8544

← 1 2 3 4 5 →