Cluster-Based Video Summarization with Temporal Context Awareness

被引：0

作者：

Hai-Dang Huynh-Lam ^{[1
,2
]}

Ngoc-Phuong Ho-Thi ^{[1
,2
]}

Minh-Triet Tran ^{[1
,2
]}

Trung-Nghia Le ^{[1
,2
]}

机构：

[1] VNU HCM, Univ Sci, Ho Chi Minh City, Vietnam

[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam

来源：

IMAGE AND VIDEO TECHNOLOGY, PSIVT 2023 | 2024年 / 14403卷

关键词：

video summarization; clustering; unsupervised learning;

D O I：

10.1007/978-981-97-0376-0_2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we present TAC-SUM, a novel and efficient training-free approach for video summarization that addresses the limitations of existing cluster-based models by incorporating temporal context. Our method partitions the input video into temporally consecutive segments with clustering information, enabling the injection of temporal awareness into the clustering process, setting it apart from prior cluster-based summarization methods. The resulting temporal-aware clusters are then utilized to compute the final summary, using simple rules for keyframe selection and frame importance scoring. Experimental results on the SumMe dataset demonstrate the effectiveness of our proposed approach, outperforming existing unsupervised methods and achieving comparable performance to state-of-the-art supervised summarization techniques. Our source code is available for reference at https://github.com/hcmus-thesis-gulu/TAC-SUM.

引用

页码：15 / 28

页数：14

共 30 条

[21] Video Summarization Using Fully Convolutional Sequence Networks
Rochan, Mrigank
Ye, Linwei
Wang, Yang
[J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 358 - 374
[22] Video Precis: Highlighting Diverse Aspects of Videos
Shroff, Nitesh
Turaga, Pavan
Chellappa, Rama
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (08) : 853 - 868
[23] Stacked Memory Network for Video Summarization
Wang, Junbo
Wang, Wei
Wang, Zhiyong
Wang, Liang
Feng, Dagan
Tan, Tieniu
[J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 836 - 844
[24] Yunjae Jung, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12370), P167, DOI 10.1007/978-3-030-58595-2_11
[25] Video Summarization with Long Short-Term Memory
Zhang, Ke
Chao, Wei-Lun
Sha, Fei
Grauman, Kristen
[J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 766 - 782
[26] Zhang T., 1996, P 1996 ACM SIGMOD IN, P103, DOI [DOI 10.1145/235968.233324, 10.1145/235968.233324]
[27] TTH-RNN: Tensor-Train Hierarchical Recurrent Neural Network for Video Summarization
Zhao, Bin
Li, Xuelong
Lu, Xiaoqiang
[J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (04) : 3629 - 3637
[28] Hierarchical Recurrent Neural Network for Video Summarization
Zhao, Bin
Li, Xuelong
Lu, Xiaoqiang
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 863 - 871
[29] HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization
Zhao, Bin
Li, Xuelong
Lu, Xiaoqiang
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7405 - 7414
[30] Zhou KY, 2018, AAAI CONF ARTIF INTE, P7582

← 1 2 3 →