Cluster-Based Video Summarization with Temporal Context Awareness

被引:0
作者
Hai-Dang Huynh-Lam [1 ,2 ]
Ngoc-Phuong Ho-Thi [1 ,2 ]
Minh-Triet Tran [1 ,2 ]
Trung-Nghia Le [1 ,2 ]
机构
[1] VNU HCM, Univ Sci, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
来源
IMAGE AND VIDEO TECHNOLOGY, PSIVT 2023 | 2024年 / 14403卷
关键词
video summarization; clustering; unsupervised learning;
D O I
10.1007/978-981-97-0376-0_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present TAC-SUM, a novel and efficient training-free approach for video summarization that addresses the limitations of existing cluster-based models by incorporating temporal context. Our method partitions the input video into temporally consecutive segments with clustering information, enabling the injection of temporal awareness into the clustering process, setting it apart from prior cluster-based summarization methods. The resulting temporal-aware clusters are then utilized to compute the final summary, using simple rules for keyframe selection and frame importance scoring. Experimental results on the SumMe dataset demonstrate the effectiveness of our proposed approach, outperforming existing unsupervised methods and achieving comparable performance to state-of-the-art supervised summarization techniques. Our source code is available for reference at https://github.com/hcmus-thesis-gulu/TAC-SUM.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 30 条
  • [21] Video Summarization Using Fully Convolutional Sequence Networks
    Rochan, Mrigank
    Ye, Linwei
    Wang, Yang
    [J]. COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 358 - 374
  • [22] Video Precis: Highlighting Diverse Aspects of Videos
    Shroff, Nitesh
    Turaga, Pavan
    Chellappa, Rama
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (08) : 853 - 868
  • [23] Stacked Memory Network for Video Summarization
    Wang, Junbo
    Wang, Wei
    Wang, Zhiyong
    Wang, Liang
    Feng, Dagan
    Tan, Tieniu
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 836 - 844
  • [24] Yunjae Jung, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12370), P167, DOI 10.1007/978-3-030-58595-2_11
  • [25] Video Summarization with Long Short-Term Memory
    Zhang, Ke
    Chao, Wei-Lun
    Sha, Fei
    Grauman, Kristen
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 766 - 782
  • [26] Zhang T., 1996, P 1996 ACM SIGMOD IN, P103, DOI [DOI 10.1145/235968.233324, 10.1145/235968.233324]
  • [27] TTH-RNN: Tensor-Train Hierarchical Recurrent Neural Network for Video Summarization
    Zhao, Bin
    Li, Xuelong
    Lu, Xiaoqiang
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 68 (04) : 3629 - 3637
  • [28] Hierarchical Recurrent Neural Network for Video Summarization
    Zhao, Bin
    Li, Xuelong
    Lu, Xiaoqiang
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 863 - 871
  • [29] HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization
    Zhao, Bin
    Li, Xuelong
    Lu, Xiaoqiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7405 - 7414
  • [30] Zhou KY, 2018, AAAI CONF ARTIF INTE, P7582