Unsupervised video summarization using deep Non-Local video summarization networks

被引:7
|
作者
Zang, Sha-Sha [1 ]
Yu, Hui [1 ,2 ]
Song, Yan [1 ]
Zeng, Ru [1 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Control Engn, Shanghai 200093, Peoples R China
[2] Univ Portsmouth, Sch Creat Technol, Portsmouth PO1 2DJ, England
关键词
Video summarization; Non -local convolutional network; Reinforcement learning; LSTM; Non -local Video Summarization Network;
D O I
10.1016/j.neucom.2022.11.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video summarization is to extract effective information from videos to quickly obtain the most informative summary. Most of the existing video summarization methods use recurrent neural networks and their variants such as long and short-term memory (LSTM), to simulates the variable range time dependence between video frames. However, those methods can only process serial inputs of the video frames along with the hidden layer information from the previous time step, which affects the performance and the quality of video summarization. To tackle this issue, we present a deep non-local video summarization network (DN-VSN) for original video abstracts in this paper. Our unsupervised model treats video summarization as a sequence of decision problems. Given an input video, the probability that a video frame is selected as a part of the summary is obtained through a non-local convolutional network, and a strategy gradient algorithm of reinforcement learning is adopted for optimization in the training phase. The proposed method has been tested on four widely used datasets. The experimental results show the superiority of the proposed unsupervised model over the state-of-the-art methods. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:26 / 35
页数:10
相关论文
共 50 条
  • [41] GVSUM: generic video summarization using deep visual features
    Basavarajaiah, Madhushree
    Sharma, Priyanka
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (09) : 14459 - 14476
  • [42] Summarization-based Video Caption via Deep Neural Networks
    Li, Guang
    Ma, Shubo
    Han, Yahong
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 1191 - 1194
  • [43] Video Summarization with LSTM and Deep Attention Models
    Casas, Luis Lebron
    Koblents, Eugenia
    MULTIMEDIA MODELING, MMM 2019, PT II, 2019, 11296 : 67 - 79
  • [44] Hierarchical Multimodal Attention for Deep Video Summarization
    Sanabria, Melissa
    Precioso, Frederic
    Menguy, Thomas
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7977 - 7984
  • [45] Video summarization via knowledge-aware multimodal deep networks
    Xie, Jiehang
    Chen, Xuanbai
    Zhao, Sicheng
    Lu, Shao-Ping
    KNOWLEDGE-BASED SYSTEMS, 2024, 293
  • [46] Online Video Summarization Based on Local Features
    Iparraguirre, Javier
    Delrieux, Claudio A.
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2014, 5 (02): : 41 - 53
  • [47] Deep attentive and semantic preserving video summarization
    Ji, Zhong
    Jiao, Fang
    Pang, Yanwei
    Shao, Ling
    NEUROCOMPUTING, 2020, 405 : 200 - 207
  • [48] Video Summarization Using Feature Dissimilarity
    Kim, Hyuncheol
    Yoon, Inhye
    Kim, TaeYong
    Paik, Joonki
    2016 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATIONS (ICEIC), 2016,
  • [49] Object of Interest and Unsupervised Learning-based Framework for an Effective Video Summarization Using Deep Learning
    Negi, Alok
    Kumar, Krishan
    Saini, Parul
    IETE JOURNAL OF RESEARCH, 2024, 70 (05) : 5019 - 5030
  • [50] Video summarization using motion descriptors
    Divakaran, A
    Peker, KA
    Sun, H
    STORAGE AND RETRIEVAL FOR MEDIA DATABASES 2001, 2001, 4315 : 517 - 522