HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization

被引:140
作者
Zhao, Bin [1 ,2 ]
Li, Xuelong [3 ]
Lu, Xiaoqiang [3 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian, Shaanxi, Peoples R China
[3] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian, Shaanxi, Peoples R China
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
SHOT;
D O I
10.1109/CVPR.2018.00773
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Although video summarization has achieved great success in recent years, few approaches have realized the influence of video structure on the summarization results. As we know, the video data follow ahierarchical structure, i.e., a video is composed of shots, and a shot is composed of several frames. Generally, shots provide the activity-level information for people to understand the video content. While few existing summarization approaches pay attention to the shot segmentation procedure. They generate shots by some trivial strategies, such as fixed length segmentation, which may destroy the underlying hierarchical structure of video data and further reduce the quality of generated summaries. To address this problem, we propose a structure-adaptive video summarization approach that integrates shot segmentation and video summarization into a Hierarchical Structure-Adaptive RNN, denoted as HSA-RNN. We evaluate the proposed approach on four popular datasets, i.e., SumMe, TVsum, CoSum and VTW. The experimental results have demonstrated the effectiveness of HSA-RNN in the video summarization task.
引用
收藏
页码:7405 / 7414
页数:10
相关论文
共 40 条
  • [1] Aner A, 2002, LECT NOTES COMPUT SC, V2353, P388
  • [2] [Anonymous], P IEEE INT C MULT EX
  • [3] [Anonymous], CORR
  • [4] [Anonymous], 2017, P IEEE C COMP VIS PA
  • [5] [Anonymous], 2014, CORR
  • [6] [Anonymous], IEEE C COMP VIS PATT
  • [7] [Anonymous], 2014, CORR
  • [8] [Anonymous], 2016, CORR
  • [9] [Anonymous], CORR
  • [10] Apostolidis Evlampios, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6583, DOI 10.1109/ICASSP.2014.6854873