HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization

被引：140

作者：

Zhao, Bin ^{[1
,2
]}

Li, Xuelong ^{[3
]}

Lu, Xiaoqiang ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Shaanxi, Peoples R China

[2] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning OPTIMAL, Xian, Shaanxi, Peoples R China

[3] Chinese Acad Sci, Xian Inst Opt & Precis Mech, Xian, Shaanxi, Peoples R China

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

SHOT;

D O I：

10.1109/CVPR.2018.00773

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Although video summarization has achieved great success in recent years, few approaches have realized the influence of video structure on the summarization results. As we know, the video data follow ahierarchical structure, i.e., a video is composed of shots, and a shot is composed of several frames. Generally, shots provide the activity-level information for people to understand the video content. While few existing summarization approaches pay attention to the shot segmentation procedure. They generate shots by some trivial strategies, such as fixed length segmentation, which may destroy the underlying hierarchical structure of video data and further reduce the quality of generated summaries. To address this problem, we propose a structure-adaptive video summarization approach that integrates shot segmentation and video summarization into a Hierarchical Structure-Adaptive RNN, denoted as HSA-RNN. We evaluate the proposed approach on four popular datasets, i.e., SumMe, TVsum, CoSum and VTW. The experimental results have demonstrated the effectiveness of HSA-RNN in the video summarization task.

引用

页码：7405 / 7414

页数：10

共 40 条

[1] Aner A, 2002, LECT NOTES COMPUT SC, V2353, P388
[2] [Anonymous], P IEEE INT C MULT EX
[3] [Anonymous], CORR
[4] [Anonymous], 2017, P IEEE C COMP VIS PA
[5] [Anonymous], 2014, CORR
[6] [Anonymous], IEEE C COMP VIS PATT
[7] [Anonymous], 2014, CORR
[8] [Anonymous], 2016, CORR
[9] [Anonymous], CORR
[10] Apostolidis Evlampios, 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), P6583, DOI 10.1109/ICASSP.2014.6854873

← 1 2 3 4 →