Meta Learning for Task-Driven Video Summarization

被引:16
作者
Li, Xuelong [1 ,2 ]
Li, Hongli [1 ,2 ]
Dong, Yongsheng [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Ctr Opt Imagery Anal & Learning, Xian 710072, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Training; Metals; Streaming media; Computational modeling; Data models; Decoding; Keyframe extraction; meta learning; video summarization;
D O I
10.1109/TIE.2019.2931283
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing video summarization approaches mainly concentrate on the sequential or structural characteristic of video data. However, they do not pay enough attention to the video summarization task itself. In this article, we propose a meta learning method for performing task-driven video summarization, denoted by MetaL-TDVS, to explicitly explore the video summarization mechanism among summarizing processes on different videos. Particularly, MetaL-TDVS aims to excavate the latent mechanism for summarizing video by reformulating video summarization as a meta learning problem and promote the generalization ability of the trained model. MetaL-TDVS regards summarizing each video as a single task to make better use of the experience and knowledge learned from processes of summarizing other videos to summarize new ones. Furthermore, MetaL-TDVS updates models via a twofold backpropagation, which forces the model optimized on one video to obtain high accuracy on another video in every training step. Extensive experiments on benchmark datasets demonstrate the superiority and better generalization ability of MetaL-TDVS against several state-of-the-art methods.
引用
收藏
页码:5778 / 5786
页数:9
相关论文
共 34 条
[1]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[2]  
[Anonymous], 2018, P 32 AAAI C ART INT
[3]   THE ROLE OF METALEARNING IN STUDY PROCESSES [J].
BIGGS, JB .
BRITISH JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1985, 55 (NOV) :185-212
[4]  
Finn C, 2017, PR MACH LEARN RES, V70
[5]   VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method [J].
Fontes de Avila, Sandra Eliza ;
Brandao Lopes, Ana Paula ;
da Luz, Antonio, Jr. ;
Araujo, Arnaldo de Albuquerque .
PATTERN RECOGNITION LETTERS, 2011, 32 (01) :56-68
[6]  
Graves A, 2005, IEEE IJCNN, P2047
[7]  
Gygli M, 2015, PROC CVPR IEEE, P3090, DOI 10.1109/CVPR.2015.7298928
[8]   Creating Summaries from User Videos [J].
Gygli, Michael ;
Grabner, Helmut ;
Riemenschneider, Hayko ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :505-520
[9]   Video Summarization With Attention-Based Encoder-Decoder Networks [J].
Ji, Zhong ;
Xiong, Kailin ;
Pang, Yanwei ;
Li, Xuelong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) :1709-1717
[10]   Query-aware sparse coding for web multi-video summarization [J].
Ji, Zhong ;
Ma, Yaru ;
Pang, Yanwei ;
Li, Xuelong .
INFORMATION SCIENCES, 2019, 478 :152-166