Refocused Attention: Long Short-Term Rewards Guided Video Captioning

被引:0
作者
Jiarong Dong
Ke Gao
Xiaokai Chen
Juan Cao
机构
[1] Chinese Academy of Sciences,Institute of Computing Technology
[2] University of Chinese Academy of Sciences,undefined
来源
Neural Processing Letters | 2020年 / 52卷
关键词
Video captioning; Hierarchical attention; Reinforcement learning; Reward;
D O I
暂无
中图分类号
学科分类号
摘要
The adaptive cooperation of visual model and language model is essential for video captioning. However, due to the lack of proper guidance for each time step in end-to-end training, the over-dependence of language model often results in the invalidation of attention-based visual model, which is called ‘Attention Defocus’ problem in this paper. Based on an important observation that the recognition precision of entity word can reflect the effectiveness of the visual model, we propose a novel strategy called refocused attention to optimize the training and cooperating of visual model and language model, using ingenious guidance at appropriate time step. The strategy consists of a short-term-reward guided local entity recognition and a long-term-reward guided global relation understanding, neither requires any external training data. Moreover, a framework with hierarchical visual representations and hierarchical attention is established to fully exploit the potential strength of the proposed learning strategy. Extensive experiments demonstrate that the ingenious guidance strategy together with the optimized structure outperform state-of-the-art video captioning methods with relative improvements 7.7% in BLEU-4 and 5.0% in CIDEr-D on MSVD dataset, even without multi-modal features.
引用
收藏
页码:935 / 948
页数:13
相关论文
共 11 条
[1]  
Hochreiter S(1997)Long short-term memory Neural Comput 9 1735-1780
[2]  
Schmidhuber J(2018)NAIS: neural attentive item similarity model for recommendation IEEE Trans Knowl Data Eng 12 2354-2366
[3]  
He X(2017)Modeling temporal information of mitotic for mitotic event detection IEEE Trans Big Data 3 458-469
[4]  
He Z(undefined)undefined undefined undefined undefined-undefined
[5]  
Song J(undefined)undefined undefined undefined undefined-undefined
[6]  
Liu Z(undefined)undefined undefined undefined undefined-undefined
[7]  
Jiang YG(undefined)undefined undefined undefined undefined-undefined
[8]  
Chua TS(undefined)undefined undefined undefined undefined-undefined
[9]  
Nie W(undefined)undefined undefined undefined undefined-undefined
[10]  
Cheng H(undefined)undefined undefined undefined undefined-undefined