Temporal Action Localization with Pyramid of Score Distribution Features

Cited by: 132
Authors
Yuan, Jun [1]
Ni, Bingbing [2]
Yang, Xiaokang [2]
Kassim, Ashraf A. [1]
Affiliations
[1] National University of Singapore, Singapore
[2] Shanghai Jiao Tong University, Shanghai, People's Republic of China
Source
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016
Funding
National Natural Science Foundation of China
DOI
10.1109/CVPR.2016.337
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We investigate feature design and classification architectures for temporal action localization. The task is to detect and label actions in untrimmed videos, which is more challenging than classifying pre-segmented videos. The major difficulties are the uncertainty of when an action occurs and how to use information from different temporal scales. Two innovations are proposed to address these issues. First, we propose a Pyramid of Score Distribution Feature (PSDF) to capture motion information at multiple resolutions centered at each detection window. This feature mitigates the influence of unknown action position and duration, and shows a significant performance gain over previous detection approaches. Second, inter-frame consistency is further exploited by incorporating PSDF into state-of-the-art Recurrent Neural Networks, which gives an additional performance gain in detecting actions in temporally untrimmed videos. We tested our action localization framework on the THUMOS'15 and MPII Cooking Activities datasets, and on both it shows a large performance improvement over previous attempts.
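The abstract does not specify how the PSDF is computed, so the following is only an illustrative sketch of the general idea of a multi-resolution score feature centered on a detection window. It is not the authors' formulation. The function name `pyramid_score_features`, the parameters `base_len` and `levels`, the doubling of the window length per level, and the choice of per-class mean and max as the distribution summary are all assumptions made for this example.

```python
from statistics import mean


def pyramid_score_features(scores, center, base_len=4, levels=3):
    """Concatenate per-class score summaries from nested windows around `center`.

    scores: list of per-frame score vectors (one float per action class).
    Assumption for illustration: each pyramid level doubles the window
    length, and the score distribution is summarized by mean and max.
    """
    n_frames = len(scores)
    n_classes = len(scores[0])
    feature = []
    for level in range(levels):
        half = (base_len * 2 ** level) // 2
        # Clip the window to the video boundaries.
        start = max(0, center - half)
        end = min(n_frames, center + half + 1)
        window = scores[start:end]
        for c in range(n_classes):
            column = [frame[c] for frame in window]
            feature.append(mean(column))
            feature.append(max(column))
    return feature


# Toy input: 10 frames, 2 classes; class 0 fires on frames 4 to 6.
toy_scores = [[1.0 if 4 <= t <= 6 else 0.0, 0.5] for t in range(10)]
toy_feature = pyramid_score_features(toy_scores, center=5, base_len=2, levels=2)
```

In this toy case the narrow level sees only the active frames while the wider level also covers the inactive frames on either side, which is the kind of multi-scale context the abstract attributes to the feature. A sequence model such as a recurrent network could then consume one such vector per window position, although the abstract gives no detail on that stage.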
Pages: 3093-3102
Page count: 10