Interactive Phrases: Semantic Descriptions for Human Interaction Recognition

Cited by: 94
Authors
Kong, Yu [1]
Jia, Yunde [2]
Fu, Yun [1, 3, 4, 5, 6]
Affiliations
[1] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
[2] Beijing Inst Technol, Sch Comp Sci, Beijing Lab Intelligent Informat Technol, Beijing 100081, Peoples R China
[3] Northeastern Univ, Coll Comp & Informat Sci, Boston, MA 02115 USA
[4] BBN Technol, Cambridge, MA USA
[5] Tufts Univ, Dept Comp Sci, Medford, MA 02155 USA
[6] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
Funding
National Science Foundation (USA)
Keywords
Human interaction; action recognition; latent structural SVM
DOI
10.1109/TPAMI.2014.2303090
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the problem of recognizing human interactions from videos. We propose a novel approach that recognizes human interactions using learned high-level descriptions called interactive phrases. Interactive phrases describe motion relationships between interacting people. These phrases naturally exploit human knowledge and allow us to construct a more descriptive model for recognizing human interactions. We propose a discriminative model, based on the latent SVM formulation, to encode interactive phrases. Interactive phrases are treated as latent variables and are used as mid-level features. To complement the manually specified interactive phrases, we also discover data-driven phrases in order to find potentially useful and discriminative phrases for differentiating human interactions. An information-theoretic approach is employed to learn the data-driven phrases. The interdependencies between interactive phrases are explicitly captured in the model to deal with motion ambiguity and partial occlusion in the interactions. We evaluate our method on the BIT-Interaction, UT-Interaction, and Collective Activity data sets. Experimental results show that our approach outperforms previous approaches.
Pages: 1775-1788
Page count: 14