Human interaction representation and recognition through motion decomposition

被引:31
作者
Du, Youtian [1 ]
Chen, Feng [1 ]
Xu, Wenli [1 ]
机构
[1] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
关键词
dynamic Bayesian network; human interaction recognition; intelligent surveillance; motion decomposition;
D O I
10.1109/LSP.2007.908035
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Human action recognition is one of the most important problems in video content analysis and computer vision. In this letter, we propose a novel framework of human interaction recognition through motion decomposition. Interactions contain not only motions corresponding to each person but also motion details on different scales. Hence, we decompose an interaction into multiple interacting stochastic processes in the above two aspects. Under the framework, we present a Coupled Hierarchical Durational-State Dynamic Bayesian Network (CHDS-DBN) to model interactions by modeling the multiple stochastic processes. The effectiveness of the approach is demonstrated by experiments of two-person interaction recognition.
引用
收藏
页码:952 / 955
页数:4
相关论文
共 12 条
[1]  
BRAND M, 1997, COUPLED HIDDEN MARKO, P405
[2]   Policy recognition in the Abstract Hidden Markov model [J].
Bui, HH ;
Venkatesh, S ;
West, G .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2002, 17 :451-499
[3]  
Du YT, 2006, INT C PATT RECOG, P618
[4]  
Duong TV, 2005, PROC CVPR IEEE, P838
[5]   The hierarchical hidden Markov model: Analysis and applications [J].
Fine, S ;
Singer, Y ;
Tishby, N .
MACHINE LEARNING, 1998, 32 (01) :41-62
[6]   Factorial hidden Markov models [J].
Ghahramani, Z ;
Jordan, MI .
MACHINE LEARNING, 1997, 29 (2-3) :245-273
[7]   Recognition of group activities using dynamic Probabilistic networks [J].
Gong, SG ;
Xiang, T .
NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, :742-749
[8]   Multi-agent activity recognition using observation decomposed hidden Markov models [J].
Liu, XH ;
Chua, CS .
IMAGE AND VISION COMPUTING, 2006, 24 (02) :166-175
[9]  
NATARAJAN P, 2007, P WMVC
[10]   Layered representations for learning and inferring office activity from multiple sensory channels [J].
Oliver, N ;
Garg, A ;
Horvitz, E .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (02) :163-180