Semantic action recognition by learning a pose lexicon

被引:15
作者
Zhou, Lijuan [1 ]
Li, Wanqing [1 ]
Ogunbona, Philip [1 ]
Zhang, Zhengyou [2 ]
机构
[1] Univ Wollongong, Sch Comp & Informat Technol, Wollongong, NSW 2522, Australia
[2] Microsoft Res, Redmond, WA 98052 USA
关键词
Lexicon; Semantic pose; Visual pose; Action recognition; HIDDEN MARKOV-MODELS;
D O I
10.1016/j.patcog.2017.06.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a semantic representation, pose lexicon, for action recognition. The lexicon is composed of a set of semantic poses, a set of visual poses and a probabilistic mapping between the visual and semantic poses. Specially, an action can be represented by a sequence of semantic poses extracted from an associated textual instruction. Visual frames of the action are considered to be generated from a sequence of hidden visual poses. To learn the lexicon, a visual pose model is learned from training samples by a Gaussian Mixture model to characterize the likelihood of an observed visual frame being generated by a visual pose. A pose lexicon model is also learned by an extended hidden Markov alignment model to encode the probabilistic mapping between hidden visual poses and semantic poses sequences. With the lexicon, action classification is formulated as a problem of finding the maximum posterior probability of a given sequence of visual frames that fits to a given sequence of semantic poses through the most likely visual pose and alignment sequences. The efficacy of the proposed method was evaluated on MSRC-12, WorkoutSU-10, WorkoutUOW-18, Combined-15 and Combined-17 action datasets using cross-subject, cross-dataset and zero-shot protocols. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:548 / 562
页数:15
相关论文
共 61 条
[1]  
[Anonymous], 2003, P 8 INT C PARSING TE
[2]  
[Anonymous], P BRIT MACH VIS C BM
[3]  
[Anonymous], ARXIV151103028
[4]  
[Anonymous], 2015, P 2015 C N AM CHAPTE, DOI DOI 10.3115/V1/N15-1017
[5]  
[Anonymous], 15 ANN C N AM CHAPT
[6]  
[Anonymous], P IEEE INT C MULT
[7]  
[Anonymous], 2014, P 18 C EMP METH NAT, DOI DOI 10.3115/V1/D14-1082
[8]  
[Anonymous], 2012, P SIGCHI C HUM FACT
[9]  
[Anonymous], 1996, P 16 C COMP LING, DOI [DOI 10.3115/993268.993313, 10.3115/993268.993313]
[10]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022