A supervised dictionary learning and discriminative weighting model for action recognition

被引:13
作者
Dong, Jian [1 ,2 ]
Sun, Changyin [1 ,2 ]
Yang, Wankou [1 ,2 ,3 ]
机构
[1] Southeast Univ, Sch Automat, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Measurement & Control Complex Syst Engn, Nanjing 210096, Jiangsu, Peoples R China
[3] Southeast Univ, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing 210096, Jiangsu, Peoples R China
关键词
Dictionary learning; Local Fisher Discrimination; Supervised sparse coding; Discriminative weighting model; Multiple Kernel Learning; Action recognition; REPRESENTATION;
D O I
10.1016/j.neucom.2015.01.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a supervised dictionary learning algorithm for action recognition in still images followed by a discriminative weighting model. The dictionary is learned based on Local Fisher Discrimination which takes into account the local manifold structure and discrimination information of local descriptors. The label information of local descriptors is considered in both dictionary learning and sparse coding stage which generates a supervised sparse coding algorithm and makes the coding coefficients discriminative. Instead of using spatial pyramid features, sliding window-based features with max-pooling are computed from coding coefficients. And then a discriminative weighting model combining a max-margin classifier is proposed using the features. Both the weighting coefficients and model parameters can be jointly learned using the same way in Multiple Kernel Learning algorithm. We validate our model on the following action recognition datasets: Willow 7 human actions dataset, People Playing Music Instrument (PPMI) dataset, and Sports dataset. To show the generality of our model, we also validate it on Scene15 dataset. The experiment results show that only with single scale local descriptors, our algorithm is comparable to some state-of-the-art algorithms. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:246 / 256
页数:11
相关论文
共 40 条
[1]  
[Anonymous], 2011, P INT C NEUR INF PRO
[2]  
[Anonymous], 2010, P BRIT MACH VIS C 20
[3]  
Boureau YL, 2011, IEEE I CONF COMP VIS, P2651, DOI 10.1109/ICCV.2011.6126555
[4]   Sparse Modeling of Human Actions from Motion Imagery [J].
Castrodad, Alexey ;
Sapiro, Guillermo .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (01) :1-15
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   Extended SRC: Undersampled Face Recognition via Intraclass Variant Dictionary [J].
Deng, Weihong ;
Hu, Jiani ;
Guo, Jun .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) :1864-1870
[7]  
Desai C, 2012, LECT NOTES COMPUT SC, V7575, P158, DOI 10.1007/978-3-642-33765-9_12
[8]   Kernelized Supervised Dictionary Learning [J].
Gangeh, Mehrdad J. ;
Ghodsi, Ali ;
Kamel, Mohamed S. .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2013, 61 (19) :4753-4767
[9]   Local Features Are Not Lonely - Laplacian Sparse Coding for Image Classification [J].
Gao, Shenghua ;
Tsang, Ivor Wai-Hung ;
Chia, Liang-Tien ;
Zhao, Peilin .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :3555-3561
[10]   A survey on still image based human action recognition [J].
Guo, Guodong ;
Lai, Alice .
PATTERN RECOGNITION, 2014, 47 (10) :3343-3361