Supervised Local Descriptor Learning for Human Action Recognition

被引:16
作者
Zhen, Xiantong [1 ]
Zheng, Feng [1 ]
Shao, Ling [2 ]
Cao, Xianbin [3 ]
Xu, Dan [4 ]
机构
[1] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
[2] Univ East Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
[3] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[4] Univ Trento, Dept Informat Engn & Comp Sci, I-38122 Trento, Italy
基金
美国国家科学基金会;
关键词
Action recognition; dimensionality reduction; image-to-class distance; large scale local features; manifold regularization; naive Bayes nearest neighbor; MANIFOLD REGULARIZATION; REGRESSION; EIGENMAPS; FRAMEWORK;
D O I
10.1109/TMM.2017.2700204
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Local features have been widely used in computer vision tasks, e.g., human action recognition, but it tends to be an extremely challenging task to deal with large-scale local features of high dimensionality with redundant information. In this paper, we propose a novel fully supervised local descriptor learning algorithm called discriminative embedding method based on the image-to-class distance (I2CDDE) to learn compact but highly discriminative local feature descriptors for more accurate and efficient action recognition. By leveraging the advantages of the I2C distance, the proposed I2CDDE incorporates class labels to enable fully supervised learning of local feature descriptors, which achieves highly discriminative but compact local descriptors. The objective of our I2CDDE is to minimize the I2C distances from samples to their corresponding classes while maximizing the I2C distances to the other classes in the low-dimensional space. To further improve the performance, we propose incorporating a manifold regularization based on the graph Laplacian into the objective function, which can enhance the smoothness of the embedding by extracting the local intrinsic geometrical structure. The proposed I2CDDE for the first time achieves fully supervised learning of local feature descriptors. It significantly improves the performance of I2C-based methods by increasing the discriminative ability of local features while greatly reducing the computational burden by dimensionality reduction to handle large-scale data. We apply the proposed I2CDDE algorithm to human action recognition on four widely used benchmark datasets. The results have shown that I2CDDE can significantly improve I2C-based classifiers and achieves state-of-the-art performance.
引用
收藏
页码:2056 / 2065
页数:10
相关论文
共 58 条
[1]  
[Anonymous], INTRO HUMAN ACTION R
[2]  
[Anonymous], 1997, AM MATH SOC, DOI DOI 10.1090/CBMS/092
[3]  
[Anonymous], P AS C COMP VIS
[4]  
[Anonymous], P BR MACH VIS C
[5]  
[Anonymous], 2012, CoRR
[6]   Towards Optimal Naive Bayes Nearest Neighbor [J].
Behmo, Regis ;
Marcombes, Paul ;
Dalalyan, Arnak ;
Prinet, Veronique .
COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :171-+
[7]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[8]  
Belkin M, 2006, J MACH LEARN RES, V7, P2399
[9]  
Bo L., 2009, Advances in neural information processing systems, P135
[10]   In defense of Nearest-Neighbor based image classification [J].
Boiman, Oren ;
Shechtman, Eli ;
Irani, Michal .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1992-+