A video semantic detection method based. on locality-sensitive discriminant sparse representation and weighted KNN

被引:19
作者
Zhan, Yongzhao [1 ]
Liu, Junqi [1 ]
Gou, Jianping [1 ]
Wang, Minchao [1 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Telecommun Engn, Zhenjiang 212013, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Sparse representation; Discrimination; Weighted KNN; Video semantic concept detection; K-SVD; CLASSIFICATION; DICTIONARIES; ALGORITHM;
D O I
10.1016/j.jvcir.2016.09.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video semantic detection has been one research hotspot in the field of human-computer interaction. In video features-oriented sparse representation, the features from the same category video could not achieve similar coding results. To address this, the Locality-Sensitive Discriminant Sparse Representation (LSDSR) is developed, in order that the video samples belonging to the same video category are encoded as similar sparse codes which make them have better category discrimination. In the LSDSR, a discriminative loss function based on sparse coefficients is imposed on the locality-sensitive sparse representation, which makes the optimized dictionary for sparse representation be discriminative. The LSDSR for video features enhances the power of semantic discrimination to optimize the dictionary and build the better discriminant sparse model. More so, to further improve the accuracy of video semantic detection after sparse representation, a weighted K-Nearest Neighbor (KNN) classification method with the loss function that integrates reconstruction error and discrimination for the sparse representation is adopted to detect video semantic concepts. The proposed methods are evaluated on the related video databases in comparison with existing sparse representation methods. The experimental results show that the proposed methods significantly enhance the power of discrimination of video features, and consequently improve the accuracy of video semantic concept detection. (C) 2016 Elsevier Inc. All rights reserved.
引用
收藏
页码:65 / 73
页数:9
相关论文
共 24 条
[1]   K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J].
Aharon, Michal ;
Elad, Michael ;
Bruckstein, Alfred .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (11) :4311-4322
[2]  
[Anonymous], J JIANGSU U
[3]   Compression of facial images using the K-SVD algorithm [J].
Bryt, Ori ;
Elad, Michael .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2008, 19 (04) :270-282
[4]  
Candes E. J., 2006, P INT C MATH ICM, P1433, DOI DOI 10.4171/022-3/69
[5]   Image denoising via sparse and redundant representations over learned dictionaries [J].
Elad, Michael ;
Aharon, Michal .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (12) :3736-3745
[6]  
Fu J., 2012, INFORM TECHNOL J, V11, P1381
[7]   Improved pseudo nearest neighbor classification [J].
Gou, Jianping ;
Zhan, Yongzhao ;
Rao, Yunbo ;
Shen, Xiangjun ;
Wang, Xiaoming ;
He, Wu .
KNOWLEDGE-BASED SYSTEMS, 2014, 70 :361-375
[8]  
Guha Tanaya, 2011, Proceedings 2011 IEEE International Conference on Automatic Face & Gesture Recognition (FG 2011), P143, DOI 10.1109/FG.2011.5771388
[9]   TEXTURAL FEATURES FOR IMAGE CLASSIFICATION [J].
HARALICK, RM ;
SHANMUGAM, K ;
DINSTEIN, I .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06) :610-621
[10]  
Liang Z., 2010, P INT C INF SCI ENG, P3404