Annular Spatial Pyramid Mapping and Feature Fusion-Based Image Coding Representation and Classification

被引:5
作者
Xu, Mengxi [1 ]
Lu, Yingshu [2 ]
Wu, Xiaobin [1 ]
机构
[1] Nanjing Inst Technol, Sch Comp Engn, Nanjing 211167, Peoples R China
[2] Huawei Technol Co Ltd, Nanjing 210000, Peoples R China
关键词
BOVW; PSO;
D O I
10.1155/2020/8838454
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Conventional image classification models commonly adopt a single feature vector to represent informative contents. However, a single image feature system can hardly extract the entirety of the information contained in images, and traditional encoding methods have a large loss of feature information. Aiming to solve this problem, this paper proposes a feature fusion-based image classification model. This model combines the principal component analysis (PCA) algorithm, processed scale invariant feature transform (P-SIFT) and color naming (CN) features to generate mutually independent image representation factors. At the encoding stage of the scale-invariant feature transform (SIFT) feature, the bag-of-visual-word model (BOVW) is used for feature reconstruction. Simultaneously, in order to introduce the spatial information to our extracted features, the rotation invariant spatial pyramid mapping method is introduced for the P-SIFT and CN feature division and representation. At the stage of feature fusion, we adopt a support vector machine with two kernels (SVM-2K) algorithm, which divides the training process into two stages and finally learns the knowledge from the corresponding kernel matrix for the classification performance improvement. The experiments show that the proposed method can effectively improve the accuracy of image description and the precision of image classification.
引用
收藏
页数:9
相关论文
共 36 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]  
[Anonymous], 2006, ADV NEURAL INFORM PR
[3]  
[Anonymous], 2007, Caltech-256 object category dataset'
[4]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]   The devil is in the details: an evaluation of recent feature encoding methods [J].
Chatfield, Ken ;
Lempitsky, Victor ;
Vedaldi, Andrea ;
Zisserman, Andrew .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[7]   Deep Learning-Based Classification of Hyperspectral Data [J].
Chen, Yushi ;
Lin, Zhouhan ;
Zhao, Xing ;
Wang, Gang ;
Gu, Yanfeng .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (06) :2094-2107
[8]  
Farquhar J.D. R., 2005, NIPS
[9]   Distributed k-connected fault-tolerant topology control algorithms with PSO in future autonomic sensor systems [J].
Guo, Wenzhong ;
Xiong, Naixue ;
Vasilakos, Athanasios V. ;
Chen, Guolong ;
Yu, Chaolong .
INTERNATIONAL JOURNAL OF SENSOR NETWORKS, 2012, 12 (01) :53-62
[10]  
He KM, 2014, LECT NOTES COMPUT SC, V8691, P346, DOI [arXiv:1406.4729, 10.1007/978-3-319-10578-9_23]