Orthogonal and Smooth Subspace Based on Sparse Coding for Image Classification

被引:0
作者
Dai, Fushuang [1 ,2 ]
Zhao, Yao [1 ,2 ]
Chang, Dongxia [1 ,2 ]
Lin, Chunyu [2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Key Lab Adv Informat Sci & Network Techno, Beijing, Peoples R China
来源
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2015, PT II | 2015年 / 9315卷
关键词
Image classification; Orthogonal and smooth subspace; Sparse coding; Max pooling;
D O I
10.1007/978-3-319-24078-7_5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world problems usually deal with high-dimensional data, such as images, videos, text, web documents and so on. In fact, the classification algorithms used to process these high-dimensional data often suffer from the low accuracy and high computational complexity. Therefore, we propose a framework of transforming images from a high-dimensional image space to a low-dimensional target image space, based on learning an orthogonal smooth subspace for the SIFT sparse codes (SC-OSS). It is a two stage framework for subspace learning. Firstly, a sparse coding followed by spatial pyramid max pooling is used to get the image representation. Then, the image descriptor is mapped into an orthonormal and smooth subspace to classify images in low dimension. The proposed algorithm adds the orthogonality and a Laplacian smoothing penalty to constrain the projective function coefficient to be orthogonal and spatially smooth. The experimental results on the public datasets have shown that the proposed algorithm outperforms other subspace methods.
引用
收藏
页码:41 / 50
页数:10
相关论文
共 14 条
  • [1] [Anonymous], 2006, ADV NEURAL INF PROCE
  • [2] Bai EW, 2014, CHIN CONTR CONF, P6, DOI 10.1109/ChiCC.2014.6896586
  • [3] Duda RO., 1973, PATTERN CLASSIFICATI
  • [4] Learning an Orthogonal and Smooth Subspace for Image Classification
    Hou, Chenping
    Nie, Feiping
    Zhang, Changshui
    Wu, Yi
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (04) : 303 - 306
  • [5] Lazebnik S., COMPUTER VISION PATT, V2, P2169
  • [6] Liu FR, 2012, PROCEEDING OF THE IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, P299, DOI 10.1109/ICInfA.2012.6246822
  • [7] Lou Xiong-wei, 2014, Journal of Multimedia, V9, P269, DOI 10.4304/jmm.9.2.269-277
  • [8] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [9] Serre T, 2005, PROC CVPR IEEE, P994
  • [10] WANG JJ, 2010, PROC CVPR IEEE, P3360, DOI DOI 10.1109/CVPR.2010.5540018