Pyramid of Spatial Relatons for Scene-Level Land Use Classification

被引:204
作者
Chen, Shizhi [1 ]
Tian, YingLi [1 ,2 ]
机构
[1] CUNY City Coll, New York, NY 10031 USA
[2] CUNY, Grad Ctr, New York, NY 10031 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2015年 / 53卷 / 04期
关键词
Bag of words (BOW); geographical image classification; land use classification; pyramid of spatial relatons (PSR); spatial pyramid matching (SPM); URBAN-AREA; IMAGE; FEATURES; POINTS; SIFT;
D O I
10.1109/TGRS.2014.2351395
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Local feature with bag-of-words (BOW) representation has become one of the most popular approaches in object classification and image retrieval applications in the computer vision community. The recent efforts in the remote sensing community have demonstrated that the BOW approach can also effectively apply to geographic images for the applications of classification and retrieval. However, the BOW representation discards spatial information, which is critical for the remotely sensed land use classification. Several algorithms have incorporated spatial information into the BOWrepresentation by hard encoding coordinates of local features. Such rigid spatial encoding is not robust to translation and rotation variations, which are common characteristics of geographic images. To effectively incorporate spatial information into the BOW model for the land use classification, we propose a pyramid-of-spatial-relatons (PSR) model to capture both absolute and relative spatial relationships of local features. Unlike the conventional cooccurrence approach to describe pairwise spatial relationships between local features, the PSR model employs a novel concept of spatial relation to describe relative spatial relationship of a group of local features. As the result, the storage cost of the PSR model only linearly increases with the visual word codebook size instead of the quadratic relationship as in the cooccurrence approach. The PSR model is robust to translation and rotation variations and demonstrates excellent performance for the application of remotely sensed land use classification. On the Land Use and Land Cover image database, the PSR achieves 8% higher in the classification accuracy than the state of the art. If using only gray images, it outperforms the state of the art by more than 11%.
引用
收藏
页码:1947 / 1957
页数:11
相关论文
共 46 条
  • [1] [Anonymous], 2009, IEEE TPAMI
  • [2] [Anonymous], P AS C COMP VIS
  • [3] Baeza-Yates R., 1999, Modern Information Retrieval
  • [4] Speeded-Up Robust Features (SURF)
    Bay, Herbert
    Ess, Andreas
    Tuytelaars, Tinne
    Van Gool, Luc
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2008, 110 (03) : 346 - 359
  • [5] Bordes J. B., 2008, P BMVC, P1
  • [6] Boureau YL, 2011, IEEE I CONF COMP VIS, P2651, DOI 10.1109/ICCV.2011.6126555
  • [7] Performance evaluation of local colour invariants
    Burghouts, Gertjan J.
    Geusebroek, Jan-Mark
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2009, 113 (01) : 48 - 62
  • [8] Spatial-Bag-of-Features
    Cao, Yang
    Wang, Changhu
    Li, Zhiwei
    Zhang, Liqing
    Zhang, Lei
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3352 - 3359
  • [9] Chen S., 2012, J COMPUT VIS IMAGE P, V2, P1
  • [10] Csurka G, 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678