Comparison of Classification Algorithms and Training Sample Sizes in Urban Land Classification with Landsat Thematic Mapper Imagery

被引:312
作者
Li, Congcong [1 ]
Wang, Jie [2 ]
Wang, Lei [2 ]
Hu, Luanyun [3 ]
Gong, Peng [2 ,3 ,4 ,5 ]
机构
[1] Beijing Normal Univ, State Key Lab Remote Sensing Sci, Beijing 100875, Peoples R China
[2] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China
[3] Tsinghua Univ, Ctr Earth Syst Sci, Minist Educ, Key Lab Earth Syst Modeling, Beijing 100084, Peoples R China
[4] Univ Calif Berkeley, Dept Environm Sci Policy & Management, Berkeley, CA 94720 USA
[5] Joint Ctr Global Change Studies, Beijing 100875, Peoples R China
基金
国家高技术研究发展计划(863计划); 中国国家自然科学基金;
关键词
machine learning; maximum likelihood classification; logistic regression; support vector machine; tree classifiers; random forests; COVER CLASSIFICATION; REQUIREMENTS; TM;
D O I
10.3390/rs6020964
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Although a large number of new image classification algorithms have been developed, they are rarely tested with the same classification task. In this research, with the same Landsat Thematic Mapper (TM) data set and the same classification scheme over Guangzhou City, China, we tested two unsupervised and 13 supervised classification algorithms, including a number of machine learning algorithms that became popular in remote sensing during the past 20 years. Our analysis focused primarily on the spectral information provided by the TM data. We assessed all algorithms in a per-pixel classification decision experiment and all supervised algorithms in a segment-based experiment. We found that when sufficiently representative training samples were used, most algorithms performed reasonably well. Lack of training samples led to greater classification accuracy discrepancies than classification algorithms themselves. Some algorithms were more tolerable to insufficient (less representative) training samples than others. Many algorithms improved the overall accuracy marginally with per-segment decision making.
引用
收藏
页码:964 / 983
页数:20
相关论文
共 54 条
[1]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[2]  
[Anonymous], 2014, C4. 5: programs for machine learning
[3]  
[Anonymous], GEN BOOSTED MODELS G
[4]  
Ball, 1965, ISODATA NOVEL METHOD
[5]  
Bishop CM., 1995, NEURAL NETWORKS PATT
[6]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[7]  
Bradski G, 2000, DR DOBBS J, V25, P120
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   A comparison of object-based and contextual pixel-based classifications using high and medium spatial resolution images [J].
Cai, Shanshan ;
Liu, Desheng .
REMOTE SENSING LETTERS, 2013, 4 (10) :998-1007