Integrating OpenStreetMap crowdsourced data and Landsat time series imagery for rapid land use/land cover (LULC) mapping: Case study of the Laguna de Bay area of the Philippines

Cited by: 117
Authors
Johnson, Brian A. [1]
Iizuka, Kotaro [2]
Affiliations
[1] Inst Global Environm Strategies, 2108-11 Kamiyamaguchi, Hayama, Kanagawa 2400115, Japan
[2] Kyoto Univ, Res Inst Sustainable Humanosphere, Uji, Kyoto 6110011, Japan
Keywords
OpenStreetMap; Volunteered geographic information; Citizen science; Crowdsourced data; Random forest; Landsat 8; Google Earth Engine; SUPPORT VECTOR MACHINES; TRAINING DATA; FOREST; INFORMATION; URBAN; CLASSIFICATIONS; ACCURACY; MAPS;
DOI
10.1016/j.apgeog.2015.12.006
Chinese Library Classification (CLC) number
P9 [Physical geography]; K9 [Geography];
Discipline classification code
0705 ; 070501 ;
Abstract
We explored the potential for rapid land use/land cover (LULC) mapping using time-series Landsat satellite imagery and training data (for supervised classification) automatically extracted from crowdsourced OpenStreetMap (OSM) "landuse" (OSM-LU) and "natural" (OSM-N) polygon datasets. The main challenge with using these data for LULC classification was their high level of noise, as the Landsat images all contained varying degrees of cloud cover (causes of attribute noise) and the OSM polygons contained locational errors and class labeling errors (causes of class noise). A second challenge arose from the imbalanced class distribution in the extracted training data, which occurred due to wide discrepancies in the area coverage of each OSM-LU/OSM-N class. To address the first challenge, three relatively noise-tolerant algorithms, naive Bayes (NB), decision tree (C4.5 algorithm), and random forest (RF), were evaluated for image classification. To address the second challenge, artificial training samples were generated for the minority classes using the synthetic minority over-sampling technique (SMOTE). Image classification accuracies were calculated for a four-class, five-class, and six-class LULC system to assess the capability of the proposed methods for mapping relatively broad as well as more detailed LULC types, and the highest overall accuracies achieved were 84.0% (four-class SMOTE-RF result), 81.0% (five-class SMOTE-RF result), and 72.0% (six-class SMOTE-NB result). RF and NB had relatively similar overall accuracies, while those of C4.5 were much lower. SMOTE led to higher classification accuracies for RF and C4.5, and in some cases for NB, despite the noise in the training set. The main advantages of the proposed methods are their cost- and time-efficiency, as training data for supervised classification is automatically extracted from the crowdsourced datasets and no pre-processing for cloud detection/cloud removal is performed. © 2015 Elsevier Ltd. All rights reserved.
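The oversampling step named in the abstract can be illustrated with a minimal sketch of the SMOTE idea: each synthetic sample is placed at a random point on the line segment between a minority-class sample and one of its k nearest minority-class neighbours. This is not the authors' implementation; the function name `smote`, its parameters, and the two-feature example values are hypothetical and chosen only for illustration.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Return n_new synthetic samples interpolated between minority samples.

    minority: list of equal-length feature vectors (lists of floats).
    k: number of nearest minority neighbours considered for each base sample.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        # Pick a random minority sample as the base point.
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of its k nearest minority neighbours (Euclidean distance).
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        # Interpolate at a random position between base and the chosen neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


# Hypothetical minority-class training pixels (two made-up feature values each).
minority_pixels = [[0.10, 0.30], [0.12, 0.28], [0.11, 0.35], [0.15, 0.33]]
new_pixels = smote(minority_pixels, n_new=6, k=3)
print(len(new_pixels))
```

Because every synthetic sample is a convex combination of two real minority samples, it always falls inside the range of feature values already observed for that class. In a full workflow the synthetic samples would be appended to the training set before fitting a classifier such as RF.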
Pages: 140-149
Number of pages: 10
Related papers
52 in total
[11]  
Estima J., 2013, P 2 ACM SIGSPATIAL I, P39, DOI 10.1145/2534732.2534734
[12]   Investigating the potential of OpenStreetMap for land use/land cover production: A case study for continental Portugal [J].
Estima, Jacinto ;
Painho, Marco .
Lecture Notes in Geoinformation and Cartography, 2015, 0 (9783319142791) :273-293
[13]   Modeling sensitivity to accuracy in classified imagery: A study of areal interpolation by dasymetric mapping [J].
Fisher, PF ;
Langford, M .
PROFESSIONAL GEOGRAPHER, 1996, 48 (03) :299-309
[14]  
Folleco AA, 2009, INFORM-J COMPUT INFO, V33, P245
[15]  
Food and Agriculture Organization of the United Nations, 2010, 163 FAO UN, DOI 10.4060/ca9825-n
[16]   Predictive relations of tropical forest biomass from Landsat TM data and their transferability between regions [J].
Foody, GM ;
Boyd, DS ;
Cutler, MEJ .
REMOTE SENSING OF ENVIRONMENT, 2003, 85 (04) :463-474
[17]   Classification in the Presence of Label Noise: a Survey [J].
Frenay, Benoit ;
Verleysen, Michel .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (05) :845-869
[18]   Random Forests for land cover classification [J].
Gislason, PO ;
Benediktsson, JA ;
Sveinsson, JR .
PATTERN RECOGNITION LETTERS, 2006, 27 (04) :294-300
[19]   Citizens as sensors: the world of volunteered geography [J].
Goodchild, Michael .
GEOJOURNAL, 2007, 69 (04) :211-221
[20]   A Survey of mislabeled training data detection techniques for pattern classification [J].
Guan, Donghai ;
Yuan, Weiwei .
IETE TECHNICAL REVIEW, 2013, 30 (06) :524-530