Uncertainty-Based Sample Optimization Strategies for Large Forest Samples Set

被引:1
作者
Guo, Yan [1 ]
Liu, Wenyi [2 ]
Liu, Fujiang [3 ]
机构
[1] China Univ Geosci, Coll Comp Sci, Wuhan 430074, Hubei, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Hubei, Peoples R China
[3] China Univ Geosci, Fac Informat Engn, Wuhan 430074, Hubei, Peoples R China
来源
COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, (ISICA 2015) | 2016年 / 575卷
关键词
Sample optimization; Uncertainty; Clustering; KNN; Remotesensing image classification;
D O I
10.1007/978-981-10-0356-1_55
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Our study was focused on the optimization of large training samples set selected from the global forest cover change detection system. Automatically delineating training samples procedure labeled tens of millions of samples representing forests and non-forests. To improve the precision, reduce the computational complexity and avoid over-fitting, we need to select samples from the large set of tens of millions of samples that are helpful for training a classifier. In this paper, two methods were used to optimize a large sample set from the Landsat-7 ETM+ data and obtain samples for training the classifier. The first method was the traditional stratified system sampling strategy. The second was uncertainty-based sample set optimization that selects training samples based on uncertainty by examining the uncertainty measure of samples and the distribution of their feature space, and involving the subtractive clustering, KNN and support vector machine. Through precision evaluation, our experiments validated that the uncertainty-based sampling strategy can achieve better results than the stratified system sampling strategy.
引用
收藏
页码:519 / 530
页数:12
相关论文
共 15 条
  • [1] Fast nearest neighbor condensation for large data sets classification
    Angiulli, Fabrizio
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (11) : 1450 - 1464
  • [2] Bay S. D., 1999, Intelligent Data Analysis, V3, P191, DOI 10.1016/S1088-467X(99)00018-9
  • [3] Dasgupta S., 2008, ICML08: Proceedings of the 25th International Conference on Machine Learning, P208
  • [4] Donmez P, 2007, LECT NOTES ARTIF INT, V4701, P116
  • [5] Hu Z., 2010, J YANSHAN U, V5, P421
  • [6] Use of a dark object concept and support vector machines to automate forest cover change analysis
    Huang, Chengquan
    Song, Kuan
    Kim, Sunghee
    Townshend, John R. G.
    Davis, Paul
    Masek, Jeffrey G.
    Goward, Samuel N.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2008, 112 (03) : 970 - 985
  • [7] Automated masking of cloud and cloud shadow for forest change analysis using Landsat images
    Huang, Chengquan
    Thomas, Nancy
    Goward, Samuel N.
    Masek, Jeffrey G.
    Zhu, Zhiliang
    Townshend, John R. G.
    Vogelmann, James E.
    [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2010, 31 (20) : 5449 - 5464
  • [8] Jun Ying Chen, 2008, Information Technology Journal, V7, P356
  • [9] Li Y., 2011, J HUHAI I TECHNOL NA, VS1, P67
  • [10] QRS detection using K-Nearest Neighbor algorithm (KNN) and evaluation on standard ECG databases
    Saini, Indu
    Singh, Dilbag
    Khosla, Arun
    [J]. JOURNAL OF ADVANCED RESEARCH, 2013, 4 (04) : 331 - 344