Spatial Ensemble Learning for Heterogeneous Geographic Data with Class Ambiguity

被引:12
作者
Jiang, Zhe [1 ]
Sainju, Arpan Man [1 ]
Li, Yan [2 ]
Shekhar, Shashi [2 ]
Knight, Joseph [3 ]
机构
[1] Univ Alabama, Comp Sci Dept, Box 870290, Tuscaloosa, AL 35487 USA
[2] Univ Minnesota, Dept Comp Sci, 4-192 Keller Hall,200 Union St SE, Minneapolis, MN 55455 USA
[3] Univ Minnesota, Dept Forest Resources, 1530 Cleveland Ave North, St Paul, MN 55108 USA
基金
美国国家科学基金会;
关键词
Spatial classification; class ambiguity; spatial heterogeneity; spatial ensemble; local models; CLASSIFICATION; MIXTURE;
D O I
10.1145/3337798
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Class ambiguity refers to the phenomenon whereby similar features correspond to different classes at different locations. Given heterogeneous geographic data with class ambiguity, the spatial ensemble learning (SEL) problem aims to find a decomposition of the geographic area into disjoint zones such that class ambiguity is minimized and a local classifier can be learned in each zone. The problem is important for applications such as land cover mapping from heterogeneous earth observation data with spectral confusion. However, the problem is challenging due to its high computational cost. Related work in ensemble learning either assumes an identical sample distribution (e.g., bagging, boosting, random forest) or decomposes multi-modular input data in the feature vector space (e.g., mixture of experts, multimodal ensemble) and thus cannot effectively minimize class ambiguity. In contrast, we propose a spatial ensemble framework that explicitly partitions input data in geographic space. Our approach first preprocesses data into homogeneous spatial patches and uses a greedy heuristic to allocate pairs of patches with high class ambiguity into different zones. We further extend our spatial ensemble learning framework with spatial dependency between nearby zones based on the spatial autocorrelation effect. Both theoretical analysis and experimental evaluations on two real world wetland mapping datasets show the feasibility of the proposed approach.
引用
收藏
页数:25
相关论文
共 38 条
[1]  
[Anonymous], 2016, WEKA 3 DATA MINING S
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[5]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[6]   Subcategory-aware Object Classification [J].
Dong, Jian ;
Xia, Wei ;
Chen, Qiang ;
Feng, Jianshi ;
Huang, Zhongyang ;
Yan, Shuicheng .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :827-834
[7]   Advances in Spectral-Spatial Classification of Hyperspectral Images [J].
Fauvel, Mathieu ;
Tarabalka, Yuliya ;
Benediktsson, Jon Atli ;
Chanussot, Jocelyn ;
Tilton, James C. .
PROCEEDINGS OF THE IEEE, 2013, 101 (03) :652-675
[8]   Fast balanced partitioning is hard even on grids and trees [J].
Feldmann, Andreas Emil .
THEORETICAL COMPUTER SCIENCE, 2013, 485 :61-68
[9]   A decision-theoretic generalization of on-line learning and an application to boosting [J].
Freund, Y ;
Schapire, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1997, 55 (01) :119-139
[10]  
Gonçalves AR, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P3525