Cost-Sensitive Active Learning With Lookahead: Optimizing Field Surveys for Remote Sensing Data Classification

被引:41
作者
Persello, Claudio [1 ,2 ]
Boularias, Abdeslam [3 ]
Dalponte, Michele [4 ]
Gobakken, Terje [5 ]
Naesset, Erik [5 ]
Schoelkopf, Bernhard [1 ]
机构
[1] Max Planck Inst Intelligent Syst, Dept Empir Inference, D-72076 Tubingen, Germany
[2] Univ Trent, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[3] Carnegie Mellon Univ, Pittsburgh, PA 15201 USA
[4] Edmund Mach Fdn, I-38010 San Michele All Adige, Italy
[5] Norwegian Univ Life Sci, Dept Ecol & Nat Resource Management, N-1432 As, Norway
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2014年 / 52卷 / 10期
关键词
Active learning (AL); field surveys; forest inventories; hyperspectral data; image classification; Markov decision process (MDP); support vector machine (SVM); TREE SPECIES CLASSIFICATION; HYPERSPECTRAL DATA; FOREST; LEAF;
D O I
10.1109/TGRS.2014.2300189
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Active learning typically aims at minimizing the number of labeled samples to be included in the training set to reach a certain level of classification accuracy. Standard methods do not usually take into account the real annotation procedures and implicitly assume that all samples require the same effort to be labeled. Here, we consider the case where the cost associated with the annotation of a given sample depends on the previously labeled samples. In general, this is the case when annotating a queried sample is an action that changes the state of a dynamic system, and the cost is a function of the state of the system. In order to minimize the total annotation cost, the active sample selection problem is addressed in the framework of a Markov decision process, which allows one to plan the next labeling action on the basis of an expected long-term cumulative reward. This framework allows us to address the problem of optimizing the collection of labeled samples by field surveys for the classification of remote sensing data. The proposed method is applied to the ground sample collection for tree species classification using airborne hyperspectral images. Experiments carried out in the context of a real case study on forest inventory show the effectiveness of the proposed method.
引用
收藏
页码:6652 / 6664
页数:13
相关论文
共 39 条
[31]   An active learning approach to hyperspectral data classification [J].
Rajan, Suju ;
Ghosh, Joydeep ;
Crawford, Melba M. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2008, 46 (04) :1231-1242
[32]  
Roy N., 2001, ICML Williamstown
[33]  
Schohn Greg, 2000, ICML, P839
[34]  
Seung H. S., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P287, DOI 10.1145/130385.130417
[35]  
*STAT NORW, 2008, SKOGST 2008
[36]   Support vector machine active learning with applications to text classification [J].
Tong, S ;
Koller, D .
JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (01) :45-66
[37]   Active Learning Methods for Remote Sensing Image Classification [J].
Tuia, Devis ;
Ratle, Frederic ;
Pacifici, Fabio ;
Kanevski, Mikhail F. ;
Emery, William J. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (07) :2218-2232
[38]  
Xu Z, 2003, LECT NOTES COMPUT SC, V2633, P393
[39]   Penalized discriminant analysis of in situ hyperspectral data for conifer species recognition [J].
Yu, B ;
Ostland, IM ;
Gong, P ;
Pu, RL .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 1999, 37 (05) :2569-2577