Cost-Sensitive Active Learning With Lookahead: Optimizing Field Surveys for Remote Sensing Data Classification

被引:41
作者
Persello, Claudio [1 ,2 ]
Boularias, Abdeslam [3 ]
Dalponte, Michele [4 ]
Gobakken, Terje [5 ]
Naesset, Erik [5 ]
Schoelkopf, Bernhard [1 ]
机构
[1] Max Planck Inst Intelligent Syst, Dept Empir Inference, D-72076 Tubingen, Germany
[2] Univ Trent, Dept Informat Engn & Comp Sci, I-38123 Trento, Italy
[3] Carnegie Mellon Univ, Pittsburgh, PA 15201 USA
[4] Edmund Mach Fdn, I-38010 San Michele All Adige, Italy
[5] Norwegian Univ Life Sci, Dept Ecol & Nat Resource Management, N-1432 As, Norway
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2014年 / 52卷 / 10期
关键词
Active learning (AL); field surveys; forest inventories; hyperspectral data; image classification; Markov decision process (MDP); support vector machine (SVM); TREE SPECIES CLASSIFICATION; HYPERSPECTRAL DATA; FOREST; LEAF;
D O I
10.1109/TGRS.2014.2300189
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Active learning typically aims at minimizing the number of labeled samples to be included in the training set to reach a certain level of classification accuracy. Standard methods do not usually take into account the real annotation procedures and implicitly assume that all samples require the same effort to be labeled. Here, we consider the case where the cost associated with the annotation of a given sample depends on the previously labeled samples. In general, this is the case when annotating a queried sample is an action that changes the state of a dynamic system, and the cost is a function of the state of the system. In order to minimize the total annotation cost, the active sample selection problem is addressed in the framework of a Markov decision process, which allows one to plan the next labeling action on the basis of an expected long-term cumulative reward. This framework allows us to address the problem of optimizing the collection of labeled samples by field surveys for the classification of remote sensing data. The proposed method is applied to the ground sample collection for tree species classification using airborne hyperspectral images. Experiments carried out in the context of a real case study on forest inventory show the effectiveness of the proposed method.
引用
收藏
页码:6652 / 6664
页数:13
相关论文
共 39 条
[1]  
[Anonymous], STAT FOREST CONDITIO
[2]  
[Anonymous], 2009, ACTIVE LEARNING LIT
[3]  
[Anonymous], 2008, SAMPLING STRATEGIES
[4]  
[Anonymous], 2007, Approximate Dynamic Programming: Solving the Curses of Dimensionality (Wiley Series in Probability and Statistics)
[5]  
Azimi J., 2012, P 29 INT C MACH LEAR, P1199
[6]  
Bernstein LS, 2005, INT GEOSCI REMOTE SE, P3549
[7]   Detection and analysis of individual leaf-off tree crowns in small footprint, high sampling density lidar data from the eastern deciduous forest in North America [J].
Brandtberg, T ;
Warner, TA ;
Landenberger, RE ;
McGraw, JB .
REMOTE SENSING OF ENVIRONMENT, 2003, 85 (03) :290-303
[8]  
Brinker K., 2003, P 20 INT C MACH LEAR, P59
[9]  
Campbell C., 2000, ICML, P111
[10]   Hyperspectral discrimination of tropical rain forest tree species at leaf to crown scales [J].
Clark, ML ;
Roberts, DA ;
Clark, DB .
REMOTE SENSING OF ENVIRONMENT, 2005, 96 (3-4) :375-398