Tree Species Abundance Predictions in a Tropical Agricultural Landscape with a Supervised Classification Model and Imbalanced Data

被引:63
作者
Graves, Sarah J. [1 ]
Asner, Gregory P. [2 ]
Martin, Roberta E. [2 ]
Anderson, Christopher B. [2 ]
Colgan, Matthew S. [2 ]
Kalantari, Leila [3 ]
Bohlman, Stephanie A. [1 ,4 ]
机构
[1] Univ Florida, Sch Forest Resources & Conservat, POB 11041, Gainesville, FL 32611 USA
[2] Carnegie Inst Sci, Dept Global Ecol, 260 Panama St, Stanford, CA 94305 USA
[3] Univ Florida, Dept Comp & Informat Sci & Engn, POB 116120, Gainesville, FL 32611 USA
[4] Smithsonian Trop Res Inst, Apartado 0843-03092, Balboa, Ancon, Panama
关键词
Support Vector Machine; imaging spectroscopy; class imbalance; tropics; agriculture; operational species mapping; SUPPORT VECTOR MACHINE; IMAGING SPECTROSCOPY; ESTIMATING AREA; LIDAR DATA; ACCURACY; ERROR; BIODIVERSITY; SCIENCE; IMAGERY;
D O I
10.3390/rs8020161
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mapping species through classification of imaging spectroscopy data is facilitating research to understand tree species distributions at increasingly greater spatial scales. Classification requires a dataset of field observations matched to the image, which will often reflect natural species distributions, resulting in an imbalanced dataset with many samples for common species and few samples for less common species. Despite the high prevalence of imbalanced datasets in multiclass species predictions, the effect on species prediction accuracy and landscape species abundance has not yet been quantified. First, we trained and assessed the accuracy of a support vector machine (SVM) model with a highly imbalanced dataset of 20 tropical species and one mixed-species class of 24 species identified in a hyperspectral image mosaic (350-2500 nm) of Panamanian farmland and secondary forest fragments. The model, with an overall accuracy of 62% +/- 2.3% and F-score of 59% +/- 2.7%, was applied to the full image mosaic (23,000 ha at a 2-m resolution) to produce a species prediction map, which suggested that this tropical agricultural landscape is more diverse than what has been presented in field-based studies. Second, we quantified the effect of class imbalance on model accuracy. Model assessment showed a trend where species with more samples were consistently over predicted while species with fewer samples were under predicted. Standardizing sample size reduced model accuracy, but also reduced the level of species over- and under-prediction. This study advances operational species mapping of diverse tropical landscapes by detailing the effect of imbalanced data on classification accuracy and providing estimates of tree species abundance in an agricultural landscape. Species maps using data and methods presented here can be used in landscape analyses of species distributions to understand human or environmental effects, in addition to focusing conservation efforts in areas with high tree cover and diversity.
引用
收藏
页数:21
相关论文
共 60 条
[1]   Urban tree species mapping using hyperspectral and lidar data fusion [J].
Alonzo, Michael ;
Bookhagen, Bodo ;
Roberts, Dar A. .
REMOTE SENSING OF ENVIRONMENT, 2014, 148 :70-83
[2]   Identifying Santa Barbara's urban tree species from AVIRIS imagery using canonical discriminant analysis [J].
Alonzo, Mike ;
Roth, Keely ;
Roberts, Dar .
REMOTE SENSING LETTERS, 2013, 4 (05) :513-521
[3]  
[Anonymous], 2015, MISC FUNCT DEP STAT
[4]  
[Anonymous], J SUSTAIN FOR
[5]   Quantifying forest canopy traits: Imaging spectroscopy versus field survey [J].
Asner, Gregory P. ;
Martin, Roberta E. ;
Anderson, Christopher B. ;
Knapp, David E. .
REMOTE SENSING OF ENVIRONMENT, 2015, 158 :15-27
[6]   Carnegie Airborne Observatory-2: Increasing science data dimensionality via high-fidelity multi-sensor fusion [J].
Asner, Gregory P. ;
Knapp, David E. ;
Boardman, Joseph ;
Green, Robert O. ;
Kennedy-Bowdoin, Ty ;
Eastwood, Michael ;
Martin, Roberta E. ;
Anderson, Christopher ;
Field, Christopher B. .
REMOTE SENSING OF ENVIRONMENT, 2012, 124 :454-465
[7]  
BALDECK CA, 2014, IEEE J-STARS, V8, P2501, DOI DOI 10.1109/JSTARS.2014.2346475
[8]   Operational Tree Species Mapping in a Diverse Tropical Forest with Airborne Imaging Spectroscopy [J].
Baldeck, Claire A. ;
Asner, Gregory P. ;
Martin, Robin E. ;
Anderson, Christopher B. ;
Knapp, David E. ;
Kellner, James R. ;
Wright, S. Joseph .
PLOS ONE, 2015, 10 (07)
[9]   Improving Remote Species Identification through Efficient Training Data Collection [J].
Baldeck, Claire A. ;
Asner, Gregory P. .
REMOTE SENSING, 2014, 6 (04) :2682-2698
[10]   Class prediction for high-dimensional class-imbalanced data [J].
Blagus, Rok ;
Lusa, Lara .
BMC BIOINFORMATICS, 2010, 11 :523