Machine-learning models, cost matrices, and conservation-based reduction of selected landscape classification errors

被引:2
作者
Gutzwiller, Kevin J. [1 ,2 ]
Chaudhary, Anand [2 ]
机构
[1] Baylor Univ, Dept Biol, One Bear Pl,97388, Waco, TX 76798 USA
[2] Baylor Univ, Inst Ecol Earth & Environm Sci, One Bear Pl,97205, Waco, TX 76798 USA
关键词
Classifiers; Cohen's Kappa; False negative; False positive; Sensitivity; Specificity;
D O I
10.1007/s10980-020-00969-y
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Context Use of statistical models developed with machine-learning algorithms is increasing in the ecological sciences, yet these disciplines have not capitalized on the ability to use cost matrices to selectively reduce classification errors that have highly detrimental consequences. Objectives Our aim was to promote such applications by demonstrating the process of using a cost matrix to decrease specific types of misclassification, explaining the importance of exploring the effectiveness of cost matrices for a given dataset, and encouraging use of cost matrices with machine-learning models in landscape-ecological and conservation contexts. Methods Bird occurrence data, landscape and regional land-cover data, costs of false-positive and false-negative errors, and the C5.0 decision tree algorithm were used to train and test a binary classifier. Results Increasing the cost for false negatives tended to decrease the frequency of this error type while allowing for reasonable predictive performance for each class separately and both classes combined. Conclusions Cost matrices are applicable to many different categorical response variables and spatial scales. We encourage landscape ecologists and planners to explore the effectiveness of cost matrices for their particular dataset and project goals, especially when conservation of biodiversity across broad spatial extents is at stake.
引用
收藏
页码:249 / 255
页数:7
相关论文
共 17 条
  • [1] [Anonymous], 2017, N AM BREEDING BIRD S
  • [2] [Anonymous], 2017, R LANG ENV STAT COMP
  • [3] [Anonymous], 2015, MACHINE LEARNING R L
  • [4] Bhattacharya M, 2013, INT J ADV COMPUT SC, V4, P1
  • [5] Breiman L, 1993, CLASSIFICATION REGRE
  • [6] Fielding AH, 2007, CLUSTER AND CLASSIFICATION TECHNIQUES FOR THE BIOSCIENCES, P1, DOI 10.2277/ 0521618002
  • [7] Fielding AH, 2002, PREDICTING SPECIES OCCURRENCES: ISSUES OF ACCURACY AND SCALE, P271
  • [8] What is the value of a good map?: An example using high spatial resolution imagery to aid riparian restoration
    Gergel, Sarah E.
    Stange, Yulia
    Coops, Nicholas C.
    Johansen, Kasper
    Kirby, Kathryn R.
    [J]. ECOSYSTEMS, 2007, 10 (05) : 688 - 702
  • [9] Avian abundance thresholds, human-altered landscapes, and the challenge of assemblage-level conservation
    Gutzwiller, Kevin J.
    Riffell, Samuel K.
    Flather, Curtis H.
    [J]. LANDSCAPE ECOLOGY, 2015, 30 (10) : 2095 - 2110
  • [10] Hollmén J, 2000, MANAG INFORMAT SYST, V2, P495