Effects of different sampling strategies for unburned label selection in machine learning modelling of wildfire occurrence probability

被引:5
作者
Quan, Xingwen [1 ,2 ,3 ]
Jiao, Miao [2 ]
He, Zhili [4 ]
Jaafari, Abolfazl [5 ]
Xie, Qian [2 ]
Lai, Xiaoying [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Resources & Environm, Chengdu 611731, Peoples R China
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China
[4] Univ Elect Sci & Technol China, Glasgow Coll, Chengdu 611731, Peoples R China
[5] Agr Res Educ & Extens Org AREEO, Res Inst Forests & Rangelands, Tehran 496813111, Iran
基金
中国国家自然科学基金;
关键词
AUC-PR; AUC-ROC; foliage fuel load; fuel moisture content; imbalanced dataset; machine learning; random forest; sampling strategy; unburned label; wildfire; FUEL MOISTURE-CONTENT; RADIATIVE-TRANSFER MODEL; FOREST-FIRE; LOGISTIC-REGRESSION; ALGORITHMS; SUSCEPTIBILITY; PREDICTION; SYSTEM; AREA; OPTIMIZATION;
D O I
10.1071/WF21149
中图分类号
S7 [林业];
学科分类号
0829 ; 0907 ;
摘要
The selection of unburned labels is a crucial step in machine learning modelling of wildfire occurrence probability. However, the effect of different sampling strategies on the performance of machine learning methods has not yet been thoroughly investigated. Additionally, whether the ratio of burned labels to unburned labels should be balanced or imbalanced remains a controversial issue. To address these gaps in the literature, we examined the effects of four broadly used sampling strategies for unburned label selection: (1) random selection in the unburned areas, (2) selection of areas with only one fire event, (3) selection of barren areas, and (4) selection of areas determined by the semi-variogram geostatistical technique. The effect of the balanced and imbalanced ratio between burned and unburned labels was also investigated. The random forest (RF) method explored the relationships between historical wildfires that occurred over the period between 2001 and 2020 in Yunnan Province, China, and climate, topography, fuel and anthropogenic variables. Multiple metrics demonstrated that the random selection of the unburned labels from the unburned areas with an imbalanced dataset outperformed the other three sampling strategies. Thus, we recommend this strategy to produce the required datasets for machine learning modelling of wildfire occurrence probability.
引用
收藏
页码:561 / 575
页数:15
相关论文
共 69 条
[1]   Impact of anthropogenic climate change on wildfire across western US forests [J].
Abatzoglou, John T. ;
Williams, A. Park .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (42) :11770-11775
[2]   Assessing the effect of foliar moisture on the spread rate of crown fires [J].
Alexander, Martin E. ;
Cruz, Miguel G. .
INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2013, 22 (04) :415-427
[3]   External Validation of the ASTER GDEM2, GMTED2010 and CGIAR-CSI-SRTM v4.1 Free Access Digital Elevation Models (DEMs) in Tunisia and Algeria [J].
Athmania, Djamel ;
Achour, Hammadi .
REMOTE SENSING, 2014, 6 (05) :4600-4620
[4]   Wildfire ignition-distribution modelling: a comparative study in the Huron-Manistee National Forest, Michigan, USA [J].
Bar Massada, Avi ;
Syphard, Alexandra D. ;
Stewart, Susan I. ;
Radeloff, Volker C. .
INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2013, 22 (02) :174-183
[5]   Urban air pollution, climate change and wildfires: The case study of an extended forest fire episode in northern Italy favoured by drought and warm weather conditions [J].
Bo, Matteo ;
Mercalli, Luca ;
Pognant, Federica ;
Berro, Daniele Cat ;
Clerico, Marina .
ENERGY REPORTS, 2020, 6 :781-786
[6]   Unprecedented burn area of Australian mega forest fires [J].
Boer, Matthias M. ;
Resco de Dios, Victor ;
Bradstock, Ross A. .
NATURE CLIMATE CHANGE, 2020, 10 (03) :171-172
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   OPTIMAL INTERPOLATION AND ISARITHMIC MAPPING OF SOIL PROPERTIES .1. THE SEMI-VARIOGRAM AND PUNCTUAL KRIGING [J].
BURGESS, TM ;
WEBSTER, R .
JOURNAL OF SOIL SCIENCE, 1980, 31 (02) :315-331
[9]   Monitoring live fuel moisture content of heathland, shrubland and sclerophyll forest in south-eastern Australia using MODIS data [J].
Caccamo, G. ;
Chisholm, L. A. ;
Bradstock, R. A. ;
Puotinen, M. L. ;
Pippen, B. G. .
INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2012, 21 (03) :257-269
[10]   Predicting late-successional fire refugia pre-dating European settlement in the Wenatchee Mountains [J].
Camp, A ;
Oliver, C ;
Hessburg, P ;
Everett, R .
FOREST ECOLOGY AND MANAGEMENT, 1997, 95 (01) :63-77