Data-Driven Wildfire Risk Prediction in Northern California

被引:41
作者
Malik, Ashima [1 ]
Rao, Megha Rajam [1 ]
Puppala, Nandini [1 ]
Koouri, Prathusha [1 ]
Thota, Venkata Anil Kumar [1 ]
Liu, Qiao [1 ]
Chiao, Sen [2 ]
Gao, Jerry [3 ]
机构
[1] San Jose State Univ, Dept Appl Data Sci, San Jose, CA 95192 USA
[2] San Jose State Univ, Dept Meteorol & Climate Sci, San Jose, CA 95192 USA
[3] San Jose State Univ, Dept Comp Engn, San Jose, CA 95192 USA
关键词
wildfire; wildfire risks prediction; machine learning; random forest; spatial and temporal accuracy;
D O I
10.3390/atmos12010109
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Over the years, rampant wildfires have plagued the state of California, creating economic and environmental loss. In 2018, wildfires cost nearly 800 million dollars in economic loss and claimed more than 100 lives in California. Over 1.6 million acres of land has burned and caused large sums of environmental damage. Although, recently, researchers have introduced machine learning models and algorithms in predicting the wildfire risks, these results focused on special perspectives and were restricted to a limited number of data parameters. In this paper, we have proposed two data-driven machine learning approaches based on random forest models to predict the wildfire risk at areas near Monticello and Winters, California. This study demonstrated how the models were developed and applied with comprehensive data parameters such as powerlines, terrain, and vegetation in different perspectives that improved the spatial and temporal accuracy in predicting the risk of wildfire including fire ignition. The combined model uses the spatial and the temporal parameters as a single combined dataset to train and predict the fire risk, whereas the ensemble model was fed separate parameters that were later stacked to work as a single model. Our experiment shows that the combined model produced better results compared to the ensemble of random forest models on separate spatial data in terms of accuracy. The models were validated with Receiver Operating Characteristic (ROC) curves, learning curves, and evaluation metrics such as: accuracy, confusion matrices, and classification report. The study results showed and achieved cutting-edge accuracy of 92% in predicting the wildfire risks, including ignition by utilizing the regional spatial and temporal data along with standard data parameters in Northern California.
引用
收藏
页数:19
相关论文
共 28 条
  • [11] Brownlee J, 2020, SMOTE OVERSAMPLING I
  • [12] Relieving Uncertainty in Forest Fire Spread Prediction by Exploiting Multicore Architectures
    Cencerrado, Andres
    Artes, Tomas
    Cortes, Ana
    Margalef, Tomas
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1752 - 1761
  • [13] Response time assessment in forest fire spread simulation: An integrated methodology for efficient exploitation of available prediction time
    Cencerrado, Andres
    Cortes, Ana
    Margalef, Tomas
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2014, 54 : 153 - 164
  • [14] Big data integration shows Australian bush-fire frequency is increasing significantly
    Dutta, Ritaban
    Das, Aruneema
    Aryal, Jagannath
    [J]. ROYAL SOCIETY OPEN SCIENCE, 2016, 3 (02):
  • [15] El-Nesr M., FILLING GAPS TIME SE
  • [16] Famiglietti C., 2018, Final Report
  • [17] George E.S., 2010, P IEEE ASME INT C AD
  • [18] An Improved Cellular Automata for Wildfire Spread
    Ghisu, Tiziano
    Arca, Bachisio
    Pellizzaro, Grazia
    Duce, Pierpaolo
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 2287 - 2296
  • [19] Increase in the quality of the prediction of a computational wildfire behavior method through the improvement of the internal metaheuristic
    Mendez-Garabetti, Miguel
    Bianchini, German
    Caymes-Scutari, Paola
    Laura Tardivo, Maria
    [J]. FIRE SAFETY JOURNAL, 2016, 82 : 49 - 62
  • [20] Onur S, 2016, NAT HAZ RISK, V7, P1645