Data-Driven Wildfire Risk Prediction in Northern California

被引:41
作者
Malik, Ashima [1 ]
Rao, Megha Rajam [1 ]
Puppala, Nandini [1 ]
Koouri, Prathusha [1 ]
Thota, Venkata Anil Kumar [1 ]
Liu, Qiao [1 ]
Chiao, Sen [2 ]
Gao, Jerry [3 ]
机构
[1] San Jose State Univ, Dept Appl Data Sci, San Jose, CA 95192 USA
[2] San Jose State Univ, Dept Meteorol & Climate Sci, San Jose, CA 95192 USA
[3] San Jose State Univ, Dept Comp Engn, San Jose, CA 95192 USA
关键词
wildfire; wildfire risks prediction; machine learning; random forest; spatial and temporal accuracy;
D O I
10.3390/atmos12010109
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Over the years, rampant wildfires have plagued the state of California, creating economic and environmental loss. In 2018, wildfires cost nearly 800 million dollars in economic loss and claimed more than 100 lives in California. Over 1.6 million acres of land has burned and caused large sums of environmental damage. Although, recently, researchers have introduced machine learning models and algorithms in predicting the wildfire risks, these results focused on special perspectives and were restricted to a limited number of data parameters. In this paper, we have proposed two data-driven machine learning approaches based on random forest models to predict the wildfire risk at areas near Monticello and Winters, California. This study demonstrated how the models were developed and applied with comprehensive data parameters such as powerlines, terrain, and vegetation in different perspectives that improved the spatial and temporal accuracy in predicting the risk of wildfire including fire ignition. The combined model uses the spatial and the temporal parameters as a single combined dataset to train and predict the fire risk, whereas the ensemble model was fed separate parameters that were later stacked to work as a single model. Our experiment shows that the combined model produced better results compared to the ensemble of random forest models on separate spatial data in terms of accuracy. The models were validated with Receiver Operating Characteristic (ROC) curves, learning curves, and evaluation metrics such as: accuracy, confusion matrices, and classification report. The study results showed and achieved cutting-edge accuracy of 92% in predicting the wildfire risks, including ignition by utilizing the regional spatial and temporal data along with standard data parameters in Northern California.
引用
收藏
页数:19
相关论文
共 28 条
  • [1] Andres C, 2013, SCI WORLD J, V2013
  • [2] [Anonymous], WORLD EC FORUM
  • [3] [Anonymous], LANDSAT SURFACE REFL
  • [4] [Anonymous], NDVI NDBI NDWI CALCU, V7, P8
  • [5] [Anonymous], NAIP Imagery, United States Department of Agriculture from Service Agency
  • [6] [Anonymous], CalMatters
  • [7] [Anonymous], CALIFORNIA DEP FORES
  • [8] [Anonymous], LANDSAT SCI NASA
  • [9] [Anonymous], Fire Perimeters GIS Data
  • [10] Evolutionary-Statistical System: A parallel method for improving forest fire spread prediction
    Bianchini, G.
    Caymes-Scutari, P.
    Mendez-Garabetti, M.
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2015, 6 : 58 - 66