FC-StackGNB: A novel machine learning modeling framework for forest fire risk prediction combining feature crosses and model fusion algorithm

被引:2
作者
Su, Ye [1 ]
Zhao, Longlong [1 ]
Li, Xiaoli [1 ]
Li, Hongzhong [1 ]
Ge, Yuankai [2 ]
Chen, Jinsong [1 ,3 ]
机构
[1] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[2] China Tiesiju Civil Engn Grp CO LTD, Engn CO LTD 8, Hefei 230023, Peoples R China
[3] Shenzhen Engn Lab Ocean Environm Big Data Anal & A, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Environmental factors; Feature crosses; Model fusion; Forest fire risk; Machine learning; FC-StackGNB; SAMPLING ALGORITHM; SUSCEPTIBILITY; REGRESSION; DATASET;
D O I
10.1016/j.ecolind.2024.112577
中图分类号
X176 [生物多样性保护];
学科分类号
090705 ;
摘要
Forest fire risk prediction is a crucial link in maintaining forest ecological security. Machine learning, due to its powerful non-linear modeling capabilities, has been widely applied in forest fire risk prediction research. However, existing studies often focus on the direct information provided by multiple environmental factor features when constructing the feature space, while overlooking the deeper information conveyed by feature cross-correlations. Additionally, fire risk prediction predominantly relies on single-model forecasting, exhibiting slightly insufficient generalization and stability in models. Model fusion algorithms (MFA) can combine the advantages of multiple models to compensate for this limitation. In this study, a machine learning framework, FC-StackGNB, combining feature crosses (FC) and model fusion, is proposed. This framework employs the FC method to analyze the temporal trends of various environmental factors influencing fire occurrence, constructing multiple seasonal cross features (SCFs) capable of effectively capturing the non-linear relationship between environmental factors and time. Moreover, the framework develops a Gaussian Naive Bayes (GNB) optimized stacking MFA to fully leverage the strengths of different ML algorithms. Results demonstrate that the introduction of SCFs effectively enhances the prediction performance of six machine learning models, with the mean values of five evaluation metrics (Accuracy, Precision, Recall, F1-score, and ROC_AUC) increasing by 1.58% to 6.30%. The fusion model constructed based on the StackGNB algorithm can effectively handle the multicollinearity issue of features, exhibiting significantly better prediction performance than single models, particularly in improving the Recall metric (increasing by around 3% and 5% compared to the top two ranked single models respectively), which signifies the model's ability to predict positive samples (i.e., high-risk fire areas). The proposed modeling framework effectively enhances the robustness and prediction performance of the models, offering new modeling insights for subsequent research. This study holds significant importance for enhancing the level of forest fire risk warning.
引用
收藏
页数:14
相关论文
共 60 条
[1]   An intelligent system for forest fire risk prediction and fire fighting management in Galicia [J].
Alonso-Betanzos, A ;
Fontenla-Romero, O ;
Guijarro-Berdiñas, B ;
Hernández-Pereira, E ;
Andrade, MIP ;
Jiménez, E ;
Soto, JLL ;
Carballas, T .
EXPERT SYSTEMS WITH APPLICATIONS, 2003, 25 (04) :545-554
[2]  
[Anonymous], 2006, SPR S STAT
[3]   Modeling and prediction of fire occurrences along an elevational gradient in Western Himalayas [J].
Bar, Somnath ;
Parida, Bikash Ranjan ;
Pandey, Arvind Chandra ;
Shankar, B. Uma ;
Kumar, Pankaj ;
Panda, Santosh K. ;
Behera, Mukunda Dev .
APPLIED GEOGRAPHY, 2023, 151
[4]   An empirical comparison of voting classification algorithms: Bagging, boosting, and variants [J].
Bauer, E ;
Kohavi, R .
MACHINE LEARNING, 1999, 36 (1-2) :105-139
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   A robust gradient sampling algorithm for nonsmooth, nonconvex optimization [J].
Burke, JV ;
Lewis, AS ;
Overton, ML .
SIAM JOURNAL ON OPTIMIZATION, 2005, 15 (03) :751-779
[7]   Improved Prediction of Forest Fire Risk in Central and Northern China by a Time-Decaying Precipitation Model [J].
Chen, Jiajun ;
Wang, Xiaoqing ;
Yu, Ying ;
Yuan, Xinzhe ;
Quan, Xiangyin ;
Huang, Haifeng .
FORESTS, 2022, 13 (03)
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]   Performance evaluation of the GIS-based data mining techniques of best-first decision tree, random forest, and naive Bayes tree for landslide susceptibility modeling [J].
Chen, Wei ;
Zhang, Shuai ;
Li, Renwei ;
Shahabi, Himan .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 644 :1006-1018
[10]   ResGANet: Residual group attention network for medical image classification and segmentation [J].
Cheng, Junlong ;
Tian, Shengwei ;
Yu, Long ;
Gao, Chengrui ;
Kang, Xiaojing ;
Ma, Xiang ;
Wu, Weidong ;
Liu, Shijia ;
Lu, Hongchun .
MEDICAL IMAGE ANALYSIS, 2022, 76