Quantitative prediction of soil chromium content using laboratory-based visible and near-infrared spectroscopy with different ensemble learning models

被引:1
作者
Fu, Chengbiao [1 ]
Jiang, Yuheng [1 ]
Tian, Anhong [1 ,2 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Land Resource Engn, Kunming 650093, Peoples R China
关键词
Visible and Near-Infrared Spectroscopy; Soil Heavy Metal Cr; Base Learner; Ensemble Learning; Spatial Distribution; CONTAMINATION; RANIPET;
D O I
10.1016/j.asr.2024.07.056
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Soil heavy metal chromium pollution poses significant threats to human health and ecosystems, necessitating accurate quantitative prediction methods for effective monitoring and management. This study aims to develop robust predictive models for soil chromium content in farmland soils of Mojiang Hani Autonomous County, Pu'er City, Yunnan Province, China. These models utilize ensemble learning techniques based on visible and near-infrared spectroscopy. Operations before model building involved partitioning datasets with the Kennard-Stone algorithm to ensure representative training and testing sets. Visible and near-infrared spectroscopy-data preprocessing was performed using Savitzky-Golay smoothing and first-order derivative transformations to enhance signal quality. Bands selection was achieved through the Successive Projections Algorithm (SPA), effectively reducing data dimensionality and collinearity. Six ensemble learning models were constructed and assessed for their predictive performance: Bagging-DTR, Random Forest (RF), Adaboost-DTR, XGBoost-DTR, Stacking-1, and Stacking-2. These models utilized Decision Trees (DTR) and Linear Regression (LR) as base learners. Results demonstrated that ensemble models significantly outperformed individual base learners. Notably, the Stacking-2 model achieved the highest accuracy with an R<^>2 of 0.954, RMSE of 125.967 mg/kg, and RPD of 4.667. To validate the model's practical applicability, spatial interpolation of soil Cr content was conducted using the Kriging method based on Stacking-2 model predictions. The spatial distribution maps of measured and predicted values exhibited high congruence, underscoring the model's effectiveness in accurately mapping Cr distribution across the study area. This study underscores the efficacy of integrating ensemble learning with visible and near-infrared spectroscopy-data preprocessing and SPA for precise soil heavy metal prediction. The findings offer valuable insights and a scientific basis for enhanced soil quality monitoring, environmental risk assessment, and informed agricultural land management and pollution control. (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:4705 / 4720
页数:16
相关论文
共 54 条
[1]  
Andiyappan K, 2017, ORIENT J CHEM, V33, P3092, DOI 10.13005/ojc/330650
[2]   Modeling potential arsenic enrichment and distribution using stacking ensemble learning in the lower Yellow River Plain, China [J].
Cao, Wengeng ;
Fu, Yu ;
Cheng, Yanpei ;
Zhai, Wenhua ;
Sun, Xiaoyue ;
Ren, Yu ;
Pan, Deng .
JOURNAL OF HYDROLOGY, 2023, 625
[3]   Hyperspectral detection of sugar content for sugar-sweetened apples based on sample grouping and SPA feature selecting methods [J].
Chen, Jie ;
Bai, Tiecheng ;
Zhang, Nannan ;
Zhu, Lixia ;
Zhang, Xiao .
INFRARED PHYSICS & TECHNOLOGY, 2022, 125
[4]   Development of a soil heavy metal estimation method based on a spectral index: Combining fractional-order derivative pretreatment and the absorption mechanism [J].
Chen, Lihan ;
Lai, Jian ;
Tan, Kun ;
Wang, Xue ;
Chen, Yu ;
Ding, Jianwei .
SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 813
[5]   Predicting individual apple tree yield using UAV multi-source remote sensing data and ensemble learning [J].
Chen, Riqiang ;
Zhang, Chengjian ;
Xu, Bo ;
Zhu, Yaohui ;
Zhao, Fa ;
Han, Shaoyu ;
Yang, Guijun ;
Yang, Hao .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 201
[6]   Evaluating validation strategies on the performance of soil property prediction from regional to continental spectral data [J].
Chen, Songchao ;
Xu, Hanyi ;
Xu, Dongyun ;
Ji, Wenjun ;
Li, Shuo ;
Yang, Meihua ;
Hu, Bifeng ;
Zhou, Yin ;
Wang, Nan ;
Arrouays, Dominique ;
Shi, Zhou .
GEODERMA, 2021, 400
[7]   Hyperspectral image classification method based on semantic filtering and ensemble learning [J].
Cui, Binge ;
Dong, Wenwen ;
Yin, Bei ;
Li, Xinhui ;
Cui, Jianming .
INFRARED PHYSICS & TECHNOLOGY, 2023, 135
[8]   Estimation of soil copper content based on fractional-order derivative spectroscopy and spectral characteristic band selection [J].
Cui, Shichao ;
Zhou, Kefa ;
Ding, Rufu ;
Cheng, Yinyi ;
Jiang, Guo .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 275
[9]   A novel method for detecting soil salinity using AVIRIS-NG imaging spectroscopy and ensemble machine learning [J].
Das, Ayan ;
Bhattacharya, Bimal Kumar ;
Setia, Raj ;
Jayasree, G. ;
Das, Bhabani Sankar .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2023, 200 :191-212
[10]   A new method to estimate soil organic matter using the combination model based on short memory fractional order derivative and machine learning model [J].
Fu, Chengbiao ;
Gan, Shu ;
Xiong, Heigang ;
Tian, Anhong .
INFRARED PHYSICS & TECHNOLOGY, 2023, 134