SHAP values accurately explain the difference in modeling accuracy of convolution neural network between soil full-spectrum and feature-spectrum

被引:34
作者
Zhong, Liang [1 ,2 ]
Guo, Xi [2 ]
Ding, Meng [3 ]
Ye, Yingcong [2 ]
Jiang, Yefeng [2 ]
Zhu, Qing [4 ]
Li, Jianlong [1 ]
机构
[1] Nanjing Univ, Sch Life Sci, State Key Lab Pharmaceut Biotechnol, Nanjing 210023, Peoples R China
[2] Jiangxi Agr Univ, Key Lab Poyang Lake Watershed Agr Resources & Ecol, Nanchang 330045, Peoples R China
[3] Jiangsu Environm Protect Grp Suzhou Co Ltd, Suzhou 215009, Peoples R China
[4] Shanghai Jiao Tong Univ, Sch Design, Shanghai 200240, Peoples R China
关键词
Deep learning; Convolutional neural network; Interpretability; SHAP values; Soil total nitrogen; ORGANIC-CARBON; TOTAL NITROGEN; REFLECTANCE; SPECTROSCOPY; PREDICTION; MATTER;
D O I
10.1016/j.compag.2024.108627
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Acquiring soil nutrient content quickly and accurately through remote sensing is the key to advance precision agriculture. The development of deep learning has provided new technical means for soil hyperspectral modeling. However, the problem of poor interpretability of deep learning models limits its development. Although SHapley Additive exPlanations (SHAP) values based on game theory have been successfully applied to the interpretation of deep learning soil spectral modeling, whether they can accurately explain the differences in deep learning model accuracy remains to be verified. Based on this, we explored whether SHAP values can accurately explain the differences in convolutional neural network (CNN) modeling accuracy. We collected soil samples from agricultural land in the Liangshui River Basin in the southern mountainous and hilly areas of China, and measured the soil total nitrogen (STN) content and soil spectral data in the laboratory. We compared the effects of full-spectrum and feature-spectrum on the accuracy of deep learning models, and obtained the contribution of wavelengths in the CNN modeling process by calculating SHAP values. The results showed that combining different spectral pre-processing methods can play their respective advantages and help improve modeling accuracy. Among them, the CNN model obtained the highest prediction accuracy under the firstderivative Savitzky-Golay smoothing combination standard normal variate (SG1-SNV) spectral pre-processing in full-spectrum modeling. Compared with the feature-spectrum selected for modeling by Mutual information (MI) and competitive adaptive reweighted sampling (CARS), the CNN model achieved higher accuracy in most pre-processed spectra in full-spectrum modeling, and SHAP values accurately explained this reason. This is because the contribution is usually higher at most wavelengths with a high correlation with STN content. The feature-spectrum selected by CARS is more widely distributed but lacks continuity, and some wavelengths with high correlation and high contribution will also be missed. Meanwhile, some wavelengths with low correlation also have high contributions, which are usually not involved in the feature spectrum modeling of MI, thus affecting the modeling accuracy. Therefore, the deep learning model is more suitable for full-spectrum modeling due to its strong feature extraction and self-learning capabilities, and SHAP can obtain the wavelength contribution of the CNN model in soil spectral modeling, and then explain the differences in modeling accuracy. This study further proves the interpretability of deep learning, provides an important basis for the application of deep learning in soil hyperspectral modeling.
引用
收藏
页数:13
相关论文
共 79 条
[1]   Explainable artificial intelligence (XAI) for interpreting the contributing factors feed into the wildfire susceptibility prediction model [J].
Abdollahi, Arnick ;
Pradhan, Biswajeet .
SCIENCE OF THE TOTAL ENVIRONMENT, 2023, 879
[2]   Predictive ability of soil properties to spectral degradation from laboratory Vis-NIR spectroscopy data [J].
Adeline, K. R. M. ;
Gomez, C. ;
Gorretta, N. ;
Roger, J. -M. .
GEODERMA, 2017, 288 :143-153
[3]   Iron and nitrogen interactions in groundnut nutrition [J].
Ali, ZI ;
Malik, EMA ;
Babiker, HM ;
Ramraj, VM ;
Sultana, A ;
Johansen, C .
COMMUNICATIONS IN SOIL SCIENCE AND PLANT ANALYSIS, 1998, 29 (17-18) :2619-2630
[4]   Estimation of soil inorganic carbon with visible near-infrared spectroscopy coupling of variable selection and deep learning in arid region of China [J].
Bai, Zijin ;
Chen, Songchao ;
Hong, Yongsheng ;
Hu, Bifeng ;
Luo, Defang ;
Peng, Jie ;
Shi, Zhou .
GEODERMA, 2023, 437
[5]   A regional-scale hyperspectral prediction model of soil organic carbon considering geomorphic features [J].
Bao, Yilin ;
Ustin, Susan ;
Meng, Xiangtian ;
Zhang, Xinle ;
Guan, Haixiang ;
Qi, Beisong ;
Liu, Huanjun .
GEODERMA, 2021, 403
[6]   Vis-SWIR spectral prediction model for soil organic matter with different grouping strategies [J].
Bao, Yilin ;
Meng, Xiangtian ;
Ustin, Susan ;
Wang, Xiang ;
Zhang, Xinle ;
Liu, Huanjun ;
Tang, Haitao .
CATENA, 2020, 195
[7]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[8]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[9]   Determination of carbon and nitrogen contents in Alfisols, Oxisols and Ultisols from Africa and Brazil using NIRS analysis:: Effects of sample grinding and set heterogeneity [J].
Brunet, Didier ;
Barthes, Bernard G. ;
Chotte, Jean-Luc ;
Feller, Christian .
GEODERMA, 2007, 139 (1-2) :106-117
[10]   Digital mapping of GlobalSoilMap soil properties at a broad scale: A review [J].
Chen, Songchao ;
Arrouays, Dominique ;
Mulder, Vera Leatitia ;
Poggio, Laura ;
Minasny, Budiman ;
Roudier, Pierre ;
Libohova, Zamir ;
Lagacherie, Philippe ;
Shi, Zhou ;
Hannam, Jacqueline ;
Meersmans, Jeroen ;
Richer-de-Forges, Anne C. ;
Walter, Christian .
GEODERMA, 2022, 409