Estimation of Korla Fragrant Pear Leaf Water Content Using Near-Infrared Spectroscopy Combined with Machine Learning

被引:0
作者
Yu, Mingyang [1 ,2 ,3 ]
Fan, Weifan [1 ,2 ,3 ]
Wang, Lanfei [1 ,2 ,3 ]
Chen, Yufeng [1 ,2 ,3 ]
Wang, Hao [4 ]
Guo, Kailu [1 ,2 ,3 ]
Bao, Jianping [1 ,2 ,3 ]
机构
[1] Tarim Univ, Coll Hort & Forestry Sci, Alar 843300, Peoples R China
[2] Xinjiang Prod & Construct Corps, Tarim Basin Biol Resources Protect & Utilizat Key, Alar 843300, Peoples R China
[3] High Qual Cultivat & Deep Proc Fruit Prod Proc Tec, Southern Xinjiang Special Fruit Trees High Qual, Alar 843300, Peoples R China
[4] Nanjing Agr Univ, Coll Hort & Forestry Sci, Nanjing 210095, Peoples R China
来源
AGRONOMY-BASEL | 2025年 / 15卷 / 04期
基金
中国国家自然科学基金;
关键词
Korla fragrant pear; leaf water content; machine learning; near-infrared spectroscopy; predictive modeling; QUALITY; FRUIT;
D O I
10.3390/agronomy15040876
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
In modern agricultural production, accurately estimating the leaf water content (LWC) of Korla fragrant pear is crucial for achieving scientific irrigation and ensuring fruit quality. However, constructing accurate and effective LWC prediction models remains challenging due to limitations in sample selection, spectral feature analysis, and model applicability. To address these issues, this study was conducted to systematically optimize the process. During sample collection, a random split method was employed to divide the dataset into modeling and testing sets at a ratio of 75%:25%. This approach ensures computational efficiency, avoids data leakage, and balances training and evaluation needs, particularly for small- to medium-sized datasets. Specifically, in stage S1, 352 samples were allocated to the modeling set and 108 to the testing set, while in stage S2, 137 and 58 samples were assigned, respectively. The analysis revealed slight differences in LWC distribution and standard deviation between the modeling and testing sets, validating the scientific rigor of dataset division. For instance, the LWC distribution in the S1 modeling set ranged from 4.88% to 83.45%, with a standard deviation of 11.33%. The spectral acquisition process within the range of 4000 cm-1 to 10,000 cm-1 exhibited complex absorbance variation trends, showing distinct characteristics across different intervals. Preprocessing techniques such as SG convolution smoothing, MSC, and SNV significantly reduced the absorbance variability and enhanced spectral features. Notably, the selection of LWC feature bands differed markedly between stages S1 and S2. For example, in S1, SNV-SPA (successive projections algorithm) feature bands were concentrated around 5000 cm-1, 6000 cm-1, and 7000 cm-1, whereas their positions shifted significantly in S2, reflecting the growth dynamics of the Korla fragrant pear. During the model-building phase, various algorithms, including Random Forest Regression (RFR), Backpropagation Neural Network (BP), and Support Vector Regression (SVR), were compared. Under different feature selections, the RFR model demonstrated strong predictive ability with determination coefficients (R2) exceeding 0.75 and root mean square errors (RMSE) below 0.7%. Specifically, the SNV-CARS-BP model achieved an R2 of 0.81594 in S1, while the SNV-SPA-RFR model reached an R2 of 0.817756 in S2, with relative deviations between the predicted and actual values of less than 5%. These results provide robust support for the precise LWC monitoring of Korla fragrant pear and offer valuable insights for subsequent research.
引用
收藏
页数:33
相关论文
共 47 条
[1]   Determination of leaf nitrogen content in apple and jujube by near-infrared spectroscopy [J].
Bao, Jianping ;
Yu, Mingyang ;
Li, Jiaxin ;
Wang, Guanli ;
Tang, Zhihui ;
Zhi, Jinhu .
SCIENTIFIC REPORTS, 2024, 14 (01)
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Region-aware network: Model human's Top-Down visual perception mechanism for crowd counting [J].
Chen, Yuehai ;
Yang, Jing ;
Zhang, Dong ;
Zhang, Kun ;
Chen, Badong ;
Du, Shaoyi .
NEURAL NETWORKS, 2022, 148 :219-231
[4]   Gene selection and classification of microarray data using random forest -: art. no. 3 [J].
Díaz-Uriarte, R ;
de Andrés, SA .
BMC BIOINFORMATICS, 2006, 7 (1)
[5]   Variability in spectral absorbance metrics across boreal lake waters [J].
Erlandsson, Martin ;
Futter, Martyn N. ;
Kothawala, Dolly N. ;
Kohler, Stephan J. .
JOURNAL OF ENVIRONMENTAL MONITORING, 2012, 14 (10) :2643-2652
[6]   Estimation of potato canopy leaf water content in various growth stages using UAV hyperspectral remote sensing and machine learning [J].
Guo, Faxu ;
Feng, Quan ;
Yang, Sen ;
Yang, Wanxia .
FRONTIERS IN PLANT SCIENCE, 2024, 15
[7]   Tomato fruit quality and content depend on stage of maturity [J].
Helyes, Lajos ;
Pek, Zoltan ;
Lugasi, Andrea .
HORTSCIENCE, 2006, 41 (06) :1400-1401
[8]   A Complex Terrain Simulation Approach Using Ensemble Learning of Random Forest Regression [J].
Huang, Zechun ;
Liu, Zipu .
JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2022, 50 (10) :2011-2023
[9]   ZCHSound: Open-Source ZJU Paediatric Heart Sound Database With Congenital Heart Disease [J].
Jia, Weijie ;
Wang, Yunyan ;
Chen, Renwei ;
Ye, Jingjing ;
Li, Die ;
Yin, Fei ;
Yu, Jin ;
Chen, Jiajia ;
Shu, Qiang ;
Xu, Weize .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2024, 71 (08) :2278-2286
[10]   Changes in the Metabolome and Nutritional Quality of Pulp from Three Types of Korla Fragrant Pears with Different Appearances as Revealed by Widely Targeted Metabolomics [J].
Jiang, Wei ;
Yan, Pan ;
Zheng, Qiangqing ;
Wang, Zhendong ;
Chen, Qiling ;
Wang, Yi .
PLANTS-BASEL, 2023, 12 (23)