Multi-model fusion stacking ensemble learning method for the prediction of berberine by FT-NIR spectroscopy

Cited by: 12
Authors
Li, Xiaoyu [1]
Chen, Huazhou [1, 2, 4]
Xu, Lili [3]
Mo, Qiushuang [1]
Du, Xinrong [1]
Tang, Guoqiang [1, 2]
Affiliations
[1] Guilin Univ Technol, Sch Math & Stat, Guilin 541004, Peoples R China
[2] Guilin Univ Technol, Ctr Data Anal & Algorithm Technol, Guilin 541004, Peoples R China
[3] Beibu Gulf Univ, Coll Marine Sci, Qinzhou 535011, Peoples R China
[4] Guilin Univ Technol, Sch Math & Stat, 12 Jiangan Rd, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
FT-NIR spectroscopy; Berberine; Stacking ensemble learning; Particle swarm optimization algorithm; Adaptive inertia weight;
DOI
10.1016/j.infrared.2024.105169
Chinese Library Classification number
TH7 [Instruments and meters];
Discipline classification code
0804 ; 080401 ; 081102 ;
Abstract
Rhizoma Coptidis is a Chinese herbal medicine with antibacterial and anti-inflammatory properties and extensive applications in modern medicine. Its berberine content directly determines its quality. Fourier transform near-infrared (FT-NIR) spectroscopy is a commonly used non-destructive method for rapidly detecting berberine content. In contrast to single supervised learning algorithms, ensemble learning combines individual learners to create a more stable, better-performing model. This study collected spectral data of Rhizoma Coptidis using FT-NIR spectroscopy and established a chemometric model using a multi-model stacking ensemble approach. Partial least squares (PLS), adaptive boosting (AdaBoost), gradient boosting decision trees (GBDT), random forest (RF), and extreme gradient boosting (XGBoost) regression models were chosen as candidate base models, and different stacking models were established from random combinations of them. To fully leverage the strengths of each model and enhance predictive capability, an adaptive inertia weight particle swarm optimization algorithm (AWPSO) was used to search for the optimal parameters. Model performance was evaluated with the correlation coefficient of the test set (RT) and the root mean square error of the test set (RMSET). Finally, AWPSO-RF, AWPSO-XGBoost, and AWPSO-AdaBoost were selected as the base models. The RMSET values for RF, XGBoost, and AdaBoost were 0.226, 0.250, and 0.195, and the RT values were 0.871, 0.830, and 0.927, respectively. After optimization with the AWPSO algorithm, the RMSET values for AWPSO-RF, AWPSO-XGBoost, and AWPSO-AdaBoost were 0.226, 0.245, and 0.194, and the RT values were 0.871, 0.843, and 0.922, respectively. The stacking ensemble achieved an RMSET of 0.174 and an RT of 0.932. The prediction accuracy and generalization ability of multi-model fusion stacking ensemble learning are superior to those of the single-model regression methods. Therefore, the stacking ensemble learning method that combines AdaBoost, RF, and XGBoost regression models is effective and feasible for assisting in the detection of berberine content in Rhizoma Coptidis.
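
The parameter search mentioned in the abstract can be illustrated with a minimal particle swarm sketch. This is not the authors' AWPSO: the abstract does not specify the adaptive rule, so the example assumes a simple linearly decreasing inertia weight as a stand-in. The function names (`rmse`, `pso_minimize`, `line_rmse`), the coefficients `c1` and `c2`, the swarm size, and the toy objective (fitting a line by minimizing RMSE) are all hypothetical choices made for illustration. In the study's setting, the objective would presumably be a validation error of a base model as a function of its hyperparameters.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def pso_minimize(objective, bounds, n_particles=30, n_iters=150,
                 w_max=0.9, w_min=0.4, c1=1.5, c2=1.5, seed=0):
    """Minimize `objective` inside the box `bounds` with a basic PSO.

    Assumption: the inertia weight decays linearly from w_max to w_min.
    This schedule only stands in for the paper's adaptive rule.
    """
    rng = random.Random(seed)
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                  # best position seen by each particle
    pbest_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]  # best position seen by the swarm

    for it in range(n_iters):
        # Large inertia early (exploration), small inertia late (exploitation).
        w = w_max - (w_max - w_min) * it / (n_iters - 1)
        for i in range(n_particles):
            for d, (lo, hi) in enumerate(bounds):
                r1, r2 = rng.random(), rng.random()
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Keep the particle inside the search box.
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


# Toy data generated from y = 2x + 1 (noise-free, purely illustrative).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2.0 * x + 1.0 for x in xs]


def line_rmse(params):
    """RMSE of the line y = a*x + b on the toy data."""
    a, b = params
    return rmse(ys, [a * x + b for x in xs])


best_params, best_rmse = pso_minimize(line_rmse, bounds=[(-5.0, 5.0), (-5.0, 5.0)])
print(best_params, best_rmse)
```

The swarm should settle near slope 2 and intercept 1, where the RMSE is zero. Swapping `line_rmse` for a function that trains a model with given hyperparameters and returns its validation RMSE would turn the same loop into a hyperparameter search.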
Pages: 10