An Improved XGBoosting Algorithm Based on Fat Content in Infant Milk Powder Prediction Model

被引:0
作者
Zhang Wen-jing [1 ,2 ]
Xue He-ru [1 ,2 ]
Jiang Xin-hua [1 ,2 ]
Liu Jiang-ping [1 ,2 ]
Huang Qing [1 ,2 ]
机构
[1] Inner Mongolia Agr Univ, Coll Comp & Informat Engn, Hohhot 010018, Peoples R China
[2] Agr Univ, Inner Mongolia Key Lab Big Date Res & Applicat Ag, Hohhot 010018, Peoples R China
关键词
Hyperspectral; Bayesian optimization; XGBoosting model; Fat content; Non-destructive testing;
D O I
暂无
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
Fat plays an essential role in the composition of infant formula. Not only is fat a vital component of a baby's growth and development, but it also provides essential energy for growth. It is crucial for the development of the infant brain and the formation of nerve myelin. Chemical methods for determining the fat content of infant milk powder, such as ether extraction, are sensitive but have the disadvantage of destroying samples and having a long detection period. In this paper, the hyperspectral data undergoes preprocessing processes with standard normal transform (SNV), multiple scattering corrections (MSC), Savitzky-Golay smoothing, and Roust method using different stages of infant milk powder in Inner Mongolia, China. A competitive adaptive re-weighting algorithm, CARS, was used to sift out redundant wavelengths from the spectroscopic data at 125 feature wavelengths, leaving 66 valid wavelengths. The Bayesian optimization algorithm optimizes the XGBoosting prediction model, leading to a BO-XGBoosting model that predicts the fat content of infant formula better than the original model. The experimental results show that the model predicts better than the traditional partial least squares regression (PLSR) and support vector machine (SVR) regression model, outperforming the Bagging and GrdientBoosting algorithms in the integrated algorithm. In the BO-XGBoosting model in the test set experiments, the decision coefficient R-2 and root mean square error of prediction (RMSEP) obtained are 0. 953 7 and 0. 577 3, which are 2. 91% higher and 19. 2% lower than the determination coefficient R-2 and root mean squared error of prediction (RMSEP) of the XGBoosting model's R-2 and RMSEP, respectively. This study provides algorithmic support and a theoretical foundation for BO-XGBooting based rapid, non-destructive detection of infant formula fat content.
引用
收藏
页码:1464 / 1471
页数:8
相关论文
共 18 条
  • [1] Cui B. W., 2020, Food Industry, V41, P293
  • [2] Cui Jia-Xu, 2018, Journal of Software, V29, P3068, DOI 10.13328/j.cnki.jos.005607
  • [3] Debora A P, 2017, LWT-Food Science and Technology, P337
  • [4] Quantitative analysis of melamine in milk powders using near-infrared hyperspectral imaging, and band ratio
    Huang, Min
    Kim, Moon S.
    Delwiche, Stephen R.
    Chao, Kuanglin
    Qin, Jianwei
    Mo, Changyeun
    Esquerre, Carlos
    Zhu, Qibing
    [J]. JOURNAL OF FOOD ENGINEERING, 2016, 181 : 10 - 19
  • [5] A Review Towards Hyperspectral Imaging for Real-Time Quality Control of Food Products with an Illustrative Case Study of Milk Powder Production
    Khan, Asma
    Munir, M. T.
    Yu, W.
    Young, B. R.
    [J]. FOOD AND BIOPROCESS TECHNOLOGY, 2020, 13 (05) : 739 - 752
  • [6] Khan Asma, 2020, Sensors
  • [7] Lee H., 2018, Chemistry. Analysis. Control. Exposure & Risk Assessment, P1027
  • [8] Detection of Melamine Adulteration in Milk Powder by Using Optical Spectroscopy Technologies in the Last Decade-a Review
    Liang, Wenting
    Wei, Yuqiang
    Gao, Mengjie
    Yan, Xin
    Zhu, Xinhua
    Guo, Wenchuan
    [J]. FOOD ANALYTICAL METHODS, 2020, 13 (11) : 2059 - 2069
  • [9] Prediction of milk protein content based on improved sparrow search algorithm and optimized back propagation neural network
    Liu, Jiangping
    Hu, Pengwei
    Xue, Heru
    Pan, Xin
    Chen, Chen
    [J]. SPECTROSCOPY LETTERS, 2022, 55 (04) : 229 - 239
  • [10] Munir MT., 2018, Journal of Food Engineering, P1