Comparison of Gaussian process regression, partial least squares, random forest and support vector machines for a near infrared calibration of paracetamol samples

被引:4
|
作者
Sow, Aminata [1 ]
Traore, Issiaka [1 ]
Diallo, Tidiane [2 ,3 ]
Traore, Mohamed [4 ]
Ba, Abdramane [1 ]
机构
[1] Univ Sci Tech & Technol Bamako, Fac Sci & Tech FST, Lab Opt Spect & Sci Atmospher LOSSA, Bamako, Mali
[2] Univ Sci Tech & Technol Bamako, Fac Pharm, Dept Sci Medicament, Bamako, Mali
[3] Lab Natl Sante LNS, Bamako, Mali
[4] Ecole Natl Ingn Abderhamane Baba Toure, Bamako, Mali
关键词
Paracetamol; Near Infrared Spectroscopy; Data preprocessing; Nonlinear regression models; Linear regression techniques; COMPONENTS; TABLETS;
D O I
10.1016/j.rechem.2022.100508
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this article, we analyze the near-infrared (NIR) spectra of fifty-eight (58) commercial tablets of 500 mg of paracetamol from different origins (that is, with different batch numbers) in the local markets in Bamako. The NIR spectra were recorded in the spectral range 930 nm-1700 nm. The samples are divided into forty-eight (48) samples forming the set of calibration (training set) and ten (10) samples used as the validation or test set. To perform multivariate calibration, we apply-three nonlinear regression techniques (Gaussian processes regression (GPR), Random Forest (RF), Support vector machine (KSVM)), along with the traditional linear partial leastsquares regression (PLSR) to several data pretreatments of the 58 samples. The results show that the three nonlinear regression calibrations have better prediction performance than PLS as far as RMSE is concerned. To decide the best regression model, we avoid R2 since this quantity is not a good parameter for this purpose. We will instead consider RMSE when comparing the different multivariate models. Additionally, to assess the impact of data preprocessing, we apply the above regression techniques to the original data, Multi-scattering correction (MSC), standard variate normalization (SNV) correction, smoothing correction, first derivative (FD), and second derivative correction (SD). The overall results reveal that Gaussian Processes Regression (GPR) applied to smooth correction gives the lowest RMSEP = 2.303053e-06 for validation (prediction) and RMSEC = 2.112316e-06 for calibration. In our investigation, one also notices that the developed GPR model is more accurate and exhibits enhanced behavior no matter which data preprocessing is used. All in all, GPR can be seen as an alternative powerful regression tool for NIR spectra of paracetamol samples. The statistical parameters of the proposed model are compared to the results of some other models reported in the literature.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Combined prediction model based on partial least squares regression and its application to near infrared spectroscopy quantitative analysis
    Cheng Zhong
    Zhu Ai-Shi
    Chen De-Zhao
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2007, 35 (07) : 978 - 982
  • [42] Application of partial least-squares regression to near-infrared reflectance spectroscopic determination of shive content in flax
    Sohn, M
    Barton, FE
    Morrison, WH
    Archibald, DD
    APPLIED SPECTROSCOPY, 2003, 57 (05) : 551 - 556
  • [43] Support Vector Regression based Spectral Calibration Model Building for In-situ Process Measurement via Near-infrared Spectroscopy
    Qu, Xiaoyu
    Li, Tao
    Wu, Yang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 2531 - 2536
  • [44] Determination of Eucalyptus globulus wood extractives content by near infrared-based partial least squares regression models: comparison between extraction procedures
    Alves, Ana M. M.
    Simoes, Rita F. S.
    Santos, Claudia A.
    Potts, Brad M.
    Rodrigues, Jose
    Schwanninger, Manfred
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2012, 20 (02) : 275 - 285
  • [45] Determination of acetic acid of fruit vinegars using near infrared spectroscopy and least squares-support vector machine
    Liu, Fei
    Wang, Li
    He, Yong
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 1232 - 1237
  • [46] Mutual information-induced interval selection combined with kernel partial least squares for near-infrared spectral calibration
    Tan, Chao
    Li, Menglong
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2008, 71 (04) : 1266 - 1273
  • [47] A near-infrared spectroscopy method for the detection of texture profile analysis of Litopeneo vannamei based on partial least squares regression
    Wei, Changhui
    Li, Xinxing
    JOURNAL OF FOOD PROCESS ENGINEERING, 2022, 45 (10)
  • [48] Developing near infrared spectroscopy calibration model of molar ratio between methanol and isobutylene by support vector regression
    Chu Xiao-li
    Yuan Hong-fu
    Luo Xian-hui
    Xu Yu-peng
    Lu Wan-zhen
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2008, 28 (06) : 1227 - 1231
  • [49] Quantitative analysis of tea using ytterbium-based internal standard near-infrared spectroscopy coupled with boosting least-squares support vector regression
    Luo, Rui-Min
    Tan, Shi-Miao
    Zhou, Yan-Ping
    Liu, Shu-Juan
    Xu, Hui
    Song, Dan-Dan
    Cui, Yan-Fang
    Fu, Hai-Yan
    Yang, Tian-Ming
    JOURNAL OF CHEMOMETRICS, 2013, 27 (7-8) : 198 - 206
  • [50] Identification of Wine According to Grape Variety Using Near-Infrared Spectroscopy Based on Radial Basis Function Neural Networks and Least-Squares Support Vector Machines
    Jing Yu
    Jicheng Zhan
    Weidong Huang
    Food Analytical Methods, 2017, 10 : 3306 - 3311