Comparison of Gaussian process regression, partial least squares, random forest and support vector machines for a near infrared calibration of paracetamol samples

被引:4
|
作者
Sow, Aminata [1 ]
Traore, Issiaka [1 ]
Diallo, Tidiane [2 ,3 ]
Traore, Mohamed [4 ]
Ba, Abdramane [1 ]
机构
[1] Univ Sci Tech & Technol Bamako, Fac Sci & Tech FST, Lab Opt Spect & Sci Atmospher LOSSA, Bamako, Mali
[2] Univ Sci Tech & Technol Bamako, Fac Pharm, Dept Sci Medicament, Bamako, Mali
[3] Lab Natl Sante LNS, Bamako, Mali
[4] Ecole Natl Ingn Abderhamane Baba Toure, Bamako, Mali
关键词
Paracetamol; Near Infrared Spectroscopy; Data preprocessing; Nonlinear regression models; Linear regression techniques; COMPONENTS; TABLETS;
D O I
10.1016/j.rechem.2022.100508
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this article, we analyze the near-infrared (NIR) spectra of fifty-eight (58) commercial tablets of 500 mg of paracetamol from different origins (that is, with different batch numbers) in the local markets in Bamako. The NIR spectra were recorded in the spectral range 930 nm-1700 nm. The samples are divided into forty-eight (48) samples forming the set of calibration (training set) and ten (10) samples used as the validation or test set. To perform multivariate calibration, we apply-three nonlinear regression techniques (Gaussian processes regression (GPR), Random Forest (RF), Support vector machine (KSVM)), along with the traditional linear partial leastsquares regression (PLSR) to several data pretreatments of the 58 samples. The results show that the three nonlinear regression calibrations have better prediction performance than PLS as far as RMSE is concerned. To decide the best regression model, we avoid R2 since this quantity is not a good parameter for this purpose. We will instead consider RMSE when comparing the different multivariate models. Additionally, to assess the impact of data preprocessing, we apply the above regression techniques to the original data, Multi-scattering correction (MSC), standard variate normalization (SNV) correction, smoothing correction, first derivative (FD), and second derivative correction (SD). The overall results reveal that Gaussian Processes Regression (GPR) applied to smooth correction gives the lowest RMSEP = 2.303053e-06 for validation (prediction) and RMSEC = 2.112316e-06 for calibration. In our investigation, one also notices that the developed GPR model is more accurate and exhibits enhanced behavior no matter which data preprocessing is used. All in all, GPR can be seen as an alternative powerful regression tool for NIR spectra of paracetamol samples. The statistical parameters of the proposed model are compared to the results of some other models reported in the literature.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Determination of Tetracyclines by near-infrared (NIR) Spectroscopy and partial least-squares (PLS) regression method
    Szlyk, Edward
    Kowalczyk-Marzec, Agnieszka
    Koter, Izabela
    CHEMIA ANALITYCZNA, 2007, 52 (04): : 605 - 617
  • [32] Detection of oil yield from oil shale based on near-infrared spectroscopy combined with wavelet transform and least squares support vector machines
    Zhang, Fudong
    Liu, Jie
    Lin, Jun
    Wang, Zhihong
    INFRARED PHYSICS & TECHNOLOGY, 2019, 97 : 224 - 228
  • [33] Near infrared spectroscopy combined with least squares support vector machines and fuzzy rule-building expert system applied to diagnosis of endometrial carcinoma
    Yang, Fan
    Tian, Jing
    Xiang, Yuhong
    Zhang, Zhuoyong
    Harrington, Peter de B.
    CANCER EPIDEMIOLOGY, 2012, 36 (03) : 317 - 323
  • [34] A practical approach for near infrared spectral quantitative analysis of complex samples using partial least squares modeling
    Liu ZhiChao
    Ma Xiang
    Wen YaDong
    Wang Yi
    Cai WenSheng
    Shao XueGuang
    SCIENCE IN CHINA SERIES B-CHEMISTRY, 2009, 52 (07): : 1021 - 1027
  • [35] Partial least squares regression method based on consensus modeling for quantitative analysis of near-infrared spectra
    Li Yan-Kun
    Shao Xue-Guang
    Cai Wen-Sheng
    CHEMICAL JOURNAL OF CHINESE UNIVERSITIES-CHINESE, 2007, 28 (02): : 246 - 249
  • [36] Application of Near-Infrared Spectroscopy for Evaluation of Drying Stress on Lumber Surface: A Comparison of Artificial Neural Networks and Partial Least Squares Regression
    Watanabe, Ken
    Kobayashi, Isao
    Matsushita, Yasuhiro
    Saito, Shuetsu
    Kuroda, Naohiro
    Noshiro, Shuichi
    DRYING TECHNOLOGY, 2014, 32 (05) : 590 - 596
  • [37] Investigation of an on-line detection method combining near infrared spectroscopy with local partial least squares regression for the elution process of sodium aescinate
    Jin, Ye
    Ding, Haiying
    Liu, Xuesong
    Wan, Xinmin
    Luan, Lianjun
    Wu, Yongjiang
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2013, 109 : 68 - 78
  • [38] Ensemble Partial Least Squares Algorithm in Mutual Information-Induced Subspace for Near-infrared Quantitative Calibration
    Tan Chao
    Qin Xin
    Li Meng-Long
    CHINESE JOURNAL OF ANALYTICAL CHEMISTRY, 2009, 37 (12) : 1834 - 1838
  • [39] Characterization of a Stable Adaptive Calibration Model Using Near-Infrared Spectroscopy and Partial Least Squares with a Kalman Filter
    Mei, Qing-Ping
    Tang, Yi-Ke
    Li, Tai-Fu
    Yao, Li-Zhong
    Yang, Qiong
    Zhang, Heng-Jian
    Liu, Xiao-Hong
    ANALYTICAL LETTERS, 2018, 51 (08) : 1176 - 1193
  • [40] Near infrared quantitative analysis of total curcuminoids in rhizomes of Curcuma longa by moving window partial least squares regression
    Kasemsumran, Sumaporn
    Keeratinijakal, Vichien
    Thanapase, Warunee
    Ozaki, Yukihiro
    JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2010, 18 (04) : 263 - 269