Non-destructive Prediction of Nicotine Content in Tobacco Using Hyperspectral Image-Derived Spectra and Machine Learning

被引:10
作者
Divyanth, L. G. [1 ]
Chakraborty, Somsubhra [1 ]
Li, Bin [2 ]
Weindorf, David C. [3 ]
Deb, Prithwiraj [4 ]
Gem, Carol Jacob [4 ]
机构
[1] Indian Inst Technol IIT, Agr & Food Engn Dept, Kharagpur 721302, India
[2] Louisiana State Univ, Dept Expt Stat, Baton Rouge, LA USA
[3] Cent Michigan Univ, Dept Earth & Atmospher Sci, Mt Pleasant, MI USA
[4] ITC Ltd, Agri Business Div, Guntur 522004, India
关键词
Chemometrics; Partial least squares regression; Random forest; Support vector regression; Variable importance in projection; FOOD-PRODUCTS; CLASSIFICATION; SPECTROSCOPY; REGRESSION; STALK; SMOKE;
D O I
10.1007/s42853-022-00134-0
中图分类号
S2 [农业工程];
学科分类号
0828 ;
摘要
PurposeRapid prediction of tobacco nicotine content in tobacco industries has become essential to maintain a stable and reliable cigarette quality. This research deals with combining hyperspectral images (HSI) and chemometric models to predict nicotine content in powdered tobacco samples.MethodsFifty-seven dried powdered tobacco leaf samples were scanned using a hyperspectral camera followed by image processing. The region of interest (ROI) was selected for calculating average spectra. The average spectra and the destructive measurements of nicotine concentration in the samples were used to develop four regression models based on partial least squares regression (PLSR), support vector regression (SVR), random forest (RF), and PLSR-variable importance in projection (PLSR-VIP). The models were evaluated using leave-one-out cross-validation (LOOCV) and on 15% test dataset.ResultsThe PLSR outperformed (R2=0.93, RMSE= 0.21%) SVR- and RF-based nicotine prediction models using the entire 970-1700-nm range. Five bands centred at 976.15 nm, 1452 nm, 1575.5 nm, 1592.3 nm, and 1698.9 nm were identified as effective wavelengths for nicotine content prediction and used by the PLSR-variable importance in projection (PLSR-VIP) model to produce satisfactory validation performance (R2=0.91, RMSE= 0.30%). The LOOCV yielded R2 values ranging between 0.89 and 0.93 for the evaluated models.ConclusionThe PLSR-VIP model with 96% fewer wavelengths than the full range PLSR highlighted its potential for a more simplistic nicotine prediction mechanism. The HSI plus chemometric model approach has shown the potential to predict tobacco nicotine content rapidly.
引用
收藏
页码:106 / 117
页数:12
相关论文
共 52 条
  • [1] Using basis expansions for estimating functional PLS regression Applications with chemometric data
    Aguilera, Ana M.
    Escabias, Manuel
    Preda, Cristian
    Saporta, Gilbert
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 104 (02) : 289 - 305
  • [2] Detection of sprout damage in wheat kernels using NIR hyperspectral imaging
    Barbedo, Jayme G. A.
    Guarienti, Eliana M.
    Tibola, Casiane S.
    [J]. BIOSYSTEMS ENGINEERING, 2018, 175 : 124 - 132
  • [3] Soil spectroscopy with the use of chemometrics, machine learning and pre-processing techniques in soil diagnosis: Recent advances-A review
    Barra, Issam
    Haefele, Stephan M.
    Sakrabani, Ruben
    Kebede, Fassil
    [J]. TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2021, 135
  • [4] Benelli A., 2021, In press, DOI [10.1016/j.biosystemseng.2021.08.009, DOI 10.1016/J.BIOSYSTEMSENG.2021.08.009]
  • [5] Near infrared spectroscopic analysis of total alkaloids as nicotine, total nitrogen and total ash in Cuban cigar tobacco
    Borges Miranda, Amaury
    Perez Martinez, Carlos
    Jimenez Chacon, Juan
    Alvarez Prieto, Manuel
    [J]. JOURNAL OF NEAR INFRARED SPECTROSCOPY, 2019, 27 (02) : 123 - 133
  • [6] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Fast analysis of nicotine related alkaloids in tobacco and cigarette smoke by megabore capillary gas chromatography
    Cai, JB
    Liu, BZ
    Lin, P
    Su, QD
    [J]. JOURNAL OF CHROMATOGRAPHY A, 2003, 1017 (1-2) : 187 - 193
  • [8] PLS regression algorithms in the presence of nonlinearity
    Cook, R. Dennis
    Forzani, Liliana
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [9] CORESTA, 2017, Method No. 85-Determination of the content of total alkaloids as nicotine-Continuous flow analysis method using KSCN/DCIC
  • [10] SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION
    DEJONG, S
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) : 251 - 263