Origin identification of Cornus officinalis based on PCA-SVM combined model

被引:8
作者
Jin, Yueqiang [1 ]
Liu, Bing [1 ]
Li, Chaoning [2 ]
Shi, Shasha [3 ]
机构
[1] Nanjing Vocat Univ Ind Technol, Publ Fdn Courses Dept, Nanjing, Peoples R China
[2] Nanjing Changxingyang Intelligent Home Co Ltd, Res & Dev Dept, Nanjing, Peoples R China
[3] Jiangsu Ocean Univ, Sch Sci, Lianyungang, Peoples R China
关键词
CHINESE HERBAL MEDICINE; PRINCIPAL COMPONENT ANALYSIS; GAS-CHROMATOGRAPHY; CAPILLARY-ELECTROPHORESIS; TERAHERTZ SPECTROSCOPY; EXTRACTION; FUSION;
D O I
10.1371/journal.pone.0282429
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Infrared spectroscopy can quickly and non-destructively extract analytical information from samples. It can be applied to the authenticity identification of various Chinese herbal medicines, the prediction of the mixing amount of defective products, and the analysis of the origin. In this paper, the spectral information of Cornus officinalis from 11 origins was used as the research object, and the origin identification model of Cornus officinalis based on mid-infrared spectroscopy was established. First, principal component analysis was used to extract the absorbance data of Cornus officinalis in the wavenumber range of 551 similar to 3998 cm(-1). The extracted principal components contain more than 99.8% of the information of the original data. Second, the extracted principal component information was used as input, and the origin category was used as output, and the origin identification model was trained with the help of support vector machine. In this paper, this combined model is called PCA-SVM combined model. Finally, the generalization ability of the PCA-SVM model is evaluated through an external test set. The three indicators of Accuracy, F1-Score, and Kappa coefficient are used to compare this model with other commonly used classification models such as naive Bayes model, decision trees, linear discriminant analysis, radial basis function neural network and partial least square discriminant analysis. The results show that PCA-SVM model is superior to other commonly used models in accuracy, F1 score and Kappa coefficient. In addition, compared with the SVM model with full spectrum data, the PCA-SVM model not only reduces the redundant variables in the model, but also has higher accuracy. Using this model to identify the origin of Cornus officinalis, the accuracy rate is 84.8%.
引用
收藏
页数:20
相关论文
共 54 条
[1]   Detection of Melamine in Foods Using Terahertz Time-Domain Spectroscopy [J].
Baek, Seung Hyun ;
Lim, Heung Bin ;
Chun, Hyang Sook .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2014, 62 (24) :5403-5407
[2]   Application of artificial neural networks in the geographical identification of coffee samples [J].
Borsato, Dionisio ;
Roberto Pina, Marcos Vinicios ;
Spacino, Kelly Roberta ;
dos Santos Scholz, Maria Brigida ;
Androcioli Filho, Armando .
EUROPEAN FOOD RESEARCH AND TECHNOLOGY, 2011, 233 (03) :533-543
[3]  
Broomhead D.S., 1988, RSRE, P4148
[4]   Qualitative analysis of a sulfur-fumigated Chinese herbal medicine by comprehensive two-dimensional gas chromatography and high-resolution time of flight mass spectrometry using colorized fuzzy difference data processing [J].
Cai, Hao ;
Cao, Gang ;
Zhang, Hong-yan .
CHINESE JOURNAL OF INTEGRATIVE MEDICINE, 2017, 23 (04) :261-269
[5]   Application of near infrared spectroscopy combined with SVR algorithm in rapid detection of cAMP content in red jujube [J].
Chen, Chen ;
Li, Hongyi ;
Lv, Xiaoyi ;
Tang, Jun ;
Chen, Cheng ;
Zheng, Xiangxiang .
OPTIK, 2019, 194
[6]   A novel diagnostic method: FT-IR, Raman and derivative spectroscopy fusion technology for the rapid diagnosis of renal cell carcinoma serum [J].
Chen, Cheng ;
Chen, Fangfang ;
Yang, Bo ;
Zhang, Kai ;
Lv, Xiaoyi ;
Chen, Chen .
SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2022, 269
[7]   Raman spectroscopy combined with multiple algorithms for analysis and rapid screening of chronic renal failure [J].
Chen, Cheng ;
Yang, Li ;
Li, Hongyi ;
Chen, Fangfang ;
Chen, Chen ;
Gao, Rui ;
Lv, X. Y. ;
Tang, Jun .
PHOTODIAGNOSIS AND PHOTODYNAMIC THERAPY, 2020, 30
[8]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[9]   Comprehensive determination of polycyclic aromatic hydrocarbons in Chinese herbal medicines by solid phase extraction and gas chromatography coupled to tandem mass spectrometry [J].
Cui, Zongyan ;
Ge, Na ;
Zhang, Ang ;
Liu, Yongming ;
Zhang, Jinjie ;
Cao, Yanzhong .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2015, 407 (07) :1989-1997
[10]   Twin support vector machine: theory, algorithm and applications [J].
Ding, Shifei ;
Zhang, Nan ;
Zhang, Xiekai ;
Wu, Fulin .
NEURAL COMPUTING & APPLICATIONS, 2017, 28 (11) :3119-3130