A feature-selection algorithm based on Support Vector Machine-Multiclass for hyperspectral visible spectral analysis

被引:37
作者
Deng, Shuiguang [1 ,3 ]
Xu, Yifei [1 ]
Li, Li [1 ]
Li, Xiaoli [2 ,3 ]
He, Yong [2 ,3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
[2] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Hangzhou 310058, Zhejiang, Peoples R China
[3] Zhejiang Univ, Cyrus Tang Ctr Sensor Mat & Applicat, Hangzhou 310058, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Food quality inspection; Feature selection; Hyperspectral visible and near infrared (Vis-NIR); Support Vector Machine-Multiclass Forward Feature Selection (SVM-MFFS); Sesame oil; NEAR-INFRARED SPECTROSCOPY; GENE SELECTION; CLASSIFICATION;
D O I
10.1016/j.jfoodeng.2013.05.024
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Quality and safety of foods is one of the world's top topics. Using high-precision spectral devices is a main technology trends by its high accuracy and nondestructive of food inspection, but the common obstacle is how to extract informative variables from raw data without losing significant information. This article proposes a novel feature selection algorithm named Support Vector Machine-Multiclass Forward Feature Selection (SVM-MFFS). SVM-MFFS adopts the wrapper and forward feature selection strategy, explores the stability of spectral variables, and uses classical SVM as classification and regression model to select the most relevant wavelengths from hundreds of spectral data. We compare SVM-MFFS with Successive Projection Analysis and Uninformative Variable Elimination in the experiment of identifying different brands of sesame oil. The results show that SVM-MFFS outperforms in accuracy, Receiver Operating Characteristic curve, Prediction and Cumulative Stability, and it will provide a reliable and rapid method in food quality inspection. (c) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:159 / 166
页数:8
相关论文
共 27 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]  
[Anonymous], 1989, P C ADV NEUR INF PRO
[3]   The successive projections algorithm for variable selection in spectroscopic multicomponent analysis [J].
Araújo, MCU ;
Saldanha, TCB ;
Galvao, RKH ;
Yoneyama, T ;
Chame, HC ;
Visani, V .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 57 (02) :65-73
[4]   Selection of relevant features and examples in machine learning [J].
Blum, AL ;
Langley, P .
ARTIFICIAL INTELLIGENCE, 1997, 97 (1-2) :245-271
[5]   Elimination of uninformative variables for multivariate calibration [J].
Centner, V ;
Massart, DL ;
deNoord, OE ;
deJong, S ;
Vandeginste, BM ;
Sterna, C .
ANALYTICAL CHEMISTRY, 1996, 68 (21) :3851-3858
[6]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[7]  
Darvin ME, 2004, LASER PHYS, V14, P231
[8]   Gene selection for cancer classification using support vector machines [J].
Guyon, I ;
Weston, J ;
Barnhill, S ;
Vapnik, V .
MACHINE LEARNING, 2002, 46 (1-3) :389-422
[9]  
Guyon I., 2003, J MACH LEARN RES, V3, P1157
[10]   Prediction of soil macronutrients content using near-infrared spectroscopy [J].
He, Yong ;
Huang, Min ;
Garcia, Annia ;
Hernandez, Antihus ;
Song, Haiyan .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2007, 58 (02) :144-153