Comparison of random forest, support vector machine and back propagation neural network for electronic tongue data classification: Application to the recognition of orange beverage and Chinese vinegar

被引:235
作者
Liu, Miao [1 ,3 ]
Wang, Mingjun [2 ]
Wang, Jun [1 ]
Li, Duo [3 ]
机构
[1] Zhejiang Univ, Dept Biosyst Engn, Hangzhou 310058, Zhejiang, Peoples R China
[2] Chinese Acad Sci, Tianjin Inst Ind Biotechnol, Tianjin 300308, Peoples R China
[3] Zhejiang Univ, Dept Food Sci & Nutr, Hangzhou 310058, Zhejiang, Peoples R China
关键词
Random forest; Support vector machine; Back propagation neural network; Pattern recognition; Classification; Electronic tongue; COMPOUND CLASSIFICATION; GREEN TEA; ARRAY; DISCRIMINATION; TOOL; WINES; IDENTIFICATION; OPTIMIZATION; REGRESSION; SIGNALS;
D O I
10.1016/j.snb.2012.11.071
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Random forest (RF) has been proposed on the basis of classification and regression trees (CART) with "ensemble learning" strategy by Breiman in 2001. In this paper, RF is introduced and investigated for electronic tongue (E-tongue) data processing. The experiments were designed for type and brand recognition of orange beverage and Chinese vinegar by an E-tongue with seven potentiometric sensors and an Ag/AgCl reference electrode. Principal component analysis (PCA) was used to visualize the distribution of total samples of each data set. Back propagation neural network (BPNN) and support vector machine (SVM), as comparative methods, were also employed to deal with four data sets. Five-fold cross-validation (CV) with twenty replications was applied during modeling and an external testing set was employed to validate the prediction performance of models. The average correct rates (CR) on CV sets of the four data sets performed by BPNN, SVM and RF were 86.68%, 66.45% and 99.07%, respectively. RF has been proved to outperform BPNN and SVM, and has some advantages in such cases, because it can deal with classification problems of unbalanced, multiclass and small sample data without data preprocessing procedures. These results suggest that RF may be a promising pattern recognition method for E-tongues. (c) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:970 / 980
页数:11
相关论文
共 67 条
[1]  
[Anonymous], 2003, Statistical pattern recognition
[2]   Using an e-tongue based on voltammetric electrodes to discriminate among red wines aged in oak barrels or aged using alternative methods -: Correlation between electrochemical signals and analytical parameters [J].
Apetrei, C. ;
Apetrei, I. M. ;
Nevares, I. ;
del Alamo, M. ;
Parra, V. ;
Rodriguez-Mendez, M. L. ;
De Saja, J. A. .
ELECTROCHIMICA ACTA, 2007, 52 (07) :2588-2594
[3]   Developing a powerful In Silico tool for the discovery of novel caspase-3 substrates: a preliminary screening of the human proteome [J].
Ayyash, Muneef ;
Tamimi, Hashem ;
Ashhab, Yaqoub .
BMC BIOINFORMATICS, 2012, 13
[4]   Geographical classification of wine and olive oil by means of classification and influence matrix analysis (CAIMAN) [J].
Ballabio, D ;
Mauri, A ;
Todeschini, R ;
Buratti, S .
ANALYTICA CHIMICA ACTA, 2006, 570 (02) :249-258
[5]   Analysis of tomato taste using two types of electronic tongues [J].
Beullens, Katrien ;
Meszaros, Peter ;
Vermeir, Steven ;
Kirsanov, Dmitriy ;
Legin, Andrey ;
Buysens, Saskia ;
Cap, Nathahe ;
Nicolai, Bart M. ;
Larnmertyn, Jeroen .
SENSORS AND ACTUATORS B-CHEMICAL, 2008, 131 (01) :10-17
[6]   The electronic tongue and ATR-FTIR for rapid detection of sugars and acids in tomatoes [J].
Beullens, Katrien ;
Kirsanov, Dmitriy ;
Irudayaraj, Joseph ;
Rudnitskaya, Alisa ;
Legin, Andrey ;
Nicolai, Bart M. ;
Lammertyn, Jeroen .
SENSORS AND ACTUATORS B-CHEMICAL, 2006, 116 (1-2) :107-115
[7]   A novel approach using Dynamic Social Impact Theory for optimization of impedance-Tongue (iTongue) [J].
Bhondekar, Amol P. ;
Kaur, Rishemjit ;
Kumar, Ritesh ;
Vig, Renu ;
Kapur, Pawan .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2011, 109 (01) :65-76
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Support Vector Machines for classification and regression [J].
Brereton, Richard G. ;
Lloyd, Gavin R. .
ANALYST, 2010, 135 (02) :230-267