Modeling wine preferences by data mining from physicochemical properties

被引:825
作者
Cortez, Paulo [1 ]
Cerdeira, Antonio [2 ]
Almeida, Fernando [2 ]
Matos, Telmo [2 ]
Reis, Jose [1 ,2 ]
机构
[1] Univ Minho, Dept Informat Syst, R&D Ctr Algoritmi, P-4800058 Guimaraes, Portugal
[2] CVRVV, P-4050501 Oporto, Portugal
关键词
Sensory preferences; Regression; Variable selection; Model selection; Support vector machines; Neural networks; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; CLASSIFICATION; DISCRIMINATION; PARAMETERS;
D O I
10.1016/j.dss.2009.05.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a data mining approach to predict human wine taste preferences that is based on easily available analytical tests at the certification step. A large dataset (when compared to other studies in this domain) is considered, with white and red vinho verde samples (from Portugal). Three regression techniques were applied, under a computationally efficient procedure that performs simultaneous variable and model selection. The support vector machine achieved promising results, Outperforming the multiple regression and neural network methods. Such model is useful to support the oenologist wine tasting evaluations and improve wine production. Furthermore, similar techniques can help in target marketing by modeling consumer tastes from niche markets. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:547 / 553
页数:7
相关论文
共 37 条
[11]  
Ebeler SE, 1999, FLAVOR CHEMISTRY, P409
[12]  
*FAO FAOSTAT, 2008, FOOD AGR ORG AGR TRA
[13]   An optimization approach for scheduling wine grape harvest operations [J].
Ferrer, Juan-Carlos ;
Mac Cawley, Alejandro ;
Maturana, Sergio ;
Toloza, Sergio ;
Vera, Jorge .
INTERNATIONAL JOURNAL OF PRODUCTION ECONOMICS, 2008, 112 (02) :985-999
[14]  
Flexer A., 1996, P 13 EUROPEAN M CYBE, V2, P1005
[15]   Classifier technology and the illusion of progress [J].
Hand, David J. .
STATISTICAL SCIENCE, 2006, 21 (01) :1-14
[16]  
Hastie T., 2009, The elements of statistical learning: data mining, inference, and prediction, P9
[17]   Modeling consumer situational choice of long distance communication with neural networks [J].
Hu, Michael Y. ;
Shanker, Murali ;
Zhang, G. Peter ;
Hung, Ming S. .
DECISION SUPPORT SYSTEMS, 2008, 44 (04) :899-908
[18]   Credit rating analysis with support vector machines and neural networks: a market comparative study [J].
Huang, Z ;
Chen, HC ;
Hsu, CJ ;
Chen, WH ;
Wu, SS .
DECISION SUPPORT SYSTEMS, 2004, 37 (04) :543-558
[19]   Data strip mining for the virtual design of pharmaceuticals with neural networks [J].
Kewley, RH ;
Embrechts, MJ ;
Breneman, C .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :668-679
[20]   A comparative assessment of classification methods [J].
Kiang, MY .
DECISION SUPPORT SYSTEMS, 2003, 35 (04) :441-454