Clustering and Classification of Red Wines According to Physical-Chemical Properties Using Data Mining Methods

被引:0
作者
Bondarev, N. V. [1 ]
机构
[1] VN Karazin Karazin Kharkiv Natl Univ, UA-61022 Kharkiv, Ukraine
关键词
red wines; clustering; classification; data mining; STATISTICA; STABILITY; CATION;
D O I
10.1134/S1070363223090141
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The data on 178 samples of Italian red wines taken from the public machine learning repository UCI have been studied. Computer analysis of 13 physico-chemical properties of the wine samples on their distribution between three groups has been performed via different data mining methods. Classification models: factor, discriminant, canonical, and neural network (multilayer perceptron MLP, Kohonen's map SOFM) ones, predicting models (support vector machine, Bayesian classifier, and nearest neighbor method), and decision trees have been built. The neural network classifiers SOFM 13-3, MLP 13-5-3, and SOFM 16-3 have been trained. It has been found that predicting power of the models is determined by the following variables: proline, flavonoids, color intensity, proteins, and alcohol.
引用
收藏
页码:2325 / 2339
页数:15
相关论文
共 27 条