A cluster approach to analyze preference data: Choice of the number of clusters

被引：12

作者：

Sahmer, K ^{[1
]}

Vigneau, E ^{[1
]}

Qannari, EM ^{[1
]}

机构：

[1] ENITIAA, INRA, Unite Sensometrie & Chimiometrie, F-44322 Nantes 03, France

来源：

FOOD QUALITY AND PREFERENCE | 2006年 / 17卷 / 3-4期

关键词：

bootstrap; clustering; cluster tendency; cluster validity; preference data;

D O I：

10.1016/j.foodqual.2005.03.007

中图分类号：

TS2 [食品工业];

学科分类号：

0832 ;

摘要：

We consider the clustering of a panel of consumers according to their scores of liking. The procedure is based on a cluster of variables approach proposed by Vigneau et al. [Vigneau, E., Qannari, E. M., Punter, P. H., & Knoops, S. (2001). Segmentation of a panel of consumers using clustering of variables around latent directions of preference. Food Quality and Preference, 12, 259-363]. We aim at setting up a hypothesis-testing framework in order to determine the appropriate number of clusters. The procedure consists of two steps. Firstly, a cluster tendency test determines if there is more than one cluster. Secondly, a hierarchical algorithm is performed and cluster validity tests at the different levels of the hierarchy indicate the appropriate number of clusters. Once the number of clusters is determined, a partitioning algorithm is implemented by considering as a starting point the partition obtained from the hierarchical algorithm. We illustrate the method on preference data from a European sensory and consumer study on coffee [ESN (1996). A European sensory and consumer study: A case study on coffee. European Sensory Network] and we undergo a simulation study in order to assess the efficiency of the procedure. 2005 Elsevier Ltd. All rights reserved.

引用

页码：257 / 265

页数：9

共 50 条

[21] Effects of Resampling in Determining the Number of Clusters in a Data Set
Rainer Dangl
Friedrich Leisch
Journal of Classification, 2020, 37 : 558 - 583
[22] Effects of Resampling in Determining the Number of Clusters in a Data Set
Dangl, Rainer
Leisch, Friedrich
JOURNAL OF CLASSIFICATION, 2020, 37 (03) : 558 - 583
[23] Clustering of fMRI data: the elusive optimal number of clusters
Seghier, Mohamed L.
PEERJ, 2018, 6
[24] Categorical Data Clustering with Automatic Selection of Cluster Number
Liao, Hai-Yong
Ng, Michael K.
FUZZY INFORMATION AND ENGINEERING, 2009, 1 (01) : 5 - 25
[25] Multilevel models for cost-effectiveness analyses that use cluster randomised trial data: An approach to model choice
Ng, Edmond S-W
Diaz-Ordaz, Karla
Grieve, Richard
Nixon, Richard M.
Thompson, Simon G.
Carpenter, James R.
STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2036 - 2052
[26] Revised DBSCAN algorithm to cluster data with dense adjacent clusters
Tran, Thanh N.
Drab, Klaudia
Daszykowski, Michal
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 120 : 92 - 96
[27] Automatic Determination of the Appropriate Number of Clusters for Multispectral Image Data
Koonsanit, Kitti
Jaruskulchai, Chuleerat
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1256 - 1263
[28] Estimating the number of clusters in a data set via the gap statistic
Tibshirani, R
Walther, G
Hastie, T
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 : 411 - 423
[29] An Automatic Approach for Solving Clustering Problem with the Number of Clusters Unknown
Dong, Jinxin
Qi, Minyong
2010 SECOND ETP/IITA WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING, 2010, : 282 - 285
[30] Determining the number of clusters using information entropy for mixed data
Liang, Jiye
Zhao, Xingwang
Li, Deyu
Cao, Fuyuan
Dang, Chuangyin
PATTERN RECOGNITION, 2012, 45 (06) : 2251 - 2265

← 1 2 3 4 5 →