A cluster approach to analyze preference data: Choice of the number of clusters

被引:12
|
作者
Sahmer, K [1 ]
Vigneau, E [1 ]
Qannari, EM [1 ]
机构
[1] ENITIAA, INRA, Unite Sensometrie & Chimiometrie, F-44322 Nantes 03, France
关键词
bootstrap; clustering; cluster tendency; cluster validity; preference data;
D O I
10.1016/j.foodqual.2005.03.007
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
We consider the clustering of a panel of consumers according to their scores of liking. The procedure is based on a cluster of variables approach proposed by Vigneau et al. [Vigneau, E., Qannari, E. M., Punter, P. H., & Knoops, S. (2001). Segmentation of a panel of consumers using clustering of variables around latent directions of preference. Food Quality and Preference, 12, 259-363]. We aim at setting up a hypothesis-testing framework in order to determine the appropriate number of clusters. The procedure consists of two steps. Firstly, a cluster tendency test determines if there is more than one cluster. Secondly, a hierarchical algorithm is performed and cluster validity tests at the different levels of the hierarchy indicate the appropriate number of clusters. Once the number of clusters is determined, a partitioning algorithm is implemented by considering as a starting point the partition obtained from the hierarchical algorithm. We illustrate the method on preference data from a European sensory and consumer study on coffee [ESN (1996). A European sensory and consumer study: A case study on coffee. European Sensory Network] and we undergo a simulation study in order to assess the efficiency of the procedure. 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:257 / 265
页数:9
相关论文
共 50 条
  • [21] Effects of Resampling in Determining the Number of Clusters in a Data Set
    Rainer Dangl
    Friedrich Leisch
    Journal of Classification, 2020, 37 : 558 - 583
  • [22] Effects of Resampling in Determining the Number of Clusters in a Data Set
    Dangl, Rainer
    Leisch, Friedrich
    JOURNAL OF CLASSIFICATION, 2020, 37 (03) : 558 - 583
  • [23] Clustering of fMRI data: the elusive optimal number of clusters
    Seghier, Mohamed L.
    PEERJ, 2018, 6
  • [24] Categorical Data Clustering with Automatic Selection of Cluster Number
    Liao, Hai-Yong
    Ng, Michael K.
    FUZZY INFORMATION AND ENGINEERING, 2009, 1 (01) : 5 - 25
  • [25] Multilevel models for cost-effectiveness analyses that use cluster randomised trial data: An approach to model choice
    Ng, Edmond S-W
    Diaz-Ordaz, Karla
    Grieve, Richard
    Nixon, Richard M.
    Thompson, Simon G.
    Carpenter, James R.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2036 - 2052
  • [26] Revised DBSCAN algorithm to cluster data with dense adjacent clusters
    Tran, Thanh N.
    Drab, Klaudia
    Daszykowski, Michal
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2013, 120 : 92 - 96
  • [27] Automatic Determination of the Appropriate Number of Clusters for Multispectral Image Data
    Koonsanit, Kitti
    Jaruskulchai, Chuleerat
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (05): : 1256 - 1263
  • [28] Estimating the number of clusters in a data set via the gap statistic
    Tibshirani, R
    Walther, G
    Hastie, T
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 : 411 - 423
  • [29] An Automatic Approach for Solving Clustering Problem with the Number of Clusters Unknown
    Dong, Jinxin
    Qi, Minyong
    2010 SECOND ETP/IITA WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING, 2010, : 282 - 285
  • [30] Determining the number of clusters using information entropy for mixed data
    Liang, Jiye
    Zhao, Xingwang
    Li, Deyu
    Cao, Fuyuan
    Dang, Chuangyin
    PATTERN RECOGNITION, 2012, 45 (06) : 2251 - 2265