A cluster approach to analyze preference data: Choice of the number of clusters

被引:12
|
作者
Sahmer, K [1 ]
Vigneau, E [1 ]
Qannari, EM [1 ]
机构
[1] ENITIAA, INRA, Unite Sensometrie & Chimiometrie, F-44322 Nantes 03, France
关键词
bootstrap; clustering; cluster tendency; cluster validity; preference data;
D O I
10.1016/j.foodqual.2005.03.007
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
We consider the clustering of a panel of consumers according to their scores of liking. The procedure is based on a cluster of variables approach proposed by Vigneau et al. [Vigneau, E., Qannari, E. M., Punter, P. H., & Knoops, S. (2001). Segmentation of a panel of consumers using clustering of variables around latent directions of preference. Food Quality and Preference, 12, 259-363]. We aim at setting up a hypothesis-testing framework in order to determine the appropriate number of clusters. The procedure consists of two steps. Firstly, a cluster tendency test determines if there is more than one cluster. Secondly, a hierarchical algorithm is performed and cluster validity tests at the different levels of the hierarchy indicate the appropriate number of clusters. Once the number of clusters is determined, a partitioning algorithm is implemented by considering as a starting point the partition obtained from the hierarchical algorithm. We illustrate the method on preference data from a European sensory and consumer study on coffee [ESN (1996). A European sensory and consumer study: A case study on coffee. European Sensory Network] and we undergo a simulation study in order to assess the efficiency of the procedure. 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:257 / 265
页数:9
相关论文
共 50 条
  • [1] A New Approach to Cluster Datasets without Prior Knowledge of Number of Clusters
    Swapna, Ch Swetha
    Kuma, V. V.
    Murthy, J. V. R.
    JOURNAL OF SCIENTIFIC & INDUSTRIAL RESEARCH, 2015, 74 (05): : 261 - 264
  • [2] EVALUATION OF COEFFICIENTS FOR DETERMINING THE OPTIMAL NUMBER OF CLUSTERS IN CLUSTER ANALYSIS ON REAL DATA SETS
    Loster, Tomas
    9TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2015, : 1014 - 1023
  • [3] ENTROPY-BASED CLUSTER VALIDATION AND ESTIMATION OF THE NUMBER OF CLUSTERS IN GENE EXPRESSION DATA
    Novoselova, Natalia
    Tom, Igor
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2012, 10 (05)
  • [4] DETERMINING THE OPTIMAL NUMBER OF CLUSTERS IN CLUSTER ANALYSIS
    Loster, Tomas
    10TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2016, : 1078 - 1090
  • [5] Automatically Determining the Number of Clusters in Unlabeled Data Sets
    Wang, Liang
    Leckie, Christopher
    Ramamohanarao, Kotagiri
    Bezdek, James
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (03) : 335 - 350
  • [6] Joint Cluster Analysis of Attribute and Relationship Data Without A-Priori Specification of the Number of Clusters
    Moser, Flavia
    Ge, Rong
    Ester, Martin
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 510 - 519
  • [7] A new cluster validity index for data with merged clusters and different densities
    Lam, B
    Yan, H
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 798 - 803
  • [8] Investigating cluster validation metrics for optimal number of clusters determination
    Karanikola, Aikaterini
    Liapis, Charalampos M.
    Kotsiantis, Sotiris
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2021, 15 (04): : 809 - 824
  • [9] Finding Clusters of Data: Cluster Analysis in R
    Narang, Tulika
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, FICTA 2016, VOL 1, 2017, 515 : 635 - 640
  • [10] Nbclust: An R Package for Determining the Relevant Number of Clusters in a Data Set
    Charrad, Malika
    Ghazzali, Nadia
    Boiteau, Veronique
    Niknafs, Azam
    JOURNAL OF STATISTICAL SOFTWARE, 2014, 61 (06): : 1 - 36