A cluster approach to analyze preference data: Choice of the number of clusters

Cited by: 12
Authors
Sahmer, K [1 ]
Vigneau, E [1 ]
Qannari, EM [1 ]
Affiliations
[1] ENITIAA, INRA, Unite Sensometrie & Chimiometrie, F-44322 Nantes 03, France
Keywords
bootstrap; clustering; cluster tendency; cluster validity; preference data;
DOI
10.1016/j.foodqual.2005.03.007
Chinese Library Classification (CLC) number
TS2 [Food industry];
Discipline classification code
0832;
Abstract
We consider the clustering of a panel of consumers according to their liking scores. The procedure is based on the clustering of variables approach proposed by Vigneau et al. [Vigneau, E., Qannari, E. M., Punter, P. H., & Knoops, S. (2001). Segmentation of a panel of consumers using clustering of variables around latent directions of preference. Food Quality and Preference, 12, 359-363]. We aim to set up a hypothesis-testing framework for determining the appropriate number of clusters. The procedure consists of two steps. First, a cluster tendency test determines whether there is more than one cluster. Second, a hierarchical algorithm is run, and cluster validity tests at the different levels of the hierarchy indicate the appropriate number of clusters. Once the number of clusters is determined, a partitioning algorithm is run, taking the partition obtained from the hierarchical algorithm as its starting point. We illustrate the method on preference data from a European sensory and consumer study on coffee [ESN (1996). A European sensory and consumer study: A case study on coffee. European Sensory Network], and we carry out a simulation study to assess the efficiency of the procedure. © 2005 Elsevier Ltd. All rights reserved.
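The last stage described in the abstract (a partitioning algorithm started from a hierarchical partition) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the average-linkage correlation distance, the use of the cluster mean as latent variable, the covariance-based reassignment rule, and the function names `hierarchical_start` and `refine_partition` are all assumptions made for illustration, and the cluster tendency and cluster validity tests are not reproduced here.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def hierarchical_start(scores, k):
    """Initial partition of consumers (columns) into at most k clusters.

    Assumption: average linkage on correlation distance stands in for
    the hierarchical algorithm mentioned in the abstract.
    """
    X = scores - scores.mean(axis=0)  # centre each consumer's scores
    Z = linkage(X.T, method="average", metric="correlation")
    return fcluster(Z, t=k, criterion="maxclust") - 1  # labels start at 0


def refine_partition(scores, labels, n_iter=50):
    """Reassign consumers to the cluster whose latent variable they covary with most.

    Assumption: the latent variable of a cluster is the mean of the
    centred scores of its consumers.
    """
    X = scores - scores.mean(axis=0)
    labels = np.asarray(labels).copy()
    k = labels.max() + 1
    for _ in range(n_iter):
        latent = np.column_stack(
            [X[:, labels == g].mean(axis=1) for g in range(k)]
        )
        cov = X.T @ latent / X.shape[0]  # consumers x clusters
        new_labels = cov.argmax(axis=1)
        # Stop at convergence, or keep the previous partition if a cluster would empty.
        if np.array_equal(new_labels, labels) or len(np.unique(new_labels)) < k:
            break
        labels = new_labels
    return labels


# Synthetic example (hypothetical data): 8 products, two groups of
# 10 consumers with opposite directions of preference.
rng = np.random.default_rng(0)
direction = rng.normal(size=8)
group_a = direction[:, None] + 0.2 * rng.normal(size=(8, 10))
group_b = -direction[:, None] + 0.2 * rng.normal(size=(8, 10))
scores = np.hstack([group_a, group_b])

start = hierarchical_start(scores, k=2)
final = refine_partition(scores, start)
print(final)
```

In this sketch the number of clusters `k` is supplied by hand; in the procedure described above it would come from the hypothesis tests.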
Pages: 257-265
Page count: 9
Related papers
50 in total
  • [41] Generalized Self-Organizing Maps for Automatic Determination of the Number of Clusters and Their Multiprototypes in Cluster Analysis
    Gorzalczany, Marian B.
    Rudzinski, Filip
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (07) : 2833 - 2845
  • [42] Polygonization of point clusters through cluster boundary extraction for geographical data mining
    Lee, I
    Estivill-Castro, V
    ADVANCES IN SPATIAL DATA HANDLING, 2002, : 27 - 40
  • [43] An optimization approach to cluster data based on aggregate function
    Wang, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS 1 AND 2, 2004, : 271 - 274
  • [44] Estimating the number of clusters in a numerical data set via quantization error modeling
    Kolesnikov, Alexander
    Trichina, Elena
    Kauranne, Tuomo
    PATTERN RECOGNITION, 2015, 48 (03) : 941 - 952
  • [45] On clustering uncertain and structured data with Wasserstein barycenters and a geodesic criterion for the number of clusters
    Papayiannis, G. I.
    Domazakis, G. N.
    Drivaliaris, D.
    Koukoulas, S.
    Tsekrekos, A. E.
    Yannacopoulos, A. N.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2021, 91 (13) : 2569 - 2594
  • [46] A New Online Clustering Approach for Data in Arbitrary Shaped Clusters
    Hyde, Richard
    Angelov, Plamen
    2015 IEEE 2ND INTERNATIONAL CONFERENCE ON CYBERNETICS (CYBCONF), 2015, : 228 - 233
  • [47] A new approach to generate diversified clusters for small data sets
    Peng, Chun-Cheng
    Tsai, Cheng-Jung
    Chang, Ting-Yi
    Yeh, Jen-Yuan
    Hua, Po-Wei
    APPLIED SOFT COMPUTING, 2020, 95 (95)
  • [48] A Clustering Approach for Discovering Intrinsic Clusters in Multivariate Geostatistical Data
    Fouedjio, Francky
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION (MLDM 2016), 2016, 9729 : 491 - 500
  • [49] An integrated approach for market segmentation and visualization based on consumers' preference data
    Lv, Y
    Guo, G
    Cheng, D
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1701 - 1710
  • [50] Recovering the number of clusters in data sets with noise features using feature rescaling factors
    de Amorim, Renato Cordeiro
    Hennig, Christian
    INFORMATION SCIENCES, 2015, 324 : 126 - 145