A cluster approach to analyze preference data: Choice of the number of clusters

Cited by: 12
Authors
Sahmer, K [1]
Vigneau, E [1]
Qannari, EM [1]
Affiliations
[1] ENITIAA, INRA, Unite Sensometrie & Chimiometrie, F-44322 Nantes 03, France
Keywords
bootstrap; clustering; cluster tendency; cluster validity; preference data
DOI
10.1016/j.foodqual.2005.03.007
Chinese Library Classification number
TS2 [Food industry]
Discipline classification code
0832
Abstract
We consider the clustering of a panel of consumers according to their liking scores. The procedure is based on the clustering of variables approach proposed by Vigneau et al. [Vigneau, E., Qannari, E. M., Punter, P. H., & Knoops, S. (2001). Segmentation of a panel of consumers using clustering of variables around latent directions of preference. Food Quality and Preference, 12, 359-363]. We aim to set up a hypothesis-testing framework for determining the appropriate number of clusters. The procedure consists of two steps. First, a cluster tendency test determines whether there is more than one cluster. Second, a hierarchical algorithm is run, and cluster validity tests at the different levels of the hierarchy indicate the appropriate number of clusters. Once the number of clusters is determined, a partitioning algorithm is run, using the partition obtained from the hierarchical algorithm as its starting point. We illustrate the method on preference data from a European sensory and consumer study on coffee [ESN (1996). A European sensory and consumer study: A case study on coffee. European Sensory Network] and we carry out a simulation study to assess the efficiency of the procedure. © 2005 Elsevier Ltd. All rights reserved.
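The last stage described in the abstract (a hierarchical partition used as the starting point of a partitioning algorithm) can be sketched roughly as follows. This is only an illustrative stand-in, not the authors' method: it uses plain centroid-linkage agglomeration and k-means-style reassignment on Euclidean distances rather than the clustering-of-variables criterion of Vigneau et al., it omits the cluster tendency and cluster validity tests, and it takes the number of clusters k as given. The function names and the synthetic liking scores are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two score vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid(rows):
    """Coordinate-wise mean of a non-empty list of score vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def hierarchical_partition(data, k):
    """Naive agglomerative clustering (centroid linkage), stopped at k clusters."""
    clusters = [[i] for i in range(len(data))]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            ca = centroid([data[i] for i in clusters[a]])
            for b in range(a + 1, len(clusters)):
                cb = centroid([data[i] for i in clusters[b]])
                d = sq_dist(ca, cb)
                if best is None or d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] += clusters.pop(b)  # merge the two closest clusters
    labels = [0] * len(data)
    for c, members in enumerate(clusters):
        for i in members:
            labels[i] = c
    return labels

def refine_partition(data, labels, k, max_iter=100):
    """k-means-style reassignment that starts from the given partition."""
    labels = list(labels)
    n = len(data)
    centers = [centroid([data[i] for i in range(n) if labels[i] == c])
               for c in range(k)]
    for _ in range(max_iter):
        new = [min(range(k), key=lambda c: sq_dist(row, centers[c]))
               for row in data]
        if new == labels:
            break  # partition is stable
        labels = new
        for c in range(k):
            members = [data[i] for i in range(n) if labels[i] == c]
            if members:  # keep the old center if a cluster empties
                centers[c] = centroid(members)
    return labels

# Hypothetical data: 12 consumers scoring 4 products, two opposite liking patterns.
rng = random.Random(0)
pattern_a, pattern_b = [8, 7, 3, 2], [2, 3, 7, 8]
scores = [[s + rng.uniform(-1, 1) for s in pattern_a] for _ in range(6)]
scores += [[s + rng.uniform(-1, 1) for s in pattern_b] for _ in range(6)]

k = 2  # assumed known here; the paper chooses it with hypothesis tests
initial = hierarchical_partition(scores, k)
final = refine_partition(scores, initial, k)
print(final)
```

Starting the reassignment step from the hierarchical partition, rather than from random centers, makes the result deterministic and lets the partitioning step repair early merges that the hierarchy cannot undo.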
Pages: 257-265
Number of pages: 9
Related papers
50 in total
  • [31] A Support System for Clustering Data Streams with a Variable Number of Clusters
    Silva, Jonathan de Andrade
    Hruschka, Eduardo Raul
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2016, 11 (02)
  • [32] An evolutionary algorithm for clustering data streams with a variable number of clusters
    Silva, Jonathan de Andrade
    Hruschka, Eduardo Raul
    Gama, Joao
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 67 : 228 - 238
  • [33] Efficient estimation of the number of clusters for high-dimension data
    Kasapis, Spiridon
    Zhang, Geng
    Smereka, Jonathon M.
    Vlahopoulos, Nickolas
    JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2023,
  • [34] A Cluster-Based Data Fusion Technique to Analyze Big Data in Wireless Multi-Sensor System
    Din, Sadia
    Ahmad, Awais
    Paul, Anand
    Rathore, Muhammad Mazhar Ullah
    Jeon, Gwanggil
    IEEE ACCESS, 2017, 5 : 5069 - 5083
  • [35] A New Approach to Determine the Optimal Number of Clusters Based on the Gap Statistic
    Yang, Jaekyung
    Lee, Jong-Yeong
    Choi, Myoungjin
    Joo, Yeongin
    MACHINE LEARNING FOR NETWORKING (MLN 2019), 2020, 12081 : 227 - 239
  • [36] Clustering Categorical Data: A Cluster Ensemble Approach
    何增友
    High Technology Letters, 2003, (04) : 8 - 12
  • [37] Variance estimation for clustered recurrent event data with a small number of clusters
    Schaubel, DE
    STATISTICS IN MEDICINE, 2005, 24 (19) : 3037 - 3051
  • [38] Sequential clustering with particle filters - Estimating the number of clusters from data
    Schubert, J
    Sidenbladh, H
    2005 7TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), VOLS 1 AND 2, 2005, : 122 - 129
  • [39] A Meta-learning approach for recommending the number of clusters for clustering algorithms
    Pimentel, Bruno Almeida
    de Carvalho, Andre C. P. L. F.
    KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [40] Automatic Estimation of Cluster Number in Fuzzy Co-clustering Based on Competition and Elimination of Clusters
    Ubukata, Seiki
    Yanagisawa, Kazuki
    Notsu, Akira
    Honda, Katsuhiro
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 660 - 665