Unsupervised training of Bayesian networks for data clustering

被引:23
|
作者
Pham, Duc Truong [1 ]
Ruz, Gonzalo A. [1 ]
机构
[1] Cardiff Univ, Mfg Engn Ctr, Cardiff CF24 3AA, S Glam, Wales
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2009年 / 465卷 / 2109期
关键词
Bayesian networks; clustering; unsupervised training; classification expectation-maximization algorithm; machine learning; DEFECT CLASSIFICATION; EM ALGORITHM; CLASSIFIERS;
D O I
10.1098/rspa.2009.0065
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper presents a new approach to the unsupervised training of Bayesian network classifiers. Three models have been analysed: the Chow and Liu (CL) multinets; the tree-augmented naive Bayes; and a new model called the simple Bayesian network classifier, which is more robust in its structure learning. To perform the unsupervised training of these models, the classification maximum likelihood criterion is used. The maximization of this criterion is derived for each model under the classification expectation-maximization ( EM) algorithm framework. To test the proposed unsupervised training approach, 10 well-known benchmark datasets have been used to measure their clustering performance. Also, for comparison, the results for the k-means and the EM algorithm, as well as those obtained when the three Bayesian network classifiers are trained in a supervised way, are analysed. A real-world image processing application is also presented, dealing with clustering of wood board images described by 165 attributes. Results show that the proposed learning method, in general, outperforms traditional clustering algorithms and, in the wood board image application, the CL multinets obtained a 12 per cent increase, on average, in clustering accuracy when compared with the k-means method and a 7 per cent increase, on average, when compared with the EM algorithm.
引用
收藏
页码:2927 / 2948
页数:22
相关论文
共 50 条
  • [1] Unsupervised bayesian clustering for functional data
    Juery, Damien
    Abraham, Christophe
    Fontez, Benedicte
    JOURNAL OF THE SFDS, 2014, 155 (02): : 185 - 201
  • [2] Unsupervised learning of Bayesian networks via estimation of distribution algorithms:: An application to gene expression data clustering
    Peña, JM
    Lozano, JA
    Larrañaga, P
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2004, 12 : 63 - 82
  • [3] Multipartition clustering of mixed data with Bayesian networks
    Rodriguez-Sanchez, Fernando
    Bielza, Concha
    Larranaga, Pedro
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (03) : 2188 - 2218
  • [4] Deformable Bayesian Networks for Data Clustering and Fusion
    Kampa, Kittipat
    Principe, Jose C.
    Cobb, J. Tory
    Rangarajan, Anand
    DETECTION AND SENSING OF MINES, EXPLOSIVE OBJECTS, AND OBSCURED TARGETS XVI, 2011, 8017
  • [5] Unsupervised non parametric data clustering by means of Bayesian inference and information theory
    Bougeniere, Gilles
    Cariou, Claude
    Chehdi, Kacem
    Gay, Alan
    SIGMAP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2007, : 101 - +
  • [6] Data clustering using hidden variables in hybrid Bayesian networks
    Fernández A.
    Gámez J.A.
    Rumí R.
    Salmerón A.
    Fernández, Antonio, 1600, Springer Verlag (02): : 141 - 152
  • [7] Unsupervised clustering in streaming data
    Tasoulis, Dimitris K.
    Adams, Niall M.
    Hand, David J.
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 638 - +
  • [8] A Hierarchical Bayesian Approach for Unsupervised Cell Phenotype Clustering
    Krishna, Mahesh Venkata
    Denzler, Joachim
    PATTERN RECOGNITION, GCPR 2014, 2014, 8753 : 69 - 80
  • [9] BAYESIAN CLUSTERING FOR UNSUPERVISED ESTIMATION OF SURFACE AND TEXTURE MODELS
    SILVERMAN, JF
    COOPER, DB
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1988, 10 (04) : 482 - 495
  • [10] Unsupervised data pruning for clustering of noisy data
    Hong, Yi
    Kwong, Sam
    Chang, Yuchou
    Ren, Qingsheng
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (07) : 612 - 616