Bayesian model averaging of Bayesian network classifiers over multiple node-orders: Application to sparse datasets

被引:16
|
作者
Hwang, KB [1 ]
Zhang, BT [1 ]
机构
[1] Seoul Natl Univ, Sch Engn & Comp Sci, Seoul 151742, South Korea
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2005年 / 35卷 / 06期
关键词
Bayesian model averaging (BMA); Bayesian networks; classification; Markov chain Monte Carlo (MCMC); sparse data;
D O I
10.1109/TSMCB.2005.850162
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bayesian model averaging (BMA) can resolve the overfilling problem by explicitly incorporating the model uncertainty into the analysis procedure. Hence, it can be used to improve the generalization performance of Bayesian network classifiers. Until now, BMA of Bayesian network classifiers has only been performed in some restricted forms, e.g., the model is averaged given a single node-order, because of its heavy computational burden. However, it can be hard to obtain a good node-order when the available training dataset is sparse. To alleviate this problem, we propose BMA of Bayesian network classifiers over several distinct node-orders obtained using the Markov chain Monte Carlo sampling technique. The proposed method was examined using two synthetic problems and four real-life datasets. First, we show that the proposed method is especially effective when the given dataset is very sparse. The classification accuracy of averaging over multiple node-orders was higher in most cases than that achieved using a single node-order in our experiments. We also present experimental results for test datasets with unobserved variables, where the quality of the averaged node-order is more important. Through these experiments, we show that the difference in classification performance between the cases of multiple node-orders and single node-order is related to the level of noise, confirming the relative benefit of averaging over multiple node-orders for incomplete data. We conclude that BMA of Bayesian network classifiers over multiple node-orders has an apparent advantage when the given dataset is sparse and noisy, despite the method's heavy computational cost.
引用
收藏
页码:1302 / 1310
页数:9
相关论文
共 24 条
  • [1] Bayesian Network Model Averaging Classifiers by Subbagging
    Sugahara, Shouta
    Aomi, Itsuki
    Ueno, Maomi
    ENTROPY, 2022, 24 (05)
  • [2] Bayesian Network Model Averaging Classifiers by Subbagging
    Sugahara, Shouta
    Aomi, Itsuki
    Ueno, Maomi
    INTERNATIONAL CONFERENCE ON PROBABILISTIC GRAPHICAL MODELS, VOL 138, 2020, 138 : 461 - 472
  • [3] Bayesian network Motifs for reasoning over heterogeneous unlinked datasets
    Sui, Yi
    Kwan, Alex
    Olson, Alexander W.
    Sanner, Scott
    Silver, Daniel A.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 3643 - 3689
  • [4] Hybrid Bayesian network classifiers: Application to species distribution models
    Aguilera, P. A.
    Fernandez, A.
    Reche, F.
    Rumi, R.
    ENVIRONMENTAL MODELLING & SOFTWARE, 2010, 25 (12) : 1630 - 1639
  • [5] Bayesian Model Averaging of Load Demand Forecasts from Neural Network Models
    Hassan, Saima
    Khosravi, Abbas
    Jaafar, Jafreezal
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 3192 - 3197
  • [6] Bayesian averaging over decision tree models: An application for estimating uncertainty in trauma severity scoring
    Schetinin, V.
    Jakaite, L.
    Krzanowski, W.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2018, 112 : 6 - 14
  • [7] Construction and application of Bayesian network model for spatial data mining
    Huang, Jiejun
    Yuan, Yanbin
    2007 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-7, 2007, : 914 - 917
  • [8] A hybrid method for learning multi-dimensional Bayesian network classifiers based on an optimization model
    Mingmin Zhu
    Sanyang Liu
    Jiewei Jiang
    Applied Intelligence, 2016, 44 : 123 - 148
  • [9] A hybrid method for learning multi-dimensional Bayesian network classifiers based on an optimization model
    Zhu, Mingmin
    Liu, Sanyang
    Jiang, Jiewei
    APPLIED INTELLIGENCE, 2016, 44 (01) : 123 - 148
  • [10] A Bayesian Sparse Generalized Linear Model With an Application to Multiscale Covariate Discovery for Observed Rainfall Extremes Over the United States
    Das, Debasish
    Ganguly, Auroop R.
    Obradovic, Zoran
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (12): : 6689 - 6702