Learning Markov Blanket Bayesian Network for Big Data in MapReduce

被引:0
|
作者
Che, Yuxin [1 ]
Hong, Shaohui [1 ]
Zhang, Defu [1 ]
Zhang, Liming [2 ]
机构
[1] Xiamen Univ, Dept Comp Sci, Xiamen 361005, Peoples R China
[2] Univ Macau, Dept Comp Informat Sci, Macau, Peoples R China
关键词
Big Data; MapReduce; Bayesian Network; Markov blanket; Data Mining; CLASSIFICATION;
D O I
10.1109/ICTAI.2016.135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A challenge task of data mining is to process massive data in the big data era. MapReduce is an attractive model to overcome this challenge. This paper presents a new method to accelerate the process of learning Markov blanket Bayesian network(MBBN). Markov blanket is a better model type of Bayesian network in some complex datasets. The time and space cost of learning Markov blanket is large, and grows fast as the variables increase. Large amounts of data are needed for its independence test which makes the problem harder. The statistical phase and independence test are parallelized to make it find an appropriate relation among variables in the MapReduce framework. Computational results are reported by testing four datasets and show that the speed-up can be obtained by means of MapReduce. In particular, the Markov blanket in MapReduce has higher accuracy rate than naive Bayesian and tree-augmented naive Bayesian.
引用
收藏
页码:896 / 900
页数:5
相关论文
共 50 条
  • [41] Online Markov Blanket Learning with Group Structure
    Li, Bo
    Ling, Zhaolong
    Zhang, Yiwen
    Zhou, Yong
    Hu, Yimin
    Ling, Haifeng
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01): : 33 - 48
  • [42] Feature Selection by Efficient Learning of Markov Blanket
    Fu, Shunkai
    Desmarais, Michel
    WORLD CONGRESS ON ENGINEERING, WCE 2010, VOL I, 2010, : 302 - 308
  • [43] Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Peking University
    Si, Jiaqi
    Guo, Junyi
    Hao, Zhewen
    He, Wenyang
    Li, Ruihan
    Pan, Yueyang
    Fu, Zhenxin
    Fan, Chun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (06) : 1720 - 1722
  • [44] Local learning algorithm for Markov blanket discovery
    Fu, Shunkai
    Desmarais, Michel
    AI 2007: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4830 : 68 - 79
  • [45] Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From Tsinghua University
    Cao, Juncheng
    Rong, Kaiyuan
    Zhai, Mingshu
    Song, Zeyu
    Ren, Yanyu
    Zhu, Yuxi
    Han, Wentao
    Zhai, Jidong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (06) : 1723 - 1726
  • [46] Adaptive Bayesian Network Structure Learning from Big Datasets
    Tang, Yan
    Zhang, Qidong
    Liu, Huaxin
    Wang, Wangsong
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2017), 2017, 10179 : 158 - 168
  • [47] Critique of "A Parallel Framework for Constraint-Based Bayesian Network Learning via Markov Blanket Discovery" by SCC Team From ShanghaiTech University
    Li, Guancheng
    Cao, Songhui
    Zhao, Chuyi
    Zhang, Siyuan
    Ji, Yuchen
    Jing, Haotian
    Li, Zecheng
    Cheng, Jiajun
    Yang, Yiwei
    Yin, Shu
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 34 (06) : 1716 - 1719
  • [48] Research on case retrieval of Bayesian network under big data
    Guo, Yuan
    Wu, K.
    DATA & KNOWLEDGE ENGINEERING, 2018, 118 : 1 - 13
  • [49] PEnBayes: A Multi-Layered Ensemble Approach for Learning Bayesian Network Structure from Big Data
    Tang, Yan
    Wang, Jianwu
    Mai Nguyen
    Altintas, Ilkay
    SENSORS, 2019, 19 (20)
  • [50] LARGE SCALE OPTIMIZATION TO MINIMIZE NETWORK TRAFFIC USING MAPREDUCE IN BIG DATA APPLICATIONS
    Neelakandan, S.
    Divyabharathi, S.
    Rahini, S.
    Vijayalakshmi, G.
    2016 INTERNATIONAL CONFERENCE ON COMPUTATION OF POWER, ENERGY INFORMATION AND COMMUNICATION (ICCPEIC), 2016, : 193 - 199