Duplicate Detection for Bayesian Network Structure Learning

被引:0
作者
Niklas Jahnsson
Brandon Malone
Petri Myllymäki
机构
[1] University of Helsinki,Section of Bioinformatics and Systems Cardiology, Department of Internal Medicine III and Klaus Tschira Institute for Integrative Computational Cardiology
[2] University of Heidelberg,Helsinki Institute for Information Technology
[3] University of Helsinki,undefined
来源
New Generation Computing | 2017年 / 35卷
关键词
Bayesian networks; Structure learning; State space search; Delayed duplicate detection; Structured duplicate detection;
D O I
暂无
中图分类号
学科分类号
摘要
We address the well-known score-based Bayesian network structure learning problem. Breadth-first branch and bound (BFBnB) has been shown to be an effective approach for solving this problem. Duplicate detection is an important component of the BFBnB algorithm. Previously, an external sorting-based technique was used for delayed duplicate detection (DDD). We propose a hashing-based technique for DDD and a bin packing algorithm for minimizing the number of external memory files and operations. We also give a structured duplicate detection approach which completely eliminates DDD. Importantly, these techniques ensure the search algorithms respect any given memory bound. Empirically, we demonstrate that structured duplicate detection is significantly faster than the previous state of the art in limited-memory settings. Our results show that the bin packing algorithm incurs some overhead, but that the overhead is offset by reducing I/O when more memory is available.
引用
收藏
页码:47 / 67
页数:20
相关论文
共 18 条
[1]  
Cooper GF(1992)A Bayesian method for the induction of probabilistic networks from data Mach. Learn. 9 309-347
[2]  
Herskovits E(2011)Efficient learning of Bayesian networks using constraints J. Mach. Learn. Res. 12 663-689
[3]  
de Campos CP(2000)A new approach for learning belief networks using independence criteria Int J Approx. Reason. 24 11-37
[4]  
Ji Q(1995)Learning Bayesian networks: the combination of knowledge and statistical data Mach. Learn. 20 197-243
[5]  
de Campos LM(2004)Exact Bayesian structure discovery in Bayesian networks J. Mach. Learn. Res. 5 549-573
[6]  
Huete JF(2011)Parallel algorithm for learning optimal Bayesian network structure J. Mach. Learn. Res. 12 2437-2459
[7]  
Heckerman D(2013)Learning optimal Bayesian networks: a shortest path perspective J. Artif. Intell. Res. 48 23-65
[8]  
Geiger D(2006)Breadth-first heuristic search Artif. Intell. 170 385-408
[9]  
Chickering DM(undefined)undefined undefined undefined undefined-undefined
[10]  
Koivisto M(undefined)undefined undefined undefined undefined-undefined