A survey on Bayesian network structure learning from data

被引:168
作者
Scanagatta, Mauro [1 ]
Salmeron, Antonio [2 ,3 ]
Stella, Fabio [4 ]
机构
[1] Fdn Bruno Kessler, Trento, Italy
[2] Univ Almeria, Dept Math, Almeria, Spain
[3] Univ Almeria, Ctr Dev & Transfer Math Res Ind CDTIME, Almeria, Spain
[4] Univ Milano Bicocca, Dept Informat Syst & Commun, Milan, Italy
关键词
Machine learning; Statistics; Bayesian network; Structure learning; BOUNDED-TREEWIDTH; ALGORITHM; REGRESSION; INFERENCE; MODELS; GRAPHS; SPACE;
D O I
10.1007/s13748-019-00194-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A necessary step in the development of artificial intelligence is to enable a machine to represent how the world works, building an internal structure from data. This structure should hold a good trade-off between expressive power and querying efficiency. Bayesian networks have proven to be an effective and versatile tool for the task at hand. They have been applied to modeling knowledge in a variety of fields, ranging from bioinformatics to law, from image processing to economic risk analysis. A crucial aspect is learning the dependency graph of a Bayesian network from data. This task, called structure learning, is NP-hard and is the subject of intense, cutting-edge research. In short, it can be thought of as choosing one graph over the many candidates, grounding our reasoning over a collection of samples of the distribution generating the data. The number of possible graphs increases very quickly at the increase in the number of variables. Searching in this space, and selecting a graph over the others, becomes quickly burdensome. In this survey, we review the most relevant structure learning algorithms that have been proposed in the literature. We classify them according to the approach they follow for solving the problem and we also show alternatives for handling missing data and continuous variable. An extensive review of existing software tools is also given.
引用
收藏
页码:425 / 439
页数:15
相关论文
共 69 条
  • [1] Abellan J., 2006, Proc. 3rd Eur. Workshop Probabilistic Graphical Models, Prague, P1
  • [2] Adel T, 2017, AAAI CONF ARTIF INTE, P1684
  • [3] On the use of local search heuristics to improve GES-based Bayesian network learning
    Alonso, Juan I.
    de la Ossa, Luis
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. APPLIED SOFT COMPUTING, 2018, 64 : 366 - 376
  • [4] Scaling up the Greedy Equivalence Search algorithm by constraining the search space of equivalence classes
    Alonso-Barba, Juan I.
    delaOssa, Luis
    Gamez, Jose A.
    Puerta, Jose M.
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2013, 54 (04) : 429 - 451
  • [5] Structural learning of Bayesian networks using local algorithms based on the space of orderings
    Alonso-Barba, Juan I.
    delaOssa, Luis
    Puerta, Jose M.
    [J]. SOFT COMPUTING, 2011, 15 (10) : 1881 - 1895
  • [6] [Anonymous], 2010, P MACHINE LEARNING R
  • [7] [Anonymous], 2006, UAI
  • [8] Efficient identification of independence networks using mutual information
    Bacciu, Davide
    Etchells, Terence A.
    Lisboa, Paulo J. G.
    Whittaker, Joe
    [J]. COMPUTATIONAL STATISTICS, 2013, 28 (02) : 621 - 646
  • [9] A tabu search approach for the flow shop scheduling problem
    Ben-Daya, M
    Al-Fawzan, M
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 109 (01) : 88 - 95
  • [10] Boettcher SG., 2003, J Stat Softw, V8, P1, DOI DOI 10.18637/JSS.V008.I20