Marginal Pseudo-Likelihood Learning of Discrete Markov Network Structures

Times cited: 13
Authors
Pensar, Johan [1]
Nyman, Henrik [1]
Niiranen, Juha [2]
Corander, Jukka [2, 3]
Affiliations
[1] Abo Akad Univ, Dept Math & Stat, Turku, Finland
[2] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[3] Univ Oslo, Dept Biostat, Oslo, Norway
Source
BAYESIAN ANALYSIS | 2017 / Volume 12 / Issue 04
Funding
Academy of Finland;
Keywords
Markov networks; structure learning; pseudo-likelihood; non-chordal graph; Bayesian inference; regularization; CONTEXT-SPECIFIC INDEPENDENCE; ISING-MODEL SELECTION; RANDOM-FIELDS; BAYESIAN NETWORKS; GRAPHICAL MODELS; TREES;
DOI
10.1214/16-BA1032
Chinese Library Classification number
O1 [Mathematics];
Subject classification codes
0701; 070101;
Abstract
Markov networks are a popular tool for modeling multivariate distributions over a set of discrete variables. The core of the Markov network representation is an undirected graph which elegantly captures the dependence structure over the variables. Traditionally, Bayesian learning of the graph structure from data has been carried out under the assumption of chordality, since non-chordal graphs are difficult to evaluate for likelihood-based scores. Recently, there has been a surge of interest in regularized pseudo-likelihood methods, as such approaches can avoid the assumption of chordality. Many of the currently available methods necessitate the use of a tuning parameter to adapt the level of regularization for a particular dataset. Here we introduce the marginal pseudo-likelihood, which has a built-in regularization through marginalization over the graph-specific nuisance parameters. We prove consistency of the resulting graph estimator via comparison with the pseudo-Bayesian information criterion. To identify high-scoring graph structures in a high-dimensional setting, we design a two-step algorithm that exploits the decomposable structure of the score. Using synthetic and existing benchmark networks, the marginal pseudo-likelihood method is shown to perform favorably against recent popular structure learning methods.
Pages: 1195-1215
Number of pages: 21