Learning Tree Structures from Noisy Data

被引:0
作者
Nikolakakis, Konstantinos E. [1 ]
Kalogerias, Dionysios S. [2 ]
Sarwate, Anand D. [1 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ 08901 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
来源
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷
关键词
INVERSE COVARIANCE ESTIMATION; ISING-MODEL SELECTION; GRAPHICAL LASSO; LIKELIHOOD-ESTIMATION; DISTRIBUTIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide high-probability sample complexity guarantees for exact structure recovery of tree-structured graphical models, when only noisy observations of the respective vertex emissions are available. We assume that the hidden variables follow either an Ising model or a Gaussian graphical model, and the observables are noise-corrupted versions of the hidden variables: We consider multiplicative 1 binary noise for Ising models, and additive Gaussian noise for Gaussian models. Such hidden models arise naturally in a variety of applications such as physics, biology, computer science, and finance. We study the impact of measurement noise on the task of learning the underlying tree structure via the well-known Chow-Liu algorithm, and provide formal sample complexity guarantees for exact recovery. In particular, for a tree with p vertices and probability of failure delta > 0, we show that the number of necessary samples for exact structure recovery is of the order of O(log(p/delta)) for Ising models (which remains the same as in the noiseless case), and O(polylog(p/delta)) for Gaussian models.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Using remotely sensed data to model suitable habitats for tree species in a desert environment
    Campos, Valeria E.
    Cappa, Flavio M.
    Maldonado Viviana, Fernandez
    Giannoni, Stella M.
    JOURNAL OF VEGETATION SCIENCE, 2016, 27 (01) : 200 - 210
  • [42] Marginal Pseudo-Likelihood Learning of Discrete Markov Network Structures
    Pensar, Johan
    Nyman, Henrik
    Niiranen, Juha
    Corander, Jukka
    BAYESIAN ANALYSIS, 2017, 12 (04): : 1195 - 1215
  • [43] A Bayesian Data Augmentation Approach for Learning Deep Models
    Toan Tran
    Trung Pham
    Carneiro, Gustavo
    Palmer, Lyle
    Reid, Ian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [44] Representing molecular and materials data for unsupervised machine learning
    Swann, E.
    Sun, B.
    Cleland, D. M.
    Barnard, A. S.
    MOLECULAR SIMULATION, 2018, 44 (11) : 905 - 920
  • [45] SIMULTANEOUS GENERATION OF BINARY AND NORMAL DATA WITH SPECIFIED MARGINAL AND ASSOCIATION STRUCTURES
    Demirtas, Hakan
    Doganay, Beyza
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2012, 22 (02) : 223 - 236
  • [46] Using tree population size structures to assess the impacts of cattle grazing and eucalypts plantations in subtropical South America
    de Souza, Iliane Freitas
    Souza, Alexandre F.
    Pizo, Marco Aurelio
    Ganade, Gislene
    BIODIVERSITY AND CONSERVATION, 2010, 19 (06) : 1683 - 1698
  • [47] On the Selection of the Proper Number of Classes in TFD Segmentation for Extraction of Useful Information Content from Noisy Signals
    Saulig, Nicoletta
    Milanovic, Zeljka
    Lerga, Jonatan
    Griparic, Karlo
    2018 3RD INTERNATIONAL CONFERENCE ON SMART AND SUSTAINABLE TECHNOLOGIES (SPLITECH), 2018, : 89 - 93
  • [48] Extraction of Useful Information Content From Noisy Signals Based on Structural Affinity of Clustered TFDs' Coefficients
    Saulig, Nicoletta
    Lerga, Jonatan
    Milanovic, Zeljka
    Ioana, Cornel
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (12) : 3154 - 3167
  • [49] Nonlinear network-based quantitative trait prediction from biological data
    Blein-Nicolas, Melisande
    Devijver, Emilie
    Gallopin, Melina
    Perthame, Emeline
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2024, 73 (03) : 796 - 815
  • [50] Reconstruction of molecular network evolution from cross-sectional omics data
    Aflakparast, Mehran
    de Gunst, Mathisca C. M.
    van Wieringen, Wessel N.
    BIOMETRICAL JOURNAL, 2018, 60 (03) : 547 - 563