Learning Tree Structures from Noisy Data

被引:0
作者
Nikolakakis, Konstantinos E. [1 ]
Kalogerias, Dionysios S. [2 ]
Sarwate, Anand D. [1 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ 08901 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
来源
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷
关键词
INVERSE COVARIANCE ESTIMATION; ISING-MODEL SELECTION; GRAPHICAL LASSO; LIKELIHOOD-ESTIMATION; DISTRIBUTIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide high-probability sample complexity guarantees for exact structure recovery of tree-structured graphical models, when only noisy observations of the respective vertex emissions are available. We assume that the hidden variables follow either an Ising model or a Gaussian graphical model, and the observables are noise-corrupted versions of the hidden variables: We consider multiplicative 1 binary noise for Ising models, and additive Gaussian noise for Gaussian models. Such hidden models arise naturally in a variety of applications such as physics, biology, computer science, and finance. We study the impact of measurement noise on the task of learning the underlying tree structure via the well-known Chow-Liu algorithm, and provide formal sample complexity guarantees for exact recovery. In particular, for a tree with p vertices and probability of failure delta > 0, we show that the number of necessary samples for exact structure recovery is of the order of O(log(p/delta)) for Ising models (which remains the same as in the noiseless case), and O(polylog(p/delta)) for Gaussian models.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Predicting loss and fragmentation of habitat of the vulnerable subtropical rainforest tree Macadamia integrifolia with models developed from compiled ecological data
    Powell, Michael
    Accad, Arnon
    Austin, Mike P.
    Choy, Samantha Low
    Williams, Kristen J.
    Shapcott, Alison
    BIOLOGICAL CONSERVATION, 2010, 143 (06) : 1385 - 1396
  • [32] ESTIMATING LINKS OF A NETWORK FROM TIME TO EVENT DATA
    Yen, Tso-Jung
    Lee, Zong-Rong
    Chen, Yi-Hau
    Yen, Yu-Min
    Hwang, Jing-Shiang
    ANNALS OF APPLIED STATISTICS, 2017, 11 (03) : 1429 - 1451
  • [33] Extraction of the multiplicity dependence of multiparton interactions from LHC pp data using machine learning techniques
    Ortiz, Antonio
    Zepeda, Erik A.
    JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 2021, 48 (08)
  • [34] Node-based learning of differential networks from multi-platform gene expression data
    Le Ou-Yang
    Zhang, Xiao-Fei
    Wu, Min
    Li, Xiao-Li
    METHODS, 2017, 129 : 41 - 49
  • [35] INTEGRATIVE NETWORK LEARNING FOR MULTIMODALITY BIOMARKER DATA
    Xie, Shanghong
    Zeng, Donglin
    Wang, Yuanjia
    ANNALS OF APPLIED STATISTICS, 2021, 15 (01) : 64 - 87
  • [36] Unsupervised Learning of Categorical Data With Competing Models
    Ilin, Roman
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (11) : 1726 - 1737
  • [37] Underwater Place Recognition in Noisy Stereo Data using Fab-Map with a Multimodal Vocabulary from 2D Texture and 3D Surface Descriptors
    Enchev, Ivaylo
    Pfingsthorn, Max
    Luczynski, Tomasz
    Sokolovski, Igor
    Birk, Andreas
    Tietjen, Daniel
    OCEANS 2015 - GENOVA, 2015,
  • [38] Graph-Guided Bayesian Factor Model for Integrative Analysis of Multi-modal Data with Noisy Network Information
    Li, Wenrui
    Zhang, Qiyiwen
    Qu, Kewen
    Long, Qi
    STATISTICS IN BIOSCIENCES, 2024,
  • [39] Status and population structures of three anthelmintic tree species along climatic gradient in Benin and the implications for conservation
    Alowanou, Georcelin G.
    Houehanou, Thierry D.
    Mensah, Sylvanus
    Alissou, Benoit K.
    Ahoyo, Carlos C.
    Akpako, Rauldin S.
    Wabi, Faroukou
    Houinato, Marcel R. B.
    Hounzangbe-Adote, Sylvie M.
    SOUTHERN FORESTS, 2020, 82 (01): : 1 - 9
  • [40] Model Selection Using Database Characteristics: Developing a Classification Tree for Longitudinal Incidence Data
    Schwartz, Eric M.
    Bradlow, Eric T.
    Fader, Peter S.
    MARKETING SCIENCE, 2014, 33 (02) : 188 - 205