Learning Tree Structures from Noisy Data

被引:0
|
作者
Nikolakakis, Konstantinos E. [1 ]
Kalogerias, Dionysios S. [2 ]
Sarwate, Anand D. [1 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ 08901 USA
[2] Princeton Univ, Princeton, NJ 08544 USA
来源
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89 | 2019年 / 89卷
关键词
INVERSE COVARIANCE ESTIMATION; ISING-MODEL SELECTION; GRAPHICAL LASSO; LIKELIHOOD-ESTIMATION; DISTRIBUTIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We provide high-probability sample complexity guarantees for exact structure recovery of tree-structured graphical models, when only noisy observations of the respective vertex emissions are available. We assume that the hidden variables follow either an Ising model or a Gaussian graphical model, and the observables are noise-corrupted versions of the hidden variables: We consider multiplicative 1 binary noise for Ising models, and additive Gaussian noise for Gaussian models. Such hidden models arise naturally in a variety of applications such as physics, biology, computer science, and finance. We study the impact of measurement noise on the task of learning the underlying tree structure via the well-known Chow-Liu algorithm, and provide formal sample complexity guarantees for exact recovery. In particular, for a tree with p vertices and probability of failure delta > 0, we show that the number of necessary samples for exact structure recovery is of the order of O(log(p/delta)) for Ising models (which remains the same as in the noiseless case), and O(polylog(p/delta)) for Gaussian models.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Representing and Learning High Dimensional Data with the Optimal Transport Map from a Probabilistic Viewpoint
    Park, Serim
    Thorpe, Matthew
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7864 - 7872
  • [22] Sparse-Group Lasso for Graph Learning From Multi-Attribute Data
    Tugnai, Jitendra K.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 (69) : 1771 - 1786
  • [23] Kernel partial correlation: a novel approach to capturing conditional independence in graphical models for noisy data
    Oh, Jihwan
    Zheng, Faye
    Doerge, R. W.
    Chun, Hyonho
    JOURNAL OF APPLIED STATISTICS, 2018, 45 (14) : 2677 - 2696
  • [24] Information-Theoretic Image Reconstruction and Segmentation from Noisy Projections
    Visser, Gerhard
    Dowe, David L.
    Svalbe, Imants D.
    AI 2009: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2009, 5866 : 170 - 179
  • [25] An Integrated Approach of Learning Genetic Networks From Genome-Wide Gene Expression Data Using Gaussian Graphical Model and Monte Carlo Method
    Zhao, Haitao
    Datta, Sujay
    Duan, Zhong-Hui
    BIOINFORMATICS AND BIOLOGY INSIGHTS, 2023, 17
  • [26] LEARNING A TREE-STRUCTURED ISING MODEL IN ORDER TO MAKE PREDICTIONS
    Bresler, Guy
    Karzand, Mina
    ANNALS OF STATISTICS, 2020, 48 (02): : 713 - 737
  • [27] Structured learning of time-varying networks with application to PM2.5 data
    Guo, Xiao a
    Zhang, Hai
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2021, 50 (05) : 1364 - 1382
  • [28] Joint learning of multiple gene networks from single-cell gene expression data
    Wu, Nuosi
    Yin, Fu
    Ou-Yang, Le
    Zhu, Zexuan
    Xie, Weixin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2583 - 2595
  • [30] Identifying refugia from climate change using coupled ecological and genetic data in a transitional Mediterranean-temperate tree species
    Temunovic, M.
    Frascaria-Lacoste, N.
    Franjic, J.
    Satovic, Z.
    Fernandez-Manjarres, J. F.
    MOLECULAR ECOLOGY, 2013, 22 (08) : 2128 - 2142