Efficient Distributed Transfer Learning for Large-Scale Gaussian Graphic Models

被引:0
作者
Zhou, Xingcai [1 ]
Zheng, Haotian [1 ]
Zhang, Haoran [2 ]
Huang, Chao [3 ]
机构
[1] Nanjing Audit Univ, Sch Stat & Data Sci, Nanjing, Jiangsu, Peoples R China
[2] Monash Univ, Fac Sci, Clayton, Vic, Australia
[3] Univ Georgia, Dept Epidemiol & Biostat, Athens, GA USA
来源
STAT | 2024年 / 13卷 / 04期
关键词
distributed learning; FDR; Gaussian graphic models; transfer learning; REGRESSION;
D O I
10.1002/sta4.70004
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A transfer learning for the large-scale Gaussian graphical models (GGMs) is considered in a distributed architecture for fitting these target and auxiliary studies scattered across multiple sites. A distributed transfer learning algorithm, DisPower-Trans-CLIME, is proposed to learn the target GGMs by incorporating the data from similar and related auxiliary studies. The algorithm is communication efficient via the distributed power method for matrix decomposition. We show that DisPower-Trans-CLIME has a fast convergence rate comparable to the centred Trans-CLIME algorithm. A debiased DisPower-Trans-CLIME is constructed and is proved to be element-wise asymptotically normal for statistical inference. Thus, a multiple testing procedure is developed to detect edge of GGMs with false discovery rate (FDR) control. Extensive simulation experiments have been conducted to demonstrate superior numerical performance of our proposed learning algorithm on estimation and edge detection. It is also applied to infer the regional connection networks in brain regions of interest (ROIs) based on a target hospital site by leveraging the graph from multiple other hospital sites for autism spectrum disorder. We observe that the degrees of regional connectivity in the right brain are balanced, while the ones of the left brain region are extremely uneven because of the presence of multiple strong connections. However, the specific association, between the degrees of connectivity of these regions and ASD disease, is unknown.
引用
收藏
页数:14
相关论文
共 43 条
  • [1] Improving the Reliability of Deep Neural Networks in NLP: A Review
    Alshemali, Basemah
    Kalita, Jugal
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 191
  • [2] Predicting with Proxies: Transfer Learning in High Dimension
    Bastani, Hamsa
    [J]. MANAGEMENT SCIENCE, 2021, 67 (05) : 2964 - 2984
  • [3] DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS
    Battey, Heather
    Fan, Jianqing
    Liu, Han
    Lu, Junwei
    Zhu, Ziwei
    [J]. ANNALS OF STATISTICS, 2018, 46 (03) : 1352 - 1382
  • [4] Using a Gaussian Graphical Model to Explore Relationships Between Items and Variables in Environmental Psychology Research
    Bhushan, Nitin
    Mohnert, Florian
    Sloot, Daniel
    Jans, Lise
    Albers, Casper
    Steg, Linda
    [J]. FRONTIERS IN PSYCHOLOGY, 2019, 10
  • [5] TRANSFER LEARNING FOR NONPARAMETRIC CLASSIFICATION: MINIMAX RATE AND ADAPTIVE CLASSIFIER
    Cai, T. Tony
    Wei, Hongji
    [J]. ANNALS OF STATISTICS, 2021, 49 (01) : 100 - 128
  • [6] ESTIMATING SPARSE PRECISION MATRIX: OPTIMAL RATES OF CONVERGENCE AND ADAPTIVE ESTIMATION
    Cai, T. Tony
    Liu, Weidong
    Zhou, Harrison H.
    [J]. ANNALS OF STATISTICS, 2016, 44 (02) : 455 - 488
  • [7] A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation
    Cai, Tony
    Liu, Weidong
    Luo, Xi
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 594 - 607
  • [8] Chen X, 2020, J MACH LEARN RES, V21
  • [9] Advanced Modelling of the Interplay between Public Governance and Digital Transformation: New Empirical Evidence from Structural Equation Modelling and Gaussian and Mixed-Markov Graphical Models
    Craciun, Andreea-Florentina
    Taran, Alexandra-Madalina
    Noja, Gratiela Georgiana
    Pirtea, Marilen Gabriel
    Racataian, Raluca-Ioana
    [J]. MATHEMATICS, 2023, 11 (05)
  • [10] Dai R., 2022, Dispfl: Towards CommunicationEfficient Personalized Federated Learning via Decentralized Sparse Training, P4587