Efficient Distributed Transfer Learning for Large-Scale Gaussian Graphic Models

被引：0

作者：

Zhou, Xingcai ^{[1
]}

Zheng, Haotian ^{[1
]}

Zhang, Haoran ^{[2
]}

Huang, Chao ^{[3
]}

机构：

[1] Nanjing Audit Univ, Sch Stat & Data Sci, Nanjing, Jiangsu, Peoples R China

[2] Monash Univ, Fac Sci, Clayton, Vic, Australia

[3] Univ Georgia, Dept Epidemiol & Biostat, Athens, GA USA

来源：

STAT | 2024年 / 13卷 / 04期

关键词：

distributed learning; FDR; Gaussian graphic models; transfer learning; REGRESSION;

D O I：

10.1002/sta4.70004

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

A transfer learning for the large-scale Gaussian graphical models (GGMs) is considered in a distributed architecture for fitting these target and auxiliary studies scattered across multiple sites. A distributed transfer learning algorithm, DisPower-Trans-CLIME, is proposed to learn the target GGMs by incorporating the data from similar and related auxiliary studies. The algorithm is communication efficient via the distributed power method for matrix decomposition. We show that DisPower-Trans-CLIME has a fast convergence rate comparable to the centred Trans-CLIME algorithm. A debiased DisPower-Trans-CLIME is constructed and is proved to be element-wise asymptotically normal for statistical inference. Thus, a multiple testing procedure is developed to detect edge of GGMs with false discovery rate (FDR) control. Extensive simulation experiments have been conducted to demonstrate superior numerical performance of our proposed learning algorithm on estimation and edge detection. It is also applied to infer the regional connection networks in brain regions of interest (ROIs) based on a target hospital site by leveraging the graph from multiple other hospital sites for autism spectrum disorder. We observe that the degrees of regional connectivity in the right brain are balanced, while the ones of the left brain region are extremely uneven because of the presence of multiple strong connections. However, the specific association, between the degrees of connectivity of these regions and ASD disease, is unknown.

引用

页数：14

共 43 条

[1] Improving the Reliability of Deep Neural Networks in NLP: A Review
Alshemali, Basemah
Kalita, Jugal
[J]. KNOWLEDGE-BASED SYSTEMS, 2020, 191
[2] Predicting with Proxies: Transfer Learning in High Dimension
Bastani, Hamsa
[J]. MANAGEMENT SCIENCE, 2021, 67 (05) : 2964 - 2984
[3] DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS
Battey, Heather
Fan, Jianqing
Liu, Han
Lu, Junwei
Zhu, Ziwei
[J]. ANNALS OF STATISTICS, 2018, 46 (03) : 1352 - 1382
[4] Using a Gaussian Graphical Model to Explore Relationships Between Items and Variables in Environmental Psychology Research
Bhushan, Nitin
Mohnert, Florian
Sloot, Daniel
Jans, Lise
Albers, Casper
Steg, Linda
[J]. FRONTIERS IN PSYCHOLOGY, 2019, 10
[5] TRANSFER LEARNING FOR NONPARAMETRIC CLASSIFICATION: MINIMAX RATE AND ADAPTIVE CLASSIFIER
Cai, T. Tony
Wei, Hongji
[J]. ANNALS OF STATISTICS, 2021, 49 (01) : 100 - 128
[6] ESTIMATING SPARSE PRECISION MATRIX: OPTIMAL RATES OF CONVERGENCE AND ADAPTIVE ESTIMATION
Cai, T. Tony
Liu, Weidong
Zhou, Harrison H.
[J]. ANNALS OF STATISTICS, 2016, 44 (02) : 455 - 488
[7] A Constrained l1 Minimization Approach to Sparse Precision Matrix Estimation
Cai, Tony
Liu, Weidong
Luo, Xi
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) : 594 - 607
[8] Chen X, 2020, J MACH LEARN RES, V21
[9] Advanced Modelling of the Interplay between Public Governance and Digital Transformation: New Empirical Evidence from Structural Equation Modelling and Gaussian and Mixed-Markov Graphical Models
Craciun, Andreea-Florentina
Taran, Alexandra-Madalina
Noja, Gratiela Georgiana
Pirtea, Marilen Gabriel
Racataian, Raluca-Ioana
[J]. MATHEMATICS, 2023, 11 (05)
[10] Dai R., 2022, Dispfl: Towards CommunicationEfficient Personalized Federated Learning via Decentralized Sparse Training, P4587

← 1 2 3 4 5 →