Linear Regression With Distributed Learning: A Generalization Error Perspective

Cited by: 5

Authors:

Hellkvist, Martin [1]

Ozcelikkale, Ayca [1]

Ahlen, Anders [1]

Affiliations:

[1] Uppsala Univ, Dept Elect Engn, S-75237 Uppsala, Sweden

Source:

IEEE TRANSACTIONS ON SIGNAL PROCESSING | 2021 / Vol. 69

Funding:

Swedish Research Council;

Keywords:

Distance learning; Computer aided instruction; Training; Data models; Training data; Distributed databases; Numerical models; Distributed estimation; distributed optimization; supervised learning; generalization error; networked systems; OPTIMIZATION; ALGORITHMS;

DOI:

10.1109/TSP.2021.3106441

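Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code: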

0808 ; 0809 ;

Abstract:

Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear regression where the model parameters, i.e., the unknowns, are distributed over the network. We adopt a statistical learning approach. In contrast to works that focus on the performance on the training data, we focus on the generalization error, i.e., the performance on unseen data. We provide high-probability bounds on the generalization error for isotropic and correlated Gaussian data as well as sub-Gaussian data. These results reveal the dependence of the generalization performance on the partitioning of the model over the network. In particular, our results show that the generalization error of the distributed solution can be substantially higher than that of the centralized solution even when the error on the training data is at the same level for both the centralized and distributed approaches. Our numerical results illustrate the performance with both real-world image data and synthetic data.
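
The setting described in the abstract can be illustrated with a minimal NumPy sketch. It is not taken from the paper: the update rule (each node fits the shared residual with its own block of columns, and the updates are averaged), the function names, the problem sizes, and the isotropic Gaussian data model are all assumptions chosen for illustration. The sketch compares training error and generalization error for a centralized minimum-norm solution and a solution computed with the parameters partitioned over nodes.

```python
import numpy as np


def centralized_fit(A, y):
    """Minimum-norm least-squares solution over all parameters at once."""
    return np.linalg.pinv(A) @ y


def distributed_fit(A, y, n_nodes, n_iters=50):
    """Each node owns one block of columns of A (a slice of the parameters).

    Assumed update rule (illustrative): per iteration, every node fits the
    shared residual with its own block (minimum-norm least squares), and the
    block updates are scaled by 1 / n_nodes.
    """
    n_samples, n_params = A.shape
    blocks = np.array_split(np.arange(n_params), n_nodes)
    local_pinvs = [np.linalg.pinv(A[:, b]) for b in blocks]
    x = np.zeros(n_params)
    train_errors = []
    for _ in range(n_iters):
        residual = y - A @ x  # shared by all nodes in this iteration
        for b, P in zip(blocks, local_pinvs):
            x[b] += (P @ residual) / n_nodes
        train_errors.append(np.mean((y - A @ x) ** 2))
    return x, train_errors


def generalization_error(x_hat, x_true, noise_var):
    """Expected squared error on a fresh sample a ~ N(0, I), y = a @ x_true + noise."""
    return float(np.sum((x_hat - x_true) ** 2) + noise_var)


# Synthetic isotropic Gaussian problem (sizes are arbitrary assumptions).
rng = np.random.default_rng(0)
n_samples, n_params, noise_var = 40, 60, 0.01
A = rng.standard_normal((n_samples, n_params))
x_true = rng.standard_normal(n_params) / np.sqrt(n_params)
y = A @ x_true + np.sqrt(noise_var) * rng.standard_normal(n_samples)

x_cen = centralized_fit(A, y)
x_dist, train_errors = distributed_fit(A, y, n_nodes=3, n_iters=200)

print("centralized train MSE: ", np.mean((y - A @ x_cen) ** 2))
print("distributed train MSE: ", train_errors[-1])
print("centralized gen. error:", generalization_error(x_cen, x_true, noise_var))
print("distributed gen. error:", generalization_error(x_dist, x_true, noise_var))
```

Varying `n_nodes` changes how many parameters each node holds relative to the number of samples, which is the partitioning effect the abstract refers to. The printed numbers depend on the random seed and the assumed sizes, so they should be read as an illustration rather than a reproduction of the paper's results.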


Pages: 5479 - 5495

Page count: 17

