Linear Regression With Distributed Learning: A Generalization Error Perspective

被引:5
作者
Hellkvist, Martin [1 ]
Ozcelikkale, Ayca [1 ]
Ahlen, Anders [1 ]
机构
[1] Uppsala Univ, Dept Elect Engn, S-75237 Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Distance learning; Computer aided instruction; Training; Data models; Training data; Distributed databases; Numerical models; Distributed estimation; distributed optimization; supervised learning; generalization error; networked systems; OPTIMIZATION; ALGORITHMS;
D O I
10.1109/TSP.2021.3106441
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Distributed learning provides an attractive framework for scaling the learning task by sharing the computational load over multiple nodes in a network. Here, we investigate the performance of distributed learning for large-scale linear regression where the model parameters, i.e., the unknowns, are distributed over the network. We adopt a statistical learning approach. In contrast to works that focus on the performance on the training data, we focus on the generalization error, i.e., the performance on unseen data. We provide high-probability bounds on the generalization error for both isotropic and correlated Gaussian data as well as sub-gaussian data. These results reveal the dependence of the generalization performance on the partitioning of the model over the network. In particular, our results show that the generalization error of the distributed solution can be substantially higher than that of the centralized solution even when the error on the training data is at the same level for both the centralized and distributed approaches. Our numerical results illustrate the performance with both real-world image data as well as synthetic data.
引用
收藏
页码:5479 / 5495
页数:17
相关论文
共 50 条
  • [31] Model Pruning for Distributed Learning Over the Air
    Zhao, Zhongyuan
    Xu, Kailei
    Hong, Wei
    Peng, Mugen
    Ding, Zhiguo
    Quek, Tony Q. S.
    Yang, Howard H.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 5533 - 5549
  • [32] Data representations and generalization error in kernel based learning machines
    Ancona, Nicola
    Maglietta, Rosalia
    Stella, Ettore
    PATTERN RECOGNITION, 2006, 39 (09) : 1588 - 1603
  • [33] Generalization error of three layered learning model in Bayesian estimation
    Aoyagi, Miki
    Watanabe, Sumio
    PROCEEDINGS OF THE SECOND IASTED INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2006, : 405 - +
  • [34] Distributed estimation of functional linear regression with functional responses
    Liu, Jiamin
    Li, Rui
    Lian, Heng
    METRIKA, 2024, 87 (01) : 21 - 30
  • [35] Distributed estimation for linear regression with covariates missing at random
    Pan, Yingli
    Wang, Haoyu
    Xu, Kaidong
    Huang, He
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025, 54 (02) : 583 - 601
  • [36] Generalization error for Tweedie models: decomposition and error reduction with bagging
    Michel Denuit
    Julien Trufin
    European Actuarial Journal, 2021, 11 : 325 - 331
  • [37] Distributed estimation of functional linear regression with functional responses
    Jiamin Liu
    Rui Li
    Heng Lian
    Metrika, 2024, 87 : 21 - 30
  • [38] Distributed Non-Cooperative Games and Distributed Learning in Linear and Nonlinear Systems: An Overview
    Tan, Fuxiao
    Qi, Kuankuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024, 71 (08) : 3843 - 3856
  • [39] Generalization error for Tweedie models: decomposition and error reduction with bagging
    Denuit, Michel
    Trufin, Julien
    EUROPEAN ACTUARIAL JOURNAL, 2021, 11 (01) : 325 - 331
  • [40] Learning Linear Models Using Distributed Iterative Hessian Sketching
    Wang, Han
    Anderson, James
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168