Distributed Continual Learning With CoCoA in High-Dimensional Linear Regression

被引:0
|
作者
Hellkvist, Martin [1 ]
Ozcelikkale, Ayca [1 ]
Ahlen, Anders [1 ]
机构
[1] Uppsala Univ, Dept Elect Engn, S-75121 Uppsala, Sweden
基金
瑞典研究理事会;
关键词
Task analysis; Training; Distributed databases; Distance learning; Computer aided instruction; Data models; Training data; Multi-task networks; networked systems; distributed estimation; adaptation; overparametrization; NEURAL-NETWORKS; ALGORITHMS;
D O I
10.1109/TSP.2024.3361714
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We consider estimation under scenarios where the signals of interest exhibit change of characteristics over time. In particular, we consider the continual learning problem where different tasks, e.g., data with different distributions, arrive sequentially and the aim is to perform well on the newly arrived task without performance degradation on the previously seen tasks. In contrast to the continual learning literature focusing on the centralized setting, we investigate the problem from a distributed estimation perspective. We consider the well-established distributed learning algorithm CoCoA, which distributes the model parameters and the corresponding features over the network. We provide exact analytical characterization for the generalization error of CoCoA under continual learning for linear regression in a range of scenarios, where overparameterization is of particular interest. These analytical results characterize how the generalization error depends on the network structure, the task similarity and the number of tasks, and show how these dependencies are intertwined. In particular, our results show that the generalization error can be significantly reduced by adjusting the network size, where the most favorable network size depends on task similarity and the number of tasks. We present numerical results verifying the theoretical analysis and illustrate the continual learning performance of CoCoA with a digit classification task.
引用
收藏
页码:1015 / 1031
页数:17
相关论文
共 50 条
  • [11] Graph Linear Convolution Pooling for Learning in Incomplete High-Dimensional Data
    Bi, Fanghui
    He, Tiantian
    Ong, Yew-Soon
    Luo, Xin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (04) : 1838 - 1852
  • [12] Challenges in High-Dimensional Reinforcement Learning with Evolution Strategies
    Mueller, Nils
    Glasmachers, Tobias
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XV, PT II, 2018, 11102 : 411 - 423
  • [13] An adaptive shortest-solution guided decimation approach to sparse high-dimensional linear regression
    Yu, Xue
    Sun, Yifan
    Zhou, Hai-Jun
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [14] Local-Density Subspace Distributed Clustering for High-Dimensional Data
    Geng, Yangli-ao
    Li, Qingyong
    Liang, Mingfei
    Chi, Chong-Yung
    Tan, Juan
    Huang, Heng
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (08) : 1799 - 1814
  • [15] ADMM for High-Dimensional Sparse Penalized Quantile Regression
    Gu, Yuwen
    Fan, Jun
    Kong, Lingchen
    Ma, Shiqian
    Zou, Hui
    TECHNOMETRICS, 2018, 60 (03) : 319 - 331
  • [16] Centralized and Distributed Anonymization for High-Dimensional Healthcare Data
    Mohammed, Noman
    Fung, Benjamin C. M.
    Hung, Patrick C. K.
    Lee, Cheuk-Kwong
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (04)
  • [17] Distributed Min-Max Learning Scheme for Neural Networks With Applications to High-Dimensional Classification
    Raghavan, Krishnan
    Garg, Shweta
    Jagannathan, Sarangapani
    Samaranayake, V. A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4323 - 4333
  • [18] Deep learning for high-dimensional reliability analysis
    Li, Mingyang
    Wang, Zequn
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 139
  • [19] Universal Features for High-Dimensional Learning and Inference
    Huang, Shao-Lun
    Makur, Anuran
    Wornell, Gregory W.
    Zheng, Lizhong
    FOUNDATIONS AND TRENDS IN COMMUNICATIONS AND INFORMATION THEORY, 2024, 21 (1-2): : 1 - 299
  • [20] Probabilistic partition of unity networks for high-dimensional regression problems
    Fan, Tiffany
    Trask, Nathaniel
    D'Elia, Marta
    Darve, Eric
    INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN ENGINEERING, 2023, 124 (10) : 2215 - 2236