Distributed Learning With Dependent Samples

被引:3
作者
Sun, Zirui [1 ]
Lin, Shao-Bo [1 ]
机构
[1] Xi An Jiao Tong Univ, Ctr Intelligent Decis Making & Machine Learning, Sch Management, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Distance learning; Computer aided instruction; Distributed databases; Kernel; Servers; Data privacy; Time series analysis; Distributed learning; strong mixing sequences; kernel ridge regression; learning rate; REGULARIZED REGRESSION; BIG DATA; RATES; CONVERGENCE; ALGORITHMS; INFERENCE; GRADIENT; PRIVACY; DESIGN;
D O I
10.1109/TIT.2022.3175761
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper focuses on learning rate analysis of distributed kernel ridge regression (DKRR) for strong mixing sequences. Using a recently developed integral operator approach and a classical covariance inequality for Banach-valued strong mixing sequences, we succeed in deriving optimal learning rates of DKRR. As a byproduct, we deduce a sufficient condition for the mixing property to guarantee the optimal learning rates for kernel ridge regression, which fills the gap of learning rates between i.i.d. samples and strong mixing sequences. A series of numerical experiments are conducted to verify our theoretical assertions via showing excellent learning performance of DKRR in learning both toy and real world time series data. All these results extend the applicable range of distributed learning from i.i.d. samples to non-i.i.d. sequences.
引用
收藏
页码:6003 / 6020
页数:18
相关论文
共 61 条
  • [1] UNIFORM CONVERGENCE OF VAPNIK-CHERVONENKIS CLASSES UNDER ERGODIC SAMPLING
    Adams, Terrence M.
    Nobel, Andrew B.
    [J]. ANNALS OF PROBABILITY, 2010, 38 (04) : 1345 - 1367
  • [2] Prediction of time series by statistical learning: general losses and fast rates
    Alquier, Pierre
    Li, Xiaoyin
    Wintenberger, Olivier
    [J]. DEPENDENCE MODELING, 2013, 1 (01): : 65 - 93
  • [3] [Anonymous], 2008, Support Vector Machines
  • [4] The Big Data Newsvendor: Practical Insights from Machine Learning
    Ban, Gah-Yi
    Rudin, Cynthia
    [J]. OPERATIONS RESEARCH, 2019, 67 (01) : 90 - 108
  • [5] Bertsekas D., 1989, PARALLEL DISTRIBUTED
  • [6] Convergence rates of Kernel Conjugate Gradient for random design regression
    Blanchard, Gilles
    Kraemer, Nicole
    [J]. ANALYSIS AND APPLICATIONS, 2016, 14 (06) : 763 - 794
  • [7] Boyd S, 2005, IEEE INFOCOM SER, P1653
  • [8] CENTRAL LIMIT-THEOREMS UNDER WEAK DEPENDENCE
    BRADLEY, RC
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 1981, 11 (01) : 1 - 16
  • [9] Basic Properties of Strong Mixing Conditions. A Survey and Some Open Questions
    Bradley, Richard C.
    [J]. PROBABILITY SURVEYS, 2005, 2 : 107 - 144
  • [10] Caponnetto A, 2007, FOUND COMPUT MATH, V7, P331, DOI [10.1007/s10208-006-0196-8, 10.1007/S10208-006-0196-8]