Optimal Distributed Model Averaging for Multivariate Additive Model

被引:0
作者
Song, Minghui [1 ]
Qu, Tianyao [1 ]
Zhao, Zhihao [2 ]
Zou, Guohua [1 ]
机构
[1] Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R China
[2] Capital Univ Econ & Business, Sch Stat, Beijing 100070, Peoples R China
基金
中国国家自然科学基金;
关键词
Additive model; asymptotic optimality; consistency; distributed algorithm; weight choice; REGRESSION;
D O I
10.1007/s11424-025-5054-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In the era of massive data, the study of distributed data is a significant topic. Model averaging can be effectively applied to distributed data by combining information from all machines. For linear models, the model averaging approach has been developed in the context of distributed data. However, further investigation is needed for more complex models. In this paper, the authors propose a distributed optimal model averaging approach based on multivariate additive models, which approximates unknown functions using B-splines allowing each machine to have a different smoothing degree. To utilize the information from the covariance matrix of dependent errors in multivariate multiple regressions, the authors use the Mahalanobis distance to construct a Mallows-type weight choice criterion. The criterion can be computed by transmitting information between the local machines and the center machine in two steps. The authors demonstrate the asymptotic optimality of the proposed model averaging estimator when the covariates are subject to uncertainty, and obtain the convergence rate of the weight vector to the theoretically optimal weights. The results remain novel even for additive models with a single response variable. The numerical examples show that the proposed method yields good performance.
引用
收藏
页数:25
相关论文
共 31 条
[21]  
WHITTLE P., 1960, Theory of Probability and its Applications, V5, P302, DOI DOI 10.1137/1105028
[22]   Generalized additive models for large data sets [J].
Wood, Simon N. ;
Goude, Yannig ;
Shaw, Simon .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2015, 64 (01) :139-155
[23]   Distributed Generalized Cross-Validation for Divide-and-Conquer Kernel Ridge Regression and Its Asymptotic Optimality [J].
Xu, Ganggang ;
Shang, Zuofeng ;
Cheng, Guang .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (04) :891-908
[24]  
Xue L, 2009, STAT SINICA, V19, P1281
[25]   Distributed Mallows Model Averaging for Ridge Regressions [J].
Zhang, Haili ;
Wan, Alan T. K. ;
You, Kang ;
Zou, Guohua .
ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2025, 41 (02) :780-826
[26]   Comparative efficacy of different treatments for menstrual migraine: a systematic review and network meta-analysis [J].
Zhang, Han ;
Qi, Jian-Zhi ;
Zhang, Zhi-Hua .
JOURNAL OF HEADACHE AND PAIN, 2023, 24 (01)
[27]   OPTIMAL MODEL AVERAGING ESTIMATION FOR PARTIALLY LINEAR MODELS [J].
Zhang, Xinyu ;
Wang, Wendun .
STATISTICA SINICA, 2019, 29 (02) :693-718
[28]   Model averaging by jackknife criterion in models with dependent data [J].
Zhang, Xinyu ;
Wan, Alan T. K. ;
Zou, Guohua .
JOURNAL OF ECONOMETRICS, 2013, 174 (02) :82-94
[29]  
Zhang YC, 2015, J MACH LEARN RES, V16, P3299
[30]   Kernel Averaging Estimators [J].
Zhu, Rong ;
Zhang, Xinyu ;
Wan, Alan T. K. ;
Zou, Guohua .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 41 (01) :157-169