consistency;
distributed data;
divide and conquer algorithm;
Mallows' criterion;
model averaging;
optimality;
FOCUSED INFORMATION CRITERION;
BIG DATA;
REGRESSION;
SELECTION;
INFERENCE;
D O I:
暂无
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is to fill this gap. We propose two divide-and conquer-type model averaging estimators for linear models with distributed data. Under some regularity conditions, we show that the weights from Mallows model averaging criterion converge in L-2 to the theoretically optimal weights minimizing the risk of the model averaging estimator. We also give the bounds of the in-sample and out-of-sample mean squared errors and prove the asymptotic optimality for the proposed model averaging estimators. Our conclusions hold even when the dimensions and the number of candidate models are divergent. Simulation results and a real airline data analysis illustrate that the proposed model averaging methods perform better than the commonly used model selection and model averaging methods in distributed data cases. Our approaches contribute to model averaging theory in distributed data and parallel computations, and can be applied in big data analysis to save time and reduce the computational burden.
机构:
City Univ Hong Kong, Dept Math, Tat Chee Ave, Kowloon, Hong Kong, Peoples R ChinaCity Univ Hong Kong, Dept Math, Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
Lin, Shao-Bo
Guo, Xin
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Appl Math, Kowloon, Hong Kong, Peoples R ChinaCity Univ Hong Kong, Dept Math, Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
Guo, Xin
Zhou, Ding-Xuan
论文数: 0引用数: 0
h-index: 0
机构:
City Univ Hong Kong, Dept Math, Tat Chee Ave, Kowloon, Hong Kong, Peoples R ChinaCity Univ Hong Kong, Dept Math, Tat Chee Ave, Kowloon, Hong Kong, Peoples R China
机构:
Renmin Univ China, Sch Stat, Beijing 100872, Peoples R China
Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R ChinaRenmin Univ China, Sch Stat, Beijing 100872, Peoples R China
Liao, Jun
Zou, Guohua
论文数: 0引用数: 0
h-index: 0
机构:
Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R ChinaRenmin Univ China, Sch Stat, Beijing 100872, Peoples R China
机构:
Univ Chinese Acad Sci, Beijing 100049, Peoples R China
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaUniv Chinese Acad Sci, Beijing 100049, Peoples R China
Zhang, Haili
Zou, Guohua
论文数: 0引用数: 0
h-index: 0
机构:
Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R ChinaUniv Chinese Acad Sci, Beijing 100049, Peoples R China
机构:
Microsoft Res, Bangalore 560001, Karnataka, IndiaMicrosoft Res, Bangalore 560001, Karnataka, India
Jain, Prateek
Netrapalli, Praneeth
论文数: 0引用数: 0
h-index: 0
机构:
Microsoft Res, Bangalore 560001, Karnataka, IndiaMicrosoft Res, Bangalore 560001, Karnataka, India
Netrapalli, Praneeth
Kakade, Sham M.
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Paul G Allen Sch Comp Sci, Seattle, WA 98195 USA
Univ Washington, Dept Stat, Seattle, WA 98195 USAMicrosoft Res, Bangalore 560001, Karnataka, India
Kakade, Sham M.
Kidambi, Rahul
论文数: 0引用数: 0
h-index: 0
机构:
Univ Washington, Dept Elect Engn, Seattle, WA 98195 USAMicrosoft Res, Bangalore 560001, Karnataka, India
Kidambi, Rahul
Sidford, Aaron
论文数: 0引用数: 0
h-index: 0
机构:
Stanford Univ, Dept Management Sci & Engn, Palo Alto, CA 94305 USAMicrosoft Res, Bangalore 560001, Karnataka, India
机构:
Qingdao Univ, Qingdao, Shandong, Peoples R China
Chinese Acad Sci, Acad Math & Syst Sci, N55 Zhong Guan Cun East Rd, Beijing 100190, Peoples R ChinaQingdao Univ, Qingdao, Shandong, Peoples R China
Zhang, Xinyu
Wang, Wendun
论文数: 0引用数: 0
h-index: 0
机构:
Erasmus Univ, Econometr Inst, POB 1738, NL-3000 DR Rotterdam, Netherlands
Tinbergen Inst, Amsterdam, NetherlandsQingdao Univ, Qingdao, Shandong, Peoples R China
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
Zhang, Xinyu
Wan, Alan T. K.
论文数: 0引用数: 0
h-index: 0
机构:
City Univ Hong Kong, Dept Management Sci, Kowloon, Hong Kong, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
Wan, Alan T. K.
Zou, Guohua
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaChinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China