consistency;
distributed data;
divide and conquer algorithm;
Mallows' criterion;
model averaging;
optimality;
FOCUSED INFORMATION CRITERION;
BIG DATA;
REGRESSION;
SELECTION;
INFERENCE;
D O I:
暂无
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Divide and conquer algorithm is a common strategy applied in big data. Model averaging has the natural divide-and-conquer feature, but its theory has not been developed in big data scenarios. The goal of this paper is to fill this gap. We propose two divide-and conquer-type model averaging estimators for linear models with distributed data. Under some regularity conditions, we show that the weights from Mallows model averaging criterion converge in L-2 to the theoretically optimal weights minimizing the risk of the model averaging estimator. We also give the bounds of the in-sample and out-of-sample mean squared errors and prove the asymptotic optimality for the proposed model averaging estimators. Our conclusions hold even when the dimensions and the number of candidate models are divergent. Simulation results and a real airline data analysis illustrate that the proposed model averaging methods perform better than the commonly used model selection and model averaging methods in distributed data cases. Our approaches contribute to model averaging theory in distributed data and parallel computations, and can be applied in big data analysis to save time and reduce the computational burden.
机构:
City Univ Hong Kong, Dept Management Sci, Kowloon, Hong Kong, Peoples R ChinaCity Univ Hong Kong, Dept Management Sci, Kowloon, Hong Kong, Peoples R China
Wan, Alan T. K.
Zhang, Xinyu
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaCity Univ Hong Kong, Dept Management Sci, Kowloon, Hong Kong, Peoples R China
Zhang, Xinyu
Zou, Guohua
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R ChinaCity Univ Hong Kong, Dept Management Sci, Kowloon, Hong Kong, Peoples R China
机构:
Qingdao Univ, Sch Math & Stat, Qingdao 266071, Peoples R ChinaQingdao Univ, Sch Math & Stat, Qingdao 266071, Peoples R China
Li, Xin-min
Zou, Guo-hua
论文数: 0引用数: 0
h-index: 0
机构:
Capital Normal Univ, Sch Math Sci, Beijing 100048, Peoples R ChinaQingdao Univ, Sch Math & Stat, Qingdao 266071, Peoples R China
Zou, Guo-hua
Zhang, Xin-yu
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci, Acad Math & Syst Sci, Beijing 100190, Peoples R China
Beijing Acad Artificial Intelligence, Beijing 100084, Peoples R ChinaQingdao Univ, Sch Math & Stat, Qingdao 266071, Peoples R China
Zhang, Xin-yu
Zhao, Shang-wei
论文数: 0引用数: 0
h-index: 0
机构:
Minzu Univ China, Coll Sci, Beijing 100081, Peoples R ChinaQingdao Univ, Sch Math & Stat, Qingdao 266071, Peoples R China