Robust estimation for nonrandomly distributed data

被引:0
作者
Li, Shaomin [1 ]
Wang, Kangning [2 ]
Xu, Yong [3 ]
机构
[1] Beijing Normal Univ, Ctr Stat & Data Sci, 18 Jinfeng Rd, Zhuhai 519087, Peoples R China
[2] Shandong Technol & Business Univ, Sch Stat, 191 Binhai Middle Rd, Yantai 264005, Peoples R China
[3] Shandong Technol & Business Univ, Sch Business Adm, 191 Binhai Middle Rd, Yantai 264005, Peoples R China
关键词
Distributed data; Communication-efficient; Modal regression; Robustness; VARIABLE SELECTION; REGRESSION;
D O I
10.1007/s10463-022-00852-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In recent years, many methodologies for distributed data have been developed. However, there are two problems. First, most of these methods require the data to be randomly and uniformly distributed across different machines. Second, the methods are mainly not robust. To solve these problems, we propose a distributed pilot modal regression estimator, which achieves robustness and can adapt when the data are stored nonrandomly. First, we collect a random pilot sample from different machines; then, we approximate the global MR objective function by a communication-efficient surrogate that can be efficiently evaluated by the pilot sample and the local gradients. The final estimator is obtained by minimizing the surrogate function in the master machine, while the other machines only need to calculate their gradients. Theoretical results show the new estimator is asymptotically efficient as the global MR estimator. Simulation studies illustrate the utility of the proposed approach.
引用
收藏
页码:493 / 509
页数:17
相关论文
共 27 条
[1]  
[Anonymous], 2013, P 30 INT C MACHINE
[2]   DISTRIBUTED TESTING AND ESTIMATION UNDER SPARSE HIGH DIMENSIONAL MODELS [J].
Battey, Heather ;
Fan, Jianqing ;
Liu, Han ;
Lu, Junwei ;
Zhu, Ziwei .
ANNALS OF STATISTICS, 2018, 46 (03) :1352-1382
[3]   QUANTILE REGRESSION UNDER MEMORY CONSTRAINT [J].
Chen, Xi ;
Liu, Weidong ;
Zhang, Yichen .
ANNALS OF STATISTICS, 2019, 47 (06) :3244-3273
[4]   NONPARAMETRIC MODAL REGRESSION [J].
Chen, Yen-Chi ;
Genovese, Christopher R. ;
Tibshirani, Ryan J. ;
Wasserman, Larry .
ANNALS OF STATISTICS, 2016, 44 (02) :489-514
[5]  
Duchi J., 2014, ARXIV
[6]   Communication-Efficient Accurate Statistical Estimation [J].
Fan, Jianqing ;
Guo, Yongyi ;
Wang, Kaizheng .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) :1000-1010
[7]   DISTRIBUTED ESTIMATION OF PRINCIPAL EIGENSPACES [J].
Fan, Jianqing ;
Wang, Dong ;
Wang, Kaizheng ;
Zhu, Ziwei .
ANNALS OF STATISTICS, 2019, 47 (06) :3009-3031
[8]  
Feng YL, 2020, J MACH LEARN RES, V21
[9]  
Huber P.J., 1981, Wiley Series in Probability and Mathematical Statistics
[10]   Communication-Efficient Distributed Statistical Inference [J].
Jordan, Michael I. ;
Lee, Jason D. ;
Yang, Yun .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (526) :668-681