Distributed smoothed rank regression with heterogeneous errors for massive data

被引:0
作者
Yuan, Xiaohui [1 ]
Zhang, Xinran [1 ]
Wang, Yue [1 ]
Wang, Chunjie [1 ]
机构
[1] Changchun Univ Technol, Sch Math & Stat, Changchun 130012, Jilin, Peoples R China
关键词
Heterogeneous error; Massive data; Variable selection; Weighted rank estimator; ALGORITHMS; INFERENCE; LASSO;
D O I
10.1007/s42952-023-00237-0
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Rank estimation methods are robust and highly efficient for estimating linear regression model. This paper investigates the rank regression estimation for massive data. To deal with the situation that the data are distributed heterogeneously in different blocks, we propose a weighted distributed rank-based estimator for massive data, which can improve the efficiency of the standard divide and conquer estimator. Under mild conditions, the asymptotic distributions of the weighted distributed rank-based estimator is derived. To achieve sparsity with high-dimensional covariates, the variable selection procedure is also proposed. Both simulations and data analysis are included to illustrate the finite sample performance of the proposed methods.
引用
收藏
页码:1078 / 1103
页数:26
相关论文
共 36 条
  • [1] [Anonymous], 2012, Hadoop: The definitive guide
  • [2] Balakrishnan S, 2008, J MACH LEARN RES, V9, P313
  • [3] Semi-parametric rank regression with missing responses
    Bindele, Huybrechts F.
    Abebe, Ash
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 142 : 117 - 132
  • [4] Standard errors and covariance matrices for smoothed rank estimators
    Brown, BM
    Wang, YG
    [J]. BIOMETRIKA, 2005, 92 (01) : 149 - 158
  • [5] Quantile regression in big data: A divide and conquer based strategy
    Chen, Lanjue
    Zhou, Yong
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 144
  • [6] A SPLIT-AND-CONQUER APPROACH FOR ANALYSIS OF EXTRAORDINARILY LARGE DATA
    Chen, Xueying
    Xie, Min-ge
    [J]. STATISTICA SINICA, 2014, 24 (04) : 1655 - 1684
  • [7] Sure independence screening for ultrahigh dimensional feature space
    Fan, Jianqing
    Lv, Jinchi
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 : 849 - 883
  • [8] SURE INDEPENDENCE SCREENING IN GENERALIZED LINEAR MODELS WITH NP-DIMENSIONALITY
    Fan, Jianqing
    Song, Rui
    [J]. ANNALS OF STATISTICS, 2010, 38 (06) : 3567 - 3604
  • [9] Fan JQ, 2009, J MACH LEARN RES, V10, P2013
  • [10] Distributed adaptive lasso penalized generalized linear models for big data
    Fan, Ye
    Fan, Suning
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (04) : 1679 - 1698