Robust Integrative Analysis via Quantile Regression with and

被引:1
作者
Zeng, Hao [1 ]
Wan, Chuang [2 ]
Zhong, Wei [3 ,4 ]
Liu, Tuo [3 ,4 ]
机构
[1] Xiamen Univ, Paula & Gregory Chow Inst Studies Econ, Xiamen 361005, Fujian, Peoples R China
[2] Jinan Univ, Sch Econ, Dept Stat, Guangzhou 510632, Peoples R China
[3] Xiamen Univ, WISE, MOE Key Lab Econometr, Xiamen 361005, Fujian, Peoples R China
[4] Xiamen Univ, Dept Stat & Data Sci SOE, Xiamen 361005, Fujian, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金; 国家重点研发计划;
关键词
Homogeneity; Integrative analysis; Quantile regression; Robustness; Sparsity; NONCONCAVE PENALIZED LIKELIHOOD; TUNING PARAMETER SELECTION; CLIPPED ABSOLUTE DEVIATION; VARIABLE SELECTION; DIVERGING NUMBER; MODEL SELECTION; CRITERION; LASSO;
D O I
10.1016/j.jspi.2024.106196
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Integrative analysis plays a critical role in integrating heterogeneous data from multiple datasets to provide a comprehensive view of the overall data features. However, in multiple datasets, outliers and heavy -tailed data can render least squares estimation unreliable. In response, we propose a Robust Integrative Analysis via Quantile Regression (RIAQ) that accounts for homogeneity and sparsity in multiple datasets. The RIAQ approach is not only able to identify latent homogeneous coefficient structures but also recover the sparsity of high -dimensional covariates via double penalty terms. The integration of sample information across multiple datasets improves estimation efficiency, while a sparse model improves model interpretability. Furthermore, quantile regression allows the detection of subgroup structures under different quantile levels, providing a comprehensive picture of the relationship between response and high -dimensional covariates. We develop an efficient alternating direction method of multipliers (ADMM) algorithm to solve the optimization problem and study its convergence. We also derive the parameter selection consistency of the modified Bayesian information criterion. Numerical studies demonstrate that our proposed estimator has satisfactory finite -sample performance, especially in heavy -tailed cases.
引用
收藏
页数:19
相关论文
共 51 条
[1]   Individual and contextual influences on delinquency: the role of the single-parent family [J].
Anderson, AL .
JOURNAL OF CRIMINAL JUSTICE, 2002, 30 (06) :575-587
[2]   l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS [J].
Belloni, Alexandre ;
Chernozhukov, Victor .
ANNALS OF STATISTICS, 2011, 39 (01) :82-130
[3]   Distributed optimization and statistical learning via the alternating direction method of multipliers [J].
Boyd S. ;
Parikh N. ;
Chu E. ;
Peleato B. ;
Eckstein J. .
Foundations and Trends in Machine Learning, 2010, 3 (01) :1-122
[4]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[5]   The joint lasso: high-dimensional regression for group structured data [J].
Dondelinger, Frank ;
Mukherjee, Sach .
BIOSTATISTICS, 2020, 21 (02) :219-235
[6]  
DONNELLY PG, 1989, SOCIOL FOCUS, V22, P69
[7]   Sure independence screening for ultrahigh dimensional feature space [J].
Fan, Jianqing ;
Lv, Jinchi .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2008, 70 :849-883
[8]   ADAPTIVE ROBUST VARIABLE SELECTION [J].
Fan, Jianqing ;
Fan, Yingying ;
Barut, Emre .
ANNALS OF STATISTICS, 2014, 42 (01) :324-351
[9]   Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[10]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360