Robust Integrative Analysis via Quantile Regression with and

被引:1
作者
Zeng, Hao [1 ]
Wan, Chuang [2 ]
Zhong, Wei [3 ,4 ]
Liu, Tuo [3 ,4 ]
机构
[1] Xiamen Univ, Paula & Gregory Chow Inst Studies Econ, Xiamen 361005, Fujian, Peoples R China
[2] Jinan Univ, Sch Econ, Dept Stat, Guangzhou 510632, Peoples R China
[3] Xiamen Univ, WISE, MOE Key Lab Econometr, Xiamen 361005, Fujian, Peoples R China
[4] Xiamen Univ, Dept Stat & Data Sci SOE, Xiamen 361005, Fujian, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金; 国家重点研发计划;
关键词
Homogeneity; Integrative analysis; Quantile regression; Robustness; Sparsity; NONCONCAVE PENALIZED LIKELIHOOD; TUNING PARAMETER SELECTION; CLIPPED ABSOLUTE DEVIATION; VARIABLE SELECTION; DIVERGING NUMBER; MODEL SELECTION; CRITERION; LASSO;
D O I
10.1016/j.jspi.2024.106196
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Integrative analysis plays a critical role in integrating heterogeneous data from multiple datasets to provide a comprehensive view of the overall data features. However, in multiple datasets, outliers and heavy -tailed data can render least squares estimation unreliable. In response, we propose a Robust Integrative Analysis via Quantile Regression (RIAQ) that accounts for homogeneity and sparsity in multiple datasets. The RIAQ approach is not only able to identify latent homogeneous coefficient structures but also recover the sparsity of high -dimensional covariates via double penalty terms. The integration of sample information across multiple datasets improves estimation efficiency, while a sparse model improves model interpretability. Furthermore, quantile regression allows the detection of subgroup structures under different quantile levels, providing a comprehensive picture of the relationship between response and high -dimensional covariates. We develop an efficient alternating direction method of multipliers (ADMM) algorithm to solve the optimization problem and study its convergence. We also derive the parameter selection consistency of the modified Bayesian information criterion. Numerical studies demonstrate that our proposed estimator has satisfactory finite -sample performance, especially in heavy -tailed cases.
引用
收藏
页数:19
相关论文
共 51 条
[31]   Augmented Lagrangian alternating direction method for matrix separation based on low-rank factorization [J].
Shen, Y. ;
Wen, Z. ;
Zhang, Y. .
OPTIMIZATION METHODS & SOFTWARE, 2014, 29 (02) :239-263
[32]  
Singh A., 2014, International Journal of Science and Research, V3, P866
[33]   An integrative sparse boosting analysis of cancer genomic commonality and difference [J].
Sun, Yifan ;
Sun, Zhengyang ;
Jiang, Yu ;
Li, Yang ;
Ma, Shuangge .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2020, 29 (05) :1325-1337
[34]   Poststratification fusion learning in longitudinal data analysis [J].
Tang, Lu ;
Song, Peter X. -K. .
BIOMETRICS, 2021, 77 (03) :914-928
[35]  
Tang L, 2016, J MACH LEARN RES, V17
[36]   Individualized Multidirectional Variable Selection [J].
Tang, Xiwei ;
Xue, Fei ;
Qu, Annie .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) :1280-1296
[37]   THE IMPACT OF CRIME ON COMMUNITIES [J].
TAYLOR, RB .
ANNALS OF THE AMERICAN ACADEMY OF POLITICAL AND SOCIAL SCIENCE, 1995, 539 :28-45
[38]   Unified LASSO estimation by least squares approximation [J].
Wang, Hansheng ;
Leng, Chenlei .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2007, 102 (479) :1039-1048
[39]   Tuning parameter selectors for the smoothly clipped absolute deviation method [J].
Wang, Hansheng ;
Li, Runze ;
Tsai, Chih-Ling .
BIOMETRIKA, 2007, 94 (03) :553-568
[40]   Shrinkage tuning parameter selection with a diverging number of parameters [J].
Wang, Hansheng ;
Li, Bo ;
Leng, Chenlei .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2009, 71 :671-683