Two-Stage Online Debiased Lasso Estimation and Inference for High-Dimensional Quantile Regression with Streaming Data

被引:4
作者
Peng, Yanjin [1 ]
Wang, Lei [1 ]
机构
[1] KLMDASR LEBPS & LPMC, Sch Stat & Data Sci, Tianjin 300071, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive tuning; asymptotic normality; debiased lasso; online updating; quantile regression; CONFIDENCE-INTERVALS; VARIABLE SELECTION; LIKELIHOOD; PARAMETERS; REGIONS; TESTS;
D O I
10.1007/s11424-023-3014-y
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In this paper, the authors propose a two-stage online debiased lasso estimation and statistical inference method for high-dimensional quantile regression (QR) models in the presence of streaming data. In the first stage, the authors modify the QR score function based on kernel smoothing and obtain the online lasso smoothed QR estimator through iterative algorithms. The estimation process only involves the current data batch and specific historical summary statistics, which perfectly accommodates to the special structure of streaming data. In the second stage, an online debiasing procedure is carried out to eliminate biases caused by the lasso penalty as well as the accumulative approximation error so that the asymptotic normality of the resulting estimator can be established. The authors conduct extensive numerical experiments to evaluate the performance of the proposed method. These experiments demonstrate the effectiveness of the proposed method and support the theoretical results. An application to the Beijing PM2.5 Dataset is also presented.
引用
收藏
页码:1251 / 1270
页数:20
相关论文
共 50 条
[21]   Oracle Estimation of a Change Point in High-Dimensional Quantile Regression [J].
Lee, Sokbae ;
Liao, Yuan ;
Seo, Myung Hwan ;
Shin, Youngki .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (523) :1184-1194
[22]   Communication-efficient estimation and inference for high-dimensional longitudinal data [J].
Li, Xing ;
Peng, Yanjing ;
Wang, Lei .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 208
[23]   HIGH-DIMENSIONAL GAUSSIAN COPULA REGRESSION: ADAPTIVE ESTIMATION AND STATISTICAL INFERENCE [J].
Cai, T. Tony ;
Zhang, Linjun .
STATISTICA SINICA, 2018, 28 (02) :963-993
[24]   Improved two-stage model averaging for high-dimensional linear regression, with application to Riboflavin data analysis [J].
Juming Pan .
BMC Bioinformatics, 22
[25]   Improved two-stage model averaging for high-dimensional linear regression, with application to Riboflavin data analysis [J].
Pan, Juming .
BMC BIOINFORMATICS, 2021, 22 (01)
[26]   A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification [J].
Algamal, Zakariya Yahya ;
Lee, Muhammad Hisyam .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) :753-771
[27]   Variable Selection via SCAD-Penalized Quantile Regression for High-Dimensional Count Data [J].
Khan, Dost Muhammad ;
Yaqoob, Anum ;
Iqbal, Nadeem ;
Wahid, Abdul ;
Khalil, Umair ;
Khan, Mukhtaj ;
Abd Rahman, Mohd Amiruddin ;
Mustafa, Mohd Shafie ;
Khan, Zardad .
IEEE ACCESS, 2019, 7 :153205-153216
[28]   A global two-stage algorithm for non-convex penalized high-dimensional linear regression problems [J].
Li, Peili ;
Liu, Min ;
Yu, Zhou .
COMPUTATIONAL STATISTICS, 2023, 38 (02) :871-898
[29]   Nonnegative adaptive lasso for ultra-high dimensional regression models and a two-stage method applied in financial modeling [J].
Yang, Yuehan ;
Wu, Lan .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 174 :52-67
[30]   High-dimensional Mixed Graphical Model with Ordinal Data: Parameter Estimation and Statistical Inference [J].
Feng, Huijie ;
Ning, Yang .
22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89 :654-663