Online debiased lasso estimation and inference for heterogenous updating regressions

被引:0
作者
Mi, Yajie [1 ,2 ]
Wang, Lei [1 ,2 ]
机构
[1] Nankai Univ, Sch Stat & Data Sci, KLMDASR, LEBPS, Tianjin 300071, Peoples R China
[2] Nankai Univ, LPMC, Tianjin 300071, Peoples R China
基金
中国国家自然科学基金;
关键词
Homogenization; Debiased lasso; Heterogenous regression; Online updating; CONFIDENCE-INTERVALS; TESTS;
D O I
10.1007/s42952-024-00278-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the era of big data, online updating problems have attracted extensive attention. In practice, the covariates set of the models may change according to the conditions of data streams. In this paper, we propose a two-stage online debiased lasso estimation and inference method for high-dimensional heterogenous linear regression models with new variables added midway. At the first stage, the homogenization strategy is conducted to represent the heterogenous models by defining the pseudo covariates and responses. At the second stage, we conduct the online debiased lasso estimation procedure to obtain the final estimator. Theoretically, the asymptotic normality of the heterogenous online debiased lasso estimator (HODL) is established. The finite-sample performance of the proposed estimators is studied through simulation studies and a real data example.
引用
收藏
页码:1049 / 1090
页数:42
相关论文
共 31 条
  • [1] Real-time sufficient dimension reduction through principal least squares support vector machines
    Artemiou, Andreas
    Dong, Yuexiao
    Shin, Seung Jun
    [J]. PATTERN RECOGNITION, 2021, 112
  • [2] Online Principal Component Analysis in High Dimension: Which Algorithm to Choose?
    Cardot, Herve
    Degras, David
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2018, 86 (01) : 29 - 50
  • [3] Influencing initial public offering investors with prestige: Signaling with board structures
    Certo, ST
    [J]. ACADEMY OF MANAGEMENT REVIEW, 2003, 28 (03) : 432 - 446
  • [4] QUANTILE REGRESSION UNDER MEMORY CONSTRAINT
    Chen, Xi
    Liu, Weidong
    Zhang, Yichen
    [J]. ANNALS OF STATISTICS, 2019, 47 (06) : 3244 - 3273
  • [5] An iterative thresholding algorithm for linear inverse problems with a sparsity constraint
    Daubechies, I
    Defrise, M
    De Mol, C
    [J]. COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2004, 57 (11) : 1413 - 1457
  • [6] Online Debiasing for Adaptively Collected High-Dimensional Data With Applications to Time Series Analysis
    Deshpande, Yash
    Javanmard, Adel
    Mehrabi, Mohammad
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1126 - 1139
  • [7] Profiting from business model innovation: Evidence from Pay-As-You-Drive auto insurance
    Desyllas, Panos
    Sako, Mari
    [J]. RESEARCH POLICY, 2013, 42 (01) : 101 - 116
  • [8] High-Dimensional Inference: Confidence Intervals, p-Values and R-Software hdi
    Dezeure, Ruben
    Buehlmann, Peter
    Meier, Lukas
    Meinshausen, Nicolai
    [J]. STATISTICAL SCIENCE, 2015, 30 (04) : 533 - 558
  • [9] Duchi J, 2011, J MACH LEARN RES, V12, P2121
  • [10] PATHWISE COORDINATE OPTIMIZATION
    Friedman, Jerome
    Hastie, Trevor
    Hoefling, Holger
    Tibshirani, Robert
    [J]. ANNALS OF APPLIED STATISTICS, 2007, 1 (02) : 302 - 332