Hard-thresholding regularization method for high-dimensional heterogeneous models

被引:0
作者
Wu, Yue [1 ]
Zheng, Zemin [1 ]
机构
[1] Univ Sci & Technol China, Int Inst Finance, Sch Management, Hefei 230026, Peoples R China
基金
国家重点研发计划;
关键词
High dimensionality; hard-thresholding; subgroup analysis; prediction and variable selection; oracle property; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; SUBGROUP ANALYSIS; REGRESSION; SHRINKAGE; INFERENCE; LASSO;
D O I
10.1080/00949655.2025.2461648
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Heterogeneity is often natural in many contemporary applications involving massive data, as exemplified by domains such as personalized education and tailored pricing strategy. In this article, we propose a new methodology for subgroup analysis via a unified penalization approach, which combines the concave pairwise fusion penalty and an additional hard-thresholding penalty to concurrently estimate the heterogeneous effects and coefficients for high-dimensional covariates. Furthermore, we show that the oracle least squares estimator based on a prior knowledge of the true grouping structure and the index set of nonzero true coefficients enjoys appealing statistical properties including oracle inequalities under various prediction and estimation losses. Moreover, we provide theoretical guarantees for the proposed approach by establishing the joint estimation error bound for both heterogeneous effects and coefficients of high-dimensional covariates. The practical utility of our proposed method is illustrated through simulation and real data examples.
引用
收藏
页码:1538 / 1555
页数:18
相关论文
共 28 条
[1]   Online Decision Making with High-Dimensional Covariates [J].
Bastani, Hamsa ;
Bayati, Mohsen .
OPERATIONS RESEARCH, 2020, 68 (01) :276-294
[2]   PROGRAM EVALUATION AND CAUSAL INFERENCE WITH HIGH-DIMENSIONAL DATA [J].
Belloni, A. ;
Chernozhukov, V. ;
Fernandez-Val, I. ;
Hansen, C. .
ECONOMETRICA, 2017, 85 (01) :233-298
[3]   Square-root lasso: pivotal recovery of sparse signals via conic programming [J].
Belloni, A. ;
Chernozhukov, V. ;
Wang, L. .
BIOMETRIKA, 2011, 98 (04) :791-806
[4]   SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR [J].
Bickel, Peter J. ;
Ritov, Ya'acov ;
Tsybakov, Alexandre B. .
ANNALS OF STATISTICS, 2009, 37 (04) :1705-1732
[5]   Benchmark for filter methods for feature selection in high-dimensional classification data [J].
Bommert, Andrea ;
Sun, Xudong ;
Bischl, Bernd ;
Rahnenfuehrer, Joerg ;
Lang, Michel .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2020, 143
[6]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[7]   Robust analysis of cancer heterogeneity for high-dimensional data [J].
Cheng, Chao ;
Feng, Xingdong ;
Li, Xiaoguang ;
Wu, Mengyun .
STATISTICS IN MEDICINE, 2022, 41 (27) :5448-5462
[8]   Double/debiased machine learning for treatment and structural parameters [J].
Chernozhukov, Victor ;
Chetverikov, Denis ;
Demirer, Mert ;
Duflo, Esther ;
Hansen, Christian ;
Newey, Whitney ;
Robins, James .
ECONOMETRICS JOURNAL, 2018, 21 (01) :C1-C68
[9]   Nonconcave Penalized Likelihood With NP-Dimensionality [J].
Fan, Jianqing ;
Lv, Jinchi .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (08) :5467-5484
[10]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360