Variable selection in measurement error models

被引:57
作者
Ma, Yanyuan [1 ]
Li, Runze [2 ,3 ]
机构
[1] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
[2] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[3] Penn State Univ, Methodol Ctr, University Pk, PA 16802 USA
关键词
errors in variables; estimating equations; measurement error models; non-concave penalty function; SCAD; semi-parametric methods; NONCONCAVE PENALIZED LIKELIHOOD; FUNCTIONAL-MEASUREMENT ERROR; SEMIPARAMETRIC ESTIMATORS; DIVERGING NUMBER; OPTIMAL RATES; CONVERGENCE; INFERENCE;
D O I
10.3150/09-BEJ205
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Measurement error data or errors-in-variable data have been collected in many studies. Natural criterion functions are often unavailable for general functional measurement error models due to the lack of information on the distribution of the unobservable covariates. Typically, the parameter estimation is via solving estimating equations. In addition, the construction of such estimating equations routinely requires solving integral equations, hence the computation is often much more intensive compared with ordinary regression models. Because of these difficulties, traditional best subset variable selection procedures are not applicable, and in the measurement error model context, variable selection remains an unsolved issue. In this paper, we develop a framework for variable selection in measurement error models via penalized estimating eqUations. We first propose a class of selection procedures for general parametric measurement error models and for general semi-pararnetric measurement error models. and study the asymptotic properties of the proposed procedures. Then, under certain regularity conditions and with a properly chosen regularization parameter, we demonstrate that the proposed procedure performs as well as an oracle procedure. We assess the finite sample performance via Monte Carlo simulation studies and illustrate the proposed methodology through the empirical analysis of a familiar data set.
引用
收藏
页码:274 / 300
页数:27
相关论文
共 50 条
  • [31] VARIABLE SELECTION FOR HIGH-DIMENSIONAL GENERALIZED VARYING-COEFFICIENT MODELS
    Lian, Heng
    STATISTICA SINICA, 2012, 22 (04) : 1563 - 1588
  • [32] Gaussian Latent Variable Models for Variable Selection
    Jiang, Xiubao
    You, Xinge
    Mou, Yi
    Yu, Shujian
    Zeng, Wu
    2014 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2014, : 353 - 357
  • [33] Instrumental variable estimation of nonlinear models with nonclassical measurement error using control variables
    Hahn, Jinyong
    Ridder, Geert
    JOURNAL OF ECONOMETRICS, 2017, 200 (02) : 238 - 250
  • [34] Estimation of the average treatment effect with variable selection and measurement error simultaneously addressed for potential confounders
    Yi, Grace Y.
    Chen, Li-Pang
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2023, 32 (04) : 691 - 711
  • [35] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02): : 389 - 407
  • [36] Variable Selection for Fixed Effects Varying Coefficient Models
    Li, Gao Rong
    Lian, Heng
    Lai, Peng
    Peng, Heng
    ACTA MATHEMATICA SINICA-ENGLISH SERIES, 2015, 31 (01) : 91 - 110
  • [37] Robust variable selection for finite mixture regression models
    Tang, Qingguo
    Karunamuni, R. J.
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2018, 70 (03) : 489 - 521
  • [38] High Dimensional Variable Selection with Error Control
    Kim, Sangjin
    Halabi, Susan
    BIOMED RESEARCH INTERNATIONAL, 2016, 2016
  • [39] Semiparametric estimation for measurement error models with validation data
    Xu, Yuhang
    Kim, Jae Kwang
    Li, Yehua
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (02): : 185 - 201
  • [40] Component selection and variable selection for mixture regression models
    Qi, Xuefei
    Xu, Xingbai
    Feng, Zhenghui
    Peng, Heng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 206