A Bayesian mixture of experts approach to covariate misclassification

被引:4
作者
Xia, Michelle [1 ]
Hahn, P. Richard [2 ]
Gustafson, Paul [3 ]
机构
[1] Northern Illinois Univ, Dept Stat & Actuarial Sci, De Kalb, IL 60115 USA
[2] Arizona State Univ, Sch Math & Stat Sci, Tempe, AZ 85281 USA
[3] Univ British Columbia, Dept Stat, Vancouver, BC V6T 1Z4, Canada
来源
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE | 2020年 / 48卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
Bayesian inference; covariate misclassification; identifiability; Markov chain Monte Carlo; mixture of experts; SAMPLE-SIZE DETERMINATION; LIKELIHOOD METHODS; MEASUREMENT ERRORS; FINITE MIXTURES; MISSING DATA; MODELS; IDENTIFIABILITY; CONSISTENCY; BIAS; MLE;
D O I
10.1002/cjs.11560
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This-115 article considers misclassification of categorical covariates in the context of regression analysis; if unaccounted for, such errors usually result in mis-estimation of model parameters. With the presence of additional covariates, we exploit the fact that explicitly modelling non-differential misclassification with respect to the response leads to a mixture regression representation. Under the framework of mixture of experts, we enable the reclassification probabilities to vary with other covariates, a situation commonly caused by misclassification that is differential on certain covariates and/or by dependence between the misclassified and additional covariates. Using Bayesian inference, the mixture approach combines learning from data with external information on the magnitude of errors when it is available. In addition to proving the theoretical identifiability of the mixture of experts approach, we study the amount of efficiency loss resulting from covariate misclassification and the usefulness of external information in mitigating such loss. The method is applied to adjust for misclassification on self-reported cocaine use in the Longitudinal Studies of HIV-Associated Lung Infections and Complications.
引用
收藏
页码:731 / 750
页数:20
相关论文
共 41 条
  • [1] Clinical Course and Outcome of Cocaine-Induced Pneumomediastinum
    Alnas, Majd
    Altayeh, Abdullah
    Zaman, Mumtaz
    [J]. AMERICAN JOURNAL OF THE MEDICAL SCIENCES, 2010, 339 (01) : 65 - 67
  • [2] On the consistency of MLE in finite mixture models of exponential families
    Atienza, N.
    Garcia-Heras, J.
    Munoz-Pichardo, J. M.
    Villa-Caro, R.
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2007, 137 (02) : 496 - 505
  • [3] Bayesian sample size determination for binary regression with a misclassified covariate and no gold standard
    Beavers, Daniel P.
    Stamey, James D.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (08) : 2574 - 2582
  • [4] On the effect of misclassification on bias of perfectly measured covariates in regression
    Buonaccorsi, JP
    Laake, P
    Veierod, MB
    [J]. BIOMETRICS, 2005, 61 (03) : 831 - 836
  • [5] Crothers Kristina, 2011, Proc Am Thorac Soc, V8, P275, DOI 10.1513/pats.201009-059WR
  • [6] Davis P.J., 2007, METHODS NUMERICAL IN
  • [7] Spontaneous pneumothorax associated with talc pulmonary granulomatosis after cocaine inhalation
    Fiorelli, Alfonso
    Accardo, Marina
    Rossi, Francesco
    Santini, Mario
    [J]. GENERAL THORACIC AND CARDIOVASCULAR SURGERY, 2016, 64 (03) : 174 - 176
  • [8] Grün B, 2008, J STAT SOFTW, V28, P1
  • [9] On model expansion, model contraction, identifiability and prior information: Two illustrative scenarios involving mismeasured variables
    Gustafson, P
    [J]. STATISTICAL SCIENCE, 2005, 20 (02) : 111 - 129
  • [10] Gustafson P., 2015, HDB EPIDEMIOLOGY, P639