A weak-signal-assisted procedure for variable selection and statistical inference with an informative subsample

被引:7
作者
Fang, Fang [1 ]
Zhao, Jiwei [2 ]
Ahmed, S. Ejaz [3 ]
Qu, Annie [4 ]
机构
[1] East China Normal Univ, Sch Stat, Key Lab Adv Theory & Applicat Stat & Data Sci MOE, Shanghai, Peoples R China
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[3] Brock Univ, Fac Math & Sci, St Catharines, ON, Canada
[4] Univ Calif Irvine, Dept Stat, Irvine, CA 92697 USA
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会; 中国国家自然科学基金;
关键词
informative subsample; pairwise conditional likelihood; penalization; post-selection inference; variable selection; weak signal; CONFIDENCE-INTERVALS; MISSING DATA; LIKELIHOOD; REGIONS; TESTS;
D O I
10.1111/biom.13346
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
This paper is motivated from an HIV-1 drug resistance study where we encounter three analytical challenges: to analyze data with an informative subsample, to take into account the weak signals, and to detect important signals and also conduct statistical inference. We start with an initial estimation method, which adopts a penalized pairwise conditional likelihood approach for variable selection. This initial estimator incorporates the informative subsample issue. To accounting for the effect of weak signals, we use a key idea of partial ridge regression. We also propose a one-step estimation method for each of the signal coefficients and then construct confidence intervals accordingly. We apply the proposed method to the Stanford HIV-1 drug resistance study and compare the results with existing approaches. We also conduct comprehensive simulation studies to demonstrate the superior performance of our proposed method.
引用
收藏
页码:996 / 1010
页数:15
相关论文
共 36 条
[1]   Missing Data on the Estimation of the Prevalence of Accumulated Human Immunodeficiency Virus Drug Resistance in Patients Treated With Antiretroviral Drugs in North America [J].
Abraham, Alison G. ;
Lau, Bryan ;
Deeks, Steven ;
Moore, Richard D. ;
Zhang, Jinbing ;
Eron, Joseph ;
Harrigan, Richard ;
Gill, M. John ;
Kitahata, Mari ;
Klein, Marina ;
Napravnik, Sonia ;
Rachlis, Anita ;
Rodriguez, Benigno ;
Rourke, Sean ;
Benson, Constance ;
Bosch, Ron ;
Collier, Ann ;
Gebo, Kelly ;
Goedert, James ;
Hogg, Robert ;
Horberg, Michael ;
Jacobson, Lisa ;
Justice, Amy ;
Kirk, Greg ;
Martin, Jeff ;
McKaig, Rosemary ;
Silverberg, Michael ;
Sterling, Timothy ;
Thorne, Jennifer ;
Willig, James ;
Gange, Stephen J. .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 174 (06) :727-735
[2]   Immunogenicity, Immunologic Memory, and Safety Following Measles Revaccination in HIV-Infected Children Receiving Highly Active Antiretroviral Therapy [J].
Abzug, Mark J. ;
Qin, Min ;
Levin, Myron J. ;
Fenton, Terence ;
Beeler, Judy A. ;
Bellini, William J. ;
Audet, Susette ;
Sowers, Sun Bae ;
Borkowsky, William ;
Nachman, Sharon A. ;
Pelton, Stephen I. ;
Rosenblatt, Howard M. .
JOURNAL OF INFECTIOUS DISEASES, 2012, 206 (04) :512-522
[3]  
[Anonymous], 2009, THESIS
[4]   ONE-STEP HUBER ESTIMATES IN LINEAR-MODEL [J].
BICKEL, PJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (350) :428-434
[5]   A Direct Estimation Approach to Sparse Linear Discriminant Analysis [J].
Cai, Tony ;
Liu, Weidong .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (496) :1566-1577
[6]  
Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523
[7]  
Clavel F, 2004, NEW ENGL J MED, V350, P1023, DOI 10.1056/NEJM2ra025195
[8]   Vaccination in HIV-Infected Adults [J].
Crum-Cianflone, Nancy F. ;
Wallace, Mark R. .
AIDS PATIENT CARE AND STDS, 2014, 28 (08) :397-410
[9]   Higher criticism thresholding: Optimal feature selection when useful features are rare and weak [J].
Donoho, David ;
Jin, Jiashun .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (39) :14790-14795
[10]   Estimation and Accuracy After Model Selection [J].
Efron, Bradley .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (507) :991-1007