Correcting for Multiple Testing During Diagnostic Accuracy Studies

被引:3
作者
Bullen, Jennifer A. [1 ]
Obuchowski, Nancy A. [1 ]
机构
[1] Cleveland Clin Fdn, Quantitat Hlth Sci, 9500 Euclid Ave, Cleveland, OH 44195 USA
关键词
Correlated endpoints; Gatekeeping; Receiver operating characteristic curve; Sensitivity; Specificity; CLINICAL-TRIALS;
D O I
10.1080/19466315.2017.1280413
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
During diagnostic accuracy studies, hypotheses are often tested on more than one accuracy measure. One common example is to make inferences both on the area under the receiver operating characteristic curve (AUC) and on the sensitivity and specificity at a particular operating point. This represents a unique multiple testing situation as the AUC is a function of the sensitivity and specificity. Through a large simulation study, we showed that a naive approach of testing all three accuracy measures at a 0.05-level greatly elevates the family-wise Type I error rate (i.e., up to 10%). When a gatekeeping approach to controlling the FWER was appropriate (family 1: AUC, family 2: sensitivity and specificity), adjustment in the second family was necessary to control the FWER across all scenarios. When a nonhierarchical approach was appropriate, the methods of Xie (2012) offered increased power over Holm's step-down procedure, but required assumptions about the correlation among the test statistics. In our simulations, the correlation between the test statistics of the AUC and sensitivity and between the test statistics of the AUC and specificity ranged from 0.3 to 0.6, while there was no correlation between the test statistics of the sensitivity and specificity.
引用
收藏
页码:243 / 248
页数:6
相关论文
共 12 条
[1]  
[Anonymous], 2007, Statistical guidance on reporting results from studies evaluating diagnostic tests
[2]  
[Anonymous], 2011, STAT METHODS DIAGNOS, DOI DOI 10.1002/9780470906514
[3]   Effect of Computer-aided Detection for CT Colonography in a Multireader, Multicase Trial [J].
Dachman, Abraham H. ;
Obuchowski, Nancy A. ;
Hoffmeister, Jeffrey W. ;
Hinshaw, J. Louis ;
Frew, Michael I. ;
Winter, Thomas C. ;
Van Uitert, Robert L. ;
Periaswamy, Senthil ;
Summers, Ronald M. ;
Hillman, Bruce J. .
RADIOLOGY, 2010, 256 (03) :827-835
[4]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[5]   Gatekeeping strategies for clinical trials that do not require all primary effects to be significant [J].
Dmitrienko, A ;
Offen, WW ;
Westfall, PH .
STATISTICS IN MEDICINE, 2003, 22 (15) :2387-2400
[6]   Multistage and Mixture Parallel Gatekeeping Procedures in Clinical Trials [J].
Dmitrienko, Alex ;
Kordzakhia, George ;
Tamhane, Ajit C. .
JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2011, 21 (04) :726-747
[7]   Diagnostic Accuracy of CT Enterography for Active Inflammatory Terminal Ileal Crohn Disease: Comparison of Full-Dose and Half-Dose Images Reconstructed with FBP and Half-Dose Images with SAFIRE [J].
Gandhi, Namita S. ;
Baker, Mark E. ;
Goenka, Ajit H. ;
Bullen, Jennifer A. ;
Obuchowski, Nancy A. ;
Remer, Erick M. ;
Coppa, Christopher P. ;
Einstein, David ;
Feldman, Myra K. ;
Kanmaniraja, Devaraju ;
Purysko, Andrei S. ;
Vahdat, Noushin ;
Primak, Andrew N. ;
Karim, Wadih ;
Herts, Brian R. .
RADIOLOGY, 2016, 280 (02) :436-445
[8]  
HOLM S, 1979, SCAND J STAT, V6, P65
[9]   Overview of multiple testing methodology and recent development in clinical trials [J].
Wang, Deli ;
Li, Yihan ;
Wang, Xin ;
Liu, Xuan ;
Fu, Bo ;
Lin, Yunzhi ;
Larsen, Lois ;
Offen, Walter .
CONTEMPORARY CLINICAL TRIALS, 2015, 45 :13-20
[10]  
Westfall P.H., 1993, RESAMPLING BASED MUL