When does combining markers improve classification performance and what are implications for practice?

被引:28
作者
Bansal, Aasthaa [1 ]
Pepe, Margaret Sullivan [2 ]
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
基金
美国国家卫生研究院;
关键词
diagnosis; marker combination; incremental value; ROC curve; sensitivity; specificity; CANCER; BIOMARKERS; TESTS;
D O I
10.1002/sim.5736
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
When an existing standard marker does not have sufficient classification accuracy on its own, new markers are sought with the goal of yielding a combination with better performance. The primary criterion for selecting new markers is that they have good performance on their own and preferably be uncorrelated with the standard. Most often linear combinations are considered. In this paper, we investigate the increment in performance that is possible by combining a novel continuous marker with a moderately performing standard continuous marker under a variety of biologically motivated models for their joint distribution. We find that an uncorrelated continuous marker with moderate performance on its own usually yields only minimally improved performance. We identify other settings that lead to large improvements, including a novel marker that has very poor performance on its own but is highly correlated with the standard and a novel marker with poor to moderate performance that is highly correlated with the standard but only in one class category. These results suggest changing current strategies for identifying markers to be included in panels for possible combination. Using simulated and real datasets, we examine the merits of a broadened strategy that selects panels of markers as candidates on the basis of their joint performance with existing markers, compared with the standard strategy that selects markers on the basis of their marginal performance. We find that a broadened strategy can be fruitful but necessitates using studies with large numbers of subjects. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:1877 / 1892
页数:16
相关论文
共 14 条
[1]   Assessing Lead Time of Selected Ovarian Cancer Biomarkers: A Nested Case-Control Study [J].
Anderson, Garnet L. ;
McIntosh, Martin ;
Wu, Lieling ;
Barnett, Matt ;
Goodman, Gary ;
Thorpe, Jason D. ;
Bergan, Lindsay ;
Thornquist, Mark D. ;
Scholler, Nathalie ;
Kim, Nam ;
O'Briant, Kathy ;
Drescher, Charles ;
Urban, Nicole .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2010, 102 (01) :26-38
[2]   Longitudinal PSA changes in men with and without prostate cancer: Assessment of prostate cancer risk [J].
Berger, AP ;
Deibl, M ;
Steiner, H ;
Bektic, J ;
Pelzer, A ;
Spranger, R ;
Klocker, H ;
Bartsch, G ;
Horninger, W .
PROSTATE, 2005, 64 (03) :240-245
[3]   Discriminatory accuracy from single-nucleotide polymorphisms in models to predict breast cancer risk [J].
Gail, Mitchell H. .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2008, 100 (14) :1037-1041
[4]  
Guyon I., 2003, J MACH LEARN RES, V3, P1157
[5]  
Hosmer D.W., 1989, Applied Logistic regression
[6]   Combining several screening tests: Optimality of the risk score [J].
McIntosh, MW ;
Pepe, MS .
BIOMETRICS, 2002, 58 (03) :657-664
[7]  
McIntosh MW, 2002, CANCER EPIDEM BIOMAR, V11, P159
[8]   STATISTICAL SIGNIFICANCE TESTS FOR BINORMAL ROC CURVES [J].
METZ, CE ;
KRONMAN, HB .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1980, 22 (03) :218-243
[10]   Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker [J].
Pepe, MS ;
Janes, H ;
Longton, G ;
Leisenring, W ;
Newcomb, P .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2004, 159 (09) :882-890