Evaluating technologies for classification and prediction in medicine

Cited by: 38

Authors:

Pepe, MS

Affiliations:

[1] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA

[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA

Source:

STATISTICS IN MEDICINE | 2005 / Vol. 24 / Issue 24

Keywords:

diagnostic test; receiver operating characteristic; odds ratio; disease screening; prognosis

DOI:

10.1002/sim.2431

Chinese Library Classification (CLC) number:

Q [Biological Sciences]

Discipline classification codes:

07; 0710; 09

Abstract:

Modern technologies promise to provide new ways of diagnosing disease, detecting subclinical disease, predicting prognosis, selecting patient-specific treatment, identifying subjects at risk for disease, and so forth. Advances in genomics, proteomics and imaging modalities in particular hold great potential for assisting with classification/prediction in medicine. Before a classifier can be adopted for routine use in health care, its classification accuracy must be determined. Standards for evaluating new clinical classifiers, however, lag far behind the well-established standards that exist for evaluating new clinical treatments. In this paper, we discuss a phased approach to developing a new classifier (or biomarker). It mirrors the internationally established phase 1-2-3 paradigm for therapeutic drugs. The defined phases lead to a logical sequence of studies for classifier development. We emphasize that evaluating classification accuracy is fundamentally different from simply establishing association with outcome. Therefore, study objectives and designs differ from the familiar methods of clinical trials. We discuss these briefly for each phase. Finally, we argue that classifier development requires some rethinking of traditional data analysis techniques. As an example, we show that maximizing the likelihood function to fit a logistic regression model to multiple predictors can yield a poor classifier. Instead, we demonstrate that an approach that maximizes an alternative objective function characterizing classification accuracy performs better. Copyright (c) 2005 John Wiley & Sons, Ltd.
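
The contrast drawn at the end of the abstract, between fitting by likelihood and fitting by an accuracy-based objective, can be illustrated with a small sketch. The abstract does not say which objective the paper maximizes; the example below assumes the empirical area under the ROC curve (AUC) as one plausible choice and searches over directions for a two-marker linear score. The function names `empirical_auc` and `best_angle_by_auc` are hypothetical and introduced only for illustration.

```python
import numpy as np


def empirical_auc(scores, labels):
    """Mann-Whitney estimate of the AUC.

    Fraction of (case, control) pairs in which the case scores higher
    than the control, with ties counted as one half.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    cases = scores[labels == 1]
    controls = scores[labels == 0]
    diff = cases[:, None] - controls[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def best_angle_by_auc(x, labels, n_grid=360):
    """Grid search for the linear score with the highest empirical AUC.

    Assumption: two predictors. The score is
    cos(t) * x[:, 0] + sin(t) * x[:, 1]; only the direction t matters
    because the AUC is unchanged by positive rescaling of the score.
    """
    x = np.asarray(x, dtype=float)
    angles = np.linspace(0.0, 2.0 * np.pi, n_grid, endpoint=False)
    aucs = [
        empirical_auc(x @ np.array([np.cos(t), np.sin(t)]), labels)
        for t in angles
    ]
    best = int(np.argmax(aucs))  # first maximizer if several tie
    return float(angles[best]), float(aucs[best])
```

To compare against a likelihood-based fit, the linear predictor from any logistic regression routine can be passed to `empirical_auc` and set beside the value returned by `best_angle_by_auc`. A grid search is used here only because the empirical AUC is a step function of the coefficients, so gradient-based optimizers do not apply directly.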


Pages: 3687-3696

Page count: 10

