Consequences of dichotomization

被引:187
作者
Fedorov, Valerii [1 ]
Mannino, Frank [1 ]
Zhang, Rongmei [1 ,2 ]
机构
[1] GlaxoSmithKline, Stat Res Unit, Collegeville, PA 19426 USA
[2] Univ Penn, Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
关键词
dichotomization; categorization; grouping;
D O I
10.1002/pst.331
中图分类号
R9 [药学];
学科分类号
1007 ;
摘要
Dichotomization is the transformation of a continuous outcome (response) to a binary outcome. This approach, while somewhat common, is harmful from the viewpoint of statistical estimation and hypothesis testing. We show that this leads to loss of information, which can he large. For normally distributed data, this loss in terms of Fisher's information is at least 1 - 2/pi (or 36%). lit other words, 100 continuous observations are statistically, equivalent to 158 dichotomized observations. The amount of information lost depends A greatly on the prior choice of cut points, with the optimal cut point depending upon the unknown parameters. The loss of information leads to loss of power or conversely a sample size increase to maintain power. Only in certain cases, for instance, in estimating a value of the cumulative distribution function and when the assumed model is very different from the true model, can the use of dichotomized outcomes he considered a reasonable approach. Copyright (C) 2008 John Wiley & Sons, Ltd.
引用
收藏
页码:50 / 61
页数:12
相关论文
共 24 条
  • [1] Altman DG, 2000, STAT MED, V19, P3275, DOI 10.1002/1097-0258(20001215)19:23<3275::AID-SIM626>3.3.CO
  • [2] 2-D
  • [3] [Anonymous], 1993, Continuous Univariate Distributions, DOI DOI 10.1016/0167-9473(96)90015-8
  • [4] [Anonymous], 1995, Continuous Univariate Distributions
  • [5] Multimarker strategy for risk prediction in patients presenting with acute dyspnea to the emergency department
    Christ, Michael
    Laule, Kirsten
    Klima, Theresia
    Hochholzer, Willibald
    Breidthardt, Tobias
    Perruchoud, Andre P.
    Mueller, Christian
    [J]. INTERNATIONAL JOURNAL OF CARDIOLOGY, 2008, 126 (01) : 73 - 78
  • [6] NOTE ON GROUPING
    COX, DR
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1957, 52 (280) : 543 - 547
  • [7] Farrington D., 2000, Criminal Behaviour and Mental Health, V10, P100, DOI [DOI 10.1002/CBM.349, 10.1002/cbm.349]
  • [8] Generalized probit model in design of dose finding experiments
    Fedorov, Valerii V.
    Wu, Yuehui
    [J]. MODA 8 - ADVANCES IN MODEL-ORIENTED DESIGN AND ANALYSIS, 2007, : 67 - +
  • [9] Haitovsky Yoel., 1973, REGRESSION ESTIMATIO
  • [10] HEITJAN DF, 1989, STAT SCI, V4, P164, DOI DOI 10.1214/SS/1177012601