Probabilistic index: an intuitive non-parametric approach to measuring the size of treatment effects

被引:188
作者
Acion, L [1 ]
Peterson, JJ
Temple, S
Arndt, S
机构
[1] Univ Iowa, Coll Med, MEB Psychiat Res 1 192, Dept Psychiat, Iowa City, IA 52242 USA
[2] Univ Iowa, Coll Publ Hlth, Dept Biostat, Iowa City, IA 52242 USA
[3] GlaxoSmithKline, Dept Stat Sci, King Of Prussia, PA USA
关键词
effect size; P(X > Y); area under the curve; Mann-Whitney U statistic; Kendall's iota;
D O I
10.1002/sim.2256
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Effect sizes (ES) tell the magnitude of the difference between treatments and, ideally, should tell clinicians how likely their patients will benefit from the treatment. Currently used ES are expressed in statistical rather than in clinically useful terms and may not give clinicians the appropriate information. We restrict our discussion to studies with two groups: one with it patients receiving a new treatment and the other with m patients receiving the usual or no treatment. The standardized mean difference (e.g. Cohen's d) is a well-known index for continuous outcomes. There is some intuitive value to d, but measuring improvement in standard deviations (SD) is a statistical concept that may not help a clinician. How much improvement is a half SD? A more intuitive and simple-to-calculate ES is the probability that the response of a patient given the new treatment (X) is better than the one for a randomly chosen patient given the old or no treatment (Y) (i.e. P(X > Y), larger values meaning better outcomes). This probability has an immediate identity with the area under the curve (AUC) measure in procedures for receiver operator characteristic (ROC) curve comparing responses to two treatments. It also can be easily calculated from the Mann-Whitney U, Wilcoxon, or Kendall tau statistics. We describe the characteristics of an ideal ES. We propose P(X > Y) as an alternative index, summarize its correspondence with well-known non-parametric statistics, compare it to the standardized mean difference index, and illustrate with clinical data. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:591 / 602
页数:12
相关论文
共 36 条
  • [1] The revised CONSORT statement for reporting randomized trials: Explanation and elaboration
    Altman, DG
    Schulz, KF
    Moher, D
    Egger, M
    Davidoff, F
    Elbourne, D
    Gotzsche, PC
    Lang, T
    [J]. ANNALS OF INTERNAL MEDICINE, 2001, 134 (08) : 663 - 694
  • [2] Brunner E., 2001, NONPARAMETRIC ANAL L
  • [3] ESTIMATION OF RELIABILITY FROM STRESS-STRENGTH RELATIONSHIPS
    CHURCH, JD
    HARRIS, B
    [J]. TECHNOMETRICS, 1970, 12 (01) : 49 - &
  • [4] Cohen J., 1988, STAT POWER ANAL BEHA
  • [5] THE NUMBER NEEDED TO TREAT - A CLINICALLY USEFUL MEASURE OF TREATMENT EFFECT
    COOK, RJ
    SACKETT, DL
    [J]. BRITISH MEDICAL JOURNAL, 1995, 310 (6977) : 452 - 454
  • [6] Effect size estimates: Issues and problems in interpretation
    Fern, EF
    Monroe, KB
    [J]. JOURNAL OF CONSUMER RESEARCH, 1996, 23 (02) : 89 - 105
  • [7] Editors can lead researchers to confidence intervals, but can't make them think - Statistical reform lessons from medicine
    Fidler, F
    Thomason, N
    Cumming, G
    Finch, S
    Leeman, J
    [J]. PSYCHOLOGICAL SCIENCE, 2004, 15 (02) : 119 - 126
  • [8] PROBABILITY OF THE SUPERIOR OUTCOME OF ONE TREATMENT OVER ANOTHER
    GRISSOM, RJ
    [J]. JOURNAL OF APPLIED PSYCHOLOGY, 1994, 79 (02) : 314 - 316
  • [9] THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE
    HANLEY, JA
    MCNEIL, BJ
    [J]. RADIOLOGY, 1982, 143 (01) : 29 - 36
  • [10] Hauck WW, 2000, STAT MED, V19, P887, DOI 10.1002/(SICI)1097-0258(20000415)19:7<887::AID-SIM388>3.3.CO