Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies

Cited by: 118
Authors
Mehta, Shraddha [1]
Bastero-Caballero, Rowena F. [1,2]
Sun, Yijun [1]
Zhu, Ray [1]
Murphy, Diane K. [1]
Hardas, Bhushan [1]
Koch, Gary [3]
Affiliations
[1] Allergan Plc, 2525 Dupont Dr, Irvine, CA 92612 USA
[2] Univ Maryland Baltimore Cty, 1000 Hilltop Circle, Baltimore, MD 21250 USA
[3] Univ North Carolina Chapel Hill, 135 Nottingham Dr, Chapel Hill, NC 27517 USA
Keywords
aesthetics; intra-class correlation; reliability; sample size; scales; subject distribution; VALIDATED GRADING SCALE; CONCORDANCE CORRELATION-COEFFICIENT; PHOTONUMERIC SCALE; ASSESSING AGREEMENT; CUTANEOUS PHOTODAMAGE; PSYCHIATRIC-DIAGNOSIS; PHOTOGRAPHIC SCALE; FOREHEAD LINES; WEIGHTED KAPPA; VOLUME DEFICIT;
DOI
10.1002/sim.7679
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Subject classification codes
07 ; 0710 ; 09 ;
Abstract
Many published scale validation studies determine inter-rater reliability using the intra-class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how the interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that, for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non-uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification does not have a major impact on ICC.
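The abstract's point that ICC depends on the spread of subjects, even when rater error is held fixed, can be illustrated with a small simulation. The sketch below is not the authors' simulation design: the three subject mixes (`center_heavy`, `uniform`, `extreme_heavy`), the 5-grade scale, the Gaussian rater noise, and the function names are all assumptions made for illustration, and no mapping onto the paper's "convex" and "concave" distributions is claimed. The ICC form used is the single-rater, two-way random-effects, absolute-agreement coefficient, often written ICC(2,1), which may differ from the form used in the paper.

```python
import random

# Hypothetical subject mixes over a 5-grade scale (grades 0..4). The weights
# are illustrative assumptions, not the distributions used in the paper.
GRADE_WEIGHTS = {
    "center_heavy": [1, 2, 4, 2, 1],
    "uniform": [1, 1, 1, 1, 1],
    "extreme_heavy": [4, 2, 1, 2, 4],
}


def icc_2_1(ratings):
    """ICC(2,1) from a subjects-by-raters table (list of equal-length rows)."""
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, raters, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                # between-subject mean square
    msc = ss_cols / (k - 1)                # between-rater mean square
    mse = ss_err / ((n - 1) * (k - 1))     # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def simulate_icc(weights, n_subjects=260, n_raters=5, rater_sd=1.0, seed=0):
    """Simulate ratings for one subject mix and return the estimated ICC.

    Subjects are allocated to grades in exact proportion to `weights`, so the
    only random element is the rater noise (assumed Gaussian, same SD for all
    raters and all grades).
    """
    total = sum(weights)
    if n_subjects % total:
        raise ValueError("n_subjects must be a multiple of sum(weights)")
    per_unit = n_subjects // total
    true_grades = [g for g, w in enumerate(weights) for _ in range(w * per_unit)]

    rng = random.Random(seed)
    ratings = [
        [g + rng.gauss(0.0, rater_sd) for _ in range(n_raters)]
        for g in true_grades
    ]
    return icc_2_1(ratings)


if __name__ == "__main__":
    for name, weights in GRADE_WEIGHTS.items():
        print(f"{name:14s} ICC(2,1) = {simulate_icc(weights):.3f}")
```

In this sketch the rater noise is identical across the three mixes, so any difference in the estimated ICC comes only from the between-subject variance, which is smallest when subjects cluster at the middle grades and largest when they cluster at the extremes.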
Pages: 2734-2752
Page count: 19