An old but still burning problem: Inter-rater reliability in clinical trials with antidepressant medication

被引:3
作者
Berendsen, Steven [1 ,2 ]
Verdegaal, Loek M. A. [1 ]
van Tricht, Mirjam J. [1 ]
Blankers, Matthijs [2 ]
Van, Henricus L. [2 ]
de Haan, Lieuwe [1 ,2 ]
机构
[1] Univ Med Ctr Amsterdam, Locat Acad Med Ctr, Dept Psychiat, Meibergdreef 9, NL-1105 AZ Amsterdam, Netherlands
[2] Arkin Mental Hlth Care, Dept Res, Amsterdam, Netherlands
关键词
Inter-rater reliability; Antidepressant medication; Training procedures; Methodology; IMPACT; SCALE;
D O I
10.1016/j.jad.2020.07.080
中图分类号
R74 [神经病学与精神病学];
学科分类号
摘要
Background: Antidepressant trials are criticized due to potential methodological flaws. Root causes of failing methodology can be found in insufficient inter-rater reliability (IRR) and training practices, leading to higher placebo response and reduced study-power. However, it is unknown to what extent reliability estimates or training procedures are currently included in antidepressant reports. Therefore, we aimed to determine the proportion of publications concerning double-blind randomized controlled antidepressant trials that report on IRR coefficients and training procedures. Methods: We extracted all double-blind randomized clinical trials (RCTs) from the meta-analysis of Cipriani et al. (2018) concerning the period from 2000 until January 2016. Further, we conducted a Medline-search for double-blind RCTs from January 2016 until January 2020 for additional reports. We identified IRR coefficients and training procedures in these publications. Results: In total we identified 179 double-blind RCTs. Only 4.5% reported an IRR coefficient whereas 27.9% reported training procedures. Limitations: We did not contact individual authors for additional information regarding implementation of training procedures or inter-rater reliability assessment. Conclusions: There is a substantial lack of reporting IRR coefficients and training procedures in RCTs with antidepressant medication. Considering the large implications of insufficient reliability, we urge researchers to conduct and report training procedures and reliability estimations.
引用
收藏
页码:748 / 751
页数:4
相关论文
共 16 条
[1]  
Berendsen S., 2019, PRETRAINING IN UNPUB
[2]   Comparative efficacy and acceptability of 21 antidepressant drugs for the acute treatment of adults with major depressive disorder: a systematic review and network meta-analysis [J].
Cipriani, Andrea ;
Furukawa, Tashi A. ;
Salanti, Georgia ;
Chaimani, Anna ;
Atkinson, Lavren Z. ;
Ogawa, Yusuke ;
Levcht, Stefan ;
Ruhe, Henricus G. ;
Turner, Erick H. ;
Higgins, Julian P. T. ;
Egger, Matthias ;
Takeshima, Nozomi ;
Hayasaka, Yu ;
Imai, Hissei ;
Shinohara, Kiyomi ;
Tajika, Aran ;
Ioannidis, John P. A. ;
Geddes, Jahn R. .
LANCET, 2018, 391 (10128) :1357-1366
[3]   THE MONTGOMERY-ASBERG DEPRESSION SCALE - RELIABILITY AND VALIDITY [J].
DAVIDSON, J ;
TURNBULL, CD ;
STRICKLAND, R ;
MILLER, R ;
GRAVES, K .
ACTA PSYCHIATRICA SCANDINAVICA, 1986, 73 (05) :544-548
[4]   The History and Current State of Antidepressant Clinical Trial Design: A Call to Action for Proof-of-Concept Studies [J].
Gelenberg, Alan J. ;
Thase, Michael E. ;
Meyer, Roger E. ;
Goodwin, Frederick K. ;
Katz, Martin M. ;
Kraemer, Helena Chmura ;
Potter, William Z. ;
Shelton, Richard C. ;
Fava, Maurizio ;
Khan, Arif ;
Trivedi, Madhukar H. ;
Ninan, Philip T. ;
Mann, John J. ;
Bergeson, Susan ;
Endicott, Jean ;
Kocsis, James H. ;
Leon, Andrew C. ;
Manji, Husseini K. ;
Rosenbaum, Jerrold F. .
JOURNAL OF CLINICAL PSYCHIATRY, 2008, 69 (10) :1513-+
[5]   A new approach to rater training and certification in a multicenter clinical trial [J].
Kobak, KA ;
Lipsitz, JD ;
Williams, JBW ;
Engelhardt, N ;
Bellew, KM .
JOURNAL OF CLINICAL PSYCHOPHARMACOLOGY, 2005, 25 (05) :407-412
[6]   Rater training in multicenter clinical trials: Issues and recommendations [J].
Kobak, KA ;
Engelhardt, N ;
Williams, JBW ;
Lipsitz, JD .
JOURNAL OF CLINICAL PSYCHOPHARMACOLOGY, 2004, 24 (02) :113-117
[7]   Site Versus Centralized Raters in a Clinical Depression Trial Impact on Patient Selection and Placebo Response [J].
Kobak, Kenneth A. ;
Leuchter, Andrew ;
DeBrota, David ;
Engelhardt, Nina ;
Williams, Janet B. W. ;
Cook, Ian A. ;
Leon, Andrew C. ;
Alpert, Jonathan .
JOURNAL OF CLINICAL PSYCHOPHARMACOLOGY, 2010, 30 (02) :193-197
[8]   Routine evaluation of mental health: Reliable information or worthless 'guesstimates'? [J].
Loevdahl, H ;
Friis, S .
ACTA PSYCHIATRICA SCANDINAVICA, 1996, 93 (02) :125-128
[9]   Why Are Innovative Drugs Failing in Phase III? [J].
Marder, Stephen R. ;
Laughren, Thomas ;
Romano, Steven J. .
AMERICAN JOURNAL OF PSYCHIATRY, 2017, 174 (09) :829-831
[10]   Improvement of inter-rater reliability of PANSS items and subscales by a standardized rater training [J].
Muller, MJ ;
Wetzel, H .
ACTA PSYCHIATRICA SCANDINAVICA, 1998, 98 (02) :135-139