EEG interpretation reliability and interpreter confidence: A large single-center study

被引:90
作者
Grant, Arthur C. [1 ,2 ,3 ]
Abdel-Baki, Samah G. [4 ]
Weedon, Jeremy [5 ]
Arnedo, Vanessa [1 ]
Chari, Geetha [1 ]
Koziorynska, Ewa [1 ]
Lushbough, Catherine [1 ]
Maus, Douglas [1 ,2 ,3 ]
McSween, Tresa [1 ]
Mortati, Katherine A. [1 ]
Reznikov, Alexandra [1 ]
Omurtag, Ahmet [4 ,6 ]
机构
[1] Suny Downstate Med Ctr, Dept Neurol, Brooklyn, NY 11203 USA
[2] Suny Downstate Med Ctr, Dept Physiol, Brooklyn, NY 11203 USA
[3] Suny Downstate Med Ctr, Dept Pharmacol, Brooklyn, NY 11203 USA
[4] BioSignal Grp Corp, Brooklyn, NY USA
[5] Suny Downstate Med Ctr, Ctr Comp Sci, Brooklyn, NY 11203 USA
[6] Univ Houston, Dept Biomed Engn, Houston, TX USA
关键词
Interrater reliability; Intrarater reliability; Confidence; EEG; SCALP ICTAL EEG; INTERRATER RELIABILITY; INTEROBSERVER RELIABILITY; RESEARCH TERMINOLOGY; AGREEMENT; ACCURACY; PATTERNS; CHILDREN;
D O I
10.1016/j.yebeh.2014.01.011
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
The intrarater and interrater reliability (I&IR) of EEG interpretation has significant implications for the value of EEG as a diagnostic tool. We measured both the intrarater reliability and the interrater reliability of EEG interpretation based on the interpretation of complete EEGs into standard diagnostic categories and rater confidence in their interpretations and investigated sources of variance in EEG interpretations. During two distinct time intervals, six board-certified clinical neurophysiologists classified 300 EEGs into one or more of seven diagnostic categories and assigned a subjective confidence to their interpretations. Each EEG was read by three readers. Each reader interpreted 150 unique studies, and 50 studies were re-interpreted to generate intrarater data. A generalizability study assessed the contribution of subjects, readers, and the interaction between subjects and readers to interpretation variance. Five of the six readers had a median confidence of >= 99%, and the upper quartile of confidence values was 100% for all six readers. Intrarater Cohen's kappa (kappa(c)) ranged from 0.33 to 0.73 with an aggregated value of 0.59. Cohen's kappa ranged from 0.29 to 0.62 for the 15 reader pairs, with an aggregated Fleiss kappa of 0.44 for interrater agreement. Cohen's kappa was not significantly different across rater pairs (chi-square = 17.3, df = 14, p = 0.24). Variance due to subjects (i.e., EEGs) was 65.3%, due to readers was 3.9%, and due to the interaction between readers and subjects was 30.8%. Experienced epileptologists have very high confidence in their EEG interpretations and low to moderate I&IR, a common paradox in clinical medicine. A necessary, but insufficient, condition to improve EEG interpretation accuracy is to increase intrarater and interrater reliability. This goal could be accomplished, for instance, with an automated online application integrated into a continuing medical education module that measures and reports EEG I&IR to individual users. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:102 / 107
页数:6
相关论文
共 23 条
[1]   Interobserver Reproducibility of Electroencephalogram Interpretation in Critically Ill Children [J].
Abend, Nicholas S. ;
Gutierrez-Colina, Ana ;
Zhao, Huaqing ;
Guo, Rong ;
Marsh, Eric ;
Clancy, Robert R. ;
Dlugos, Dennis J. .
JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2011, 28 (01) :15-19
[2]   An intervention to improve the interrater reliability of clinical EEG interpretations [J].
Azuma, H ;
Hori, S ;
Nakanishi, M ;
Fujimoto, S ;
Ichikawa, N ;
Furukawa, TA .
PSYCHIATRY AND CLINICAL NEUROSCIENCES, 2003, 57 (05) :485-489
[3]   High inter-reviewer variability of spike detection on intracranial EEG addressed by an automated multi-channel algorithm [J].
Barkmeier, Daniel T. ;
Shah, Aashit K. ;
Flanagan, Danny ;
Atkinson, Marie D. ;
Agarwal, Rajeev ;
Fuerst, Darren R. ;
Jafari-Khouzani, Kourosh ;
Loeb, Jeffrey A. .
CLINICAL NEUROPHYSIOLOGY, 2012, 123 (06) :1088-1095
[4]   Interrater reliability of EEG-video monitoring [J].
Benbadis, S. R. ;
LaFrance, W. C. ;
Papandonatos, G. D. ;
Korabathina, K. ;
Lin, K. ;
Kraemer, H. C. .
NEUROLOGY, 2009, 73 (11) :843-846
[5]   Overconfidence as a cause of diagnostic error in medicine [J].
Berner, Eta S. ;
Graber, Mark L. .
AMERICAN JOURNAL OF MEDICINE, 2008, 121 (05) :2-23
[6]   Overconfidence in clinical decision making [J].
Croskerry, Pat ;
Norman, Geoff .
AMERICAN JOURNAL OF MEDICINE, 2008, 121 (05) :24-29
[7]  
Fleiss JL., 1981, STAT METHODS RATES P
[8]   Interobserver Agreement in the Interpretation of EEG Patterns in Critically III Adults [J].
Gerber, Paula A. ;
Chapman, Kevin E. ;
Chung, Steve S. ;
Drees, Cornelia ;
Maganti, Rama K. ;
Ng, Yu-tze ;
Treiman, David M. ;
Little, Andrew S. ;
Kerrigan, John F. .
JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2008, 25 (05) :241-249
[9]   Web-Based Collection of Expert Opinion on Routine Scalp EEG: Software Development and Interrater Reliability [J].
Halford, Jonathan J. ;
Pressly, William B. ;
Benbadis, Selim R. ;
Tatum, William O. ;
Turner, Robert P. ;
Arain, Amir ;
Pritchard, Paul B. ;
Edwards, Jonathan C. ;
Dean, Brian C. .
JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2011, 28 (02) :178-184
[10]   The ACNS subcommittee on research terminology for continuous EEG monitoring: Proposed standardized terminology for rhythmic and periodic EEG patterns encountered in critically ill patients [J].
Hirsch, LJ ;
Brenner, RP ;
Drislane, FW ;
So, E ;
Kaplan, PW ;
Jordan, KG ;
Herman, ST ;
LaRoche, SM ;
Young, B ;
Bleck, TP ;
Scheuer, ML ;
Emerson, RG .
JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2005, 22 (02) :128-135