EEG interpretation reliability and interpreter confidence: A large single-center study

被引：90

作者：

Grant, Arthur C. ^{[1
,2
,3
]}

Abdel-Baki, Samah G. ^{[4
]}

Weedon, Jeremy ^{[5
]}

Arnedo, Vanessa ^{[1
]}

Chari, Geetha ^{[1
]}

Koziorynska, Ewa ^{[1
]}

Lushbough, Catherine ^{[1
]}

Maus, Douglas ^{[1
,2
,3
]}

McSween, Tresa ^{[1
]}

Mortati, Katherine A. ^{[1
]}

Reznikov, Alexandra ^{[1
]}

Omurtag, Ahmet ^{[4
,6
]}

机构：

[1] Suny Downstate Med Ctr, Dept Neurol, Brooklyn, NY 11203 USA

[2] Suny Downstate Med Ctr, Dept Physiol, Brooklyn, NY 11203 USA

[3] Suny Downstate Med Ctr, Dept Pharmacol, Brooklyn, NY 11203 USA

[4] BioSignal Grp Corp, Brooklyn, NY USA

[5] Suny Downstate Med Ctr, Ctr Comp Sci, Brooklyn, NY 11203 USA

[6] Univ Houston, Dept Biomed Engn, Houston, TX USA

来源：

EPILEPSY & BEHAVIOR | 2014年 / 32卷

关键词：

Interrater reliability; Intrarater reliability; Confidence; EEG; SCALP ICTAL EEG; INTERRATER RELIABILITY; INTEROBSERVER RELIABILITY; RESEARCH TERMINOLOGY; AGREEMENT; ACCURACY; PATTERNS; CHILDREN;

D O I：

10.1016/j.yebeh.2014.01.011

中图分类号：

B84 [心理学]; C [社会科学总论]; Q98 [人类学];

学科分类号：

03 ; 0303 ; 030303 ; 04 ; 0402 ;

摘要：

The intrarater and interrater reliability (I&IR) of EEG interpretation has significant implications for the value of EEG as a diagnostic tool. We measured both the intrarater reliability and the interrater reliability of EEG interpretation based on the interpretation of complete EEGs into standard diagnostic categories and rater confidence in their interpretations and investigated sources of variance in EEG interpretations. During two distinct time intervals, six board-certified clinical neurophysiologists classified 300 EEGs into one or more of seven diagnostic categories and assigned a subjective confidence to their interpretations. Each EEG was read by three readers. Each reader interpreted 150 unique studies, and 50 studies were re-interpreted to generate intrarater data. A generalizability study assessed the contribution of subjects, readers, and the interaction between subjects and readers to interpretation variance. Five of the six readers had a median confidence of >= 99%, and the upper quartile of confidence values was 100% for all six readers. Intrarater Cohen's kappa (kappa(c)) ranged from 0.33 to 0.73 with an aggregated value of 0.59. Cohen's kappa ranged from 0.29 to 0.62 for the 15 reader pairs, with an aggregated Fleiss kappa of 0.44 for interrater agreement. Cohen's kappa was not significantly different across rater pairs (chi-square = 17.3, df = 14, p = 0.24). Variance due to subjects (i.e., EEGs) was 65.3%, due to readers was 3.9%, and due to the interaction between readers and subjects was 30.8%. Experienced epileptologists have very high confidence in their EEG interpretations and low to moderate I&IR, a common paradox in clinical medicine. A necessary, but insufficient, condition to improve EEG interpretation accuracy is to increase intrarater and interrater reliability. This goal could be accomplished, for instance, with an automated online application integrated into a continuing medical education module that measures and reports EEG I&IR to individual users. (C) 2014 Elsevier Inc. All rights reserved.

引用

页码：102 / 107

页数：6

共 23 条

[1] Interobserver Reproducibility of Electroencephalogram Interpretation in Critically Ill Children [J].