Smile: A System to Support Machine Learning on EEG Data at Scale

被引:6
作者
Cao, Lei [1 ]
Tao, Wenbo [1 ]
An, Sungtae [3 ]
Jin, Jing [2 ]
Yan, Yizhou [1 ]
Liu, Xiaoyu [1 ]
Ge, Wendong [2 ]
Sah, Adam [1 ]
Battle, Leilani [4 ]
Sun, Jimeng [3 ]
Chang, Remco [5 ]
Westover, Brandon [2 ]
Madden, Samuel [1 ]
Stonebrakerl, Michael [1 ]
机构
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] Massachusetts Gen Hosp, Boston, MA 02114 USA
[3] Georgia Inst Technol, Atlanta, GA 30332 USA
[4] Univ Maryland, College Pk, MD 20742 USA
[5] Tufts Univ, Medford, MA 02155 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2019年 / 12卷 / 12期
关键词
CRITICALLY-ILL; ADULT EEGS; EXPLORATION;
D O I
10.14778/3352063.3352138
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to reduce the possibility of neural injury from seizures and sidestep the need for a neurologist to spend hours on manually reviewing the EEG recording, it is critical to automatically detect and classify "interictal-ictal continuum" (IIC) patterns from EEG data. However, the existing IIC classification techniques are shown to be not accurate and robust enough for clinical use because of the lack of high quality labels of EEG segments as training data. Obtaining high-quality labeled data is traditionally a manual process by trained clinicians that can be tedious, time-consuming, and error-prone. In this work, we propose Smile, an industrial scale system that provides an end-to-end solution to the IIC pattern classification problem. The core components of Smile include a visualization-based time series labeling module and a deep-learning based active learning module. The labeling module enables the users to explore and label 350 million EEG segments (30TB) at interactive speed. The multiple coordinated views allow the users to examine the EEG signals from both time domain and frequency domain simultaneously. The active learning module first trains a deep neural network that automatically extracts both the local features with respect to each segment itself and the long term dynamics of the EEG signals to classify IIC patterns. Then leveraging the output of the deep learning model, the EEG segments that can best improve the model are selected and prompted to clinicians to label. This process is iterated until the clinicians and the models show high degree of agreement. Our initial experimental results show that our Smile system allows the clinicians to label the EEG segments at will with a response time below 500 ms. The accuracy of the model is progressively improved as more and more high quality labels are acquired over time.
引用
收藏
页码:2230 / 2241
页数:12
相关论文
共 59 条
[1]   Automatic EEG analysis during long-term monitoring in the ICU [J].
Agarwal, R ;
Gotman, J ;
Flanagan, D ;
Rosenblatt, B .
ELECTROENCEPHALOGRAPHY AND CLINICAL NEUROPHYSIOLOGY, 1998, 107 (01) :44-58
[2]   Performance of Spectrogram-Based Seizure Identification of Adult EEGs by Critical Care Nurses and Neurophysiologists [J].
Amorim, Edilberto ;
Williamson, Craig A. ;
Moura, Lidia M. V. R. ;
Shafi, Mouhsin M. ;
Gaspard, Nicolas ;
Rosenthal, Eric S. ;
Guanci, Mary M. ;
Rajajee, Venkatakrishna ;
Westover, M. Brandon .
JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2017, 34 (04) :359-364
[3]   Dynamic Prefetching of Data Tiles for Interactive Visualization [J].
Battle, Leilani ;
Chang, Remco ;
Stonebraker, Michael .
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, :1363-1375
[4]   Commercial Visual Analytics Systems-Advances in the Big Data Analytics Field [J].
Behrisch, Michael ;
Streeb, Dirk ;
Stoffel, Florian ;
Seebacher, Daniel ;
Matejek, Brian ;
Weber, Stefan Hagen ;
Mittelstaedt, Sebastian ;
Pfister, Hanspeter ;
Keim, Daniel .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2019, 25 (10) :3011-3031
[5]   A Linear-Time Algorithm for the Aggregation and Visualization of Big Spatial Point Data [J].
Beilschmidt, Christian ;
Fober, Thomas ;
Mattig, Michael ;
Seeger, Bernhard .
25TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2017), 2017,
[6]   Expert-level sleep scoring with deep neural networks [J].
Biswal, Siddharth ;
Sun, Haoqi ;
Goparaju, Balaji ;
Westover, M. Brandon ;
Sun, Jimeng ;
Bianchi, Matt T. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (12) :1643-1650
[7]   FEATURE EXTRACTION FROM ELECTROENCEPHALOGRAM BY ADAPTIVE SEGMENTATION [J].
BODENSTEIN, G ;
PRAETORIUS, HM .
PROCEEDINGS OF THE IEEE, 1977, 65 (05) :642-652
[8]   Maintaining Interactivity While Exploring Massive Time Series [J].
Chan, Sye-Min ;
Xiao, Ling ;
Gerth, John ;
Hanrahan, Pat .
IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2008, PROCEEDINGS, 2008, :59-66
[9]   Visual Abstraction and Exploration of Multi-class Scatterplots [J].
Chen, Haidong ;
Chen, Wei ;
Mel, Honghui ;
Liu, Zhiqi ;
Zhou, Kun ;
Chen, Weifeng ;
Gu, Wentao ;
Ma, Kwan-Liu .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2014, 20 (12) :1683-1692
[10]  
Cheng Daniel, 2013, 2013 IEEE International Conference on Big Data, P2, DOI 10.1109/BigData.2013.6691787