EEG data augmentation: towards class imbalance problem in sleep staging tasks

被引:46
作者
Fan, Jiahao [1 ,2 ]
Sun, Chenglu [1 ]
Chen, Chen [1 ]
Jiang, Xinyu [1 ]
Liu, Xiangyu [3 ]
Zhao, Xian [1 ]
Meng, Long [1 ]
Dai, Chenyun [1 ]
Chen, Wei [1 ,2 ]
机构
[1] Fudan Univ, Sch Informat Sci & Technol, Dept Elect Engn, Ctr Intelligent Med Elect, Shanghai 200433, Peoples R China
[2] Fudan Univ, Human Phenome Inst, 825 Zhangheng Rd, Shanghai 201203, Peoples R China
[3] East China Univ Sci & Technol, Sch Art Design & Media, Shanghai 200237, Peoples R China
基金
中国博士后科学基金; 国家重点研发计划;
关键词
sleep stage classification; class imbalance problem; data augmentation; generative adversarial network; NEURAL-NETWORK; RESOURCE; TIME;
D O I
10.1088/1741-2552/abb5be
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective.Automatic sleep staging models suffer from an inherent class imbalance problem (CIP), which hinders the classifiers from achieving a better performance. To address this issue, we systematically studied sleep electroencephalogram data augmentation (DA) approaches. Furthermore, we modified and transferred novel DA approaches from related research fields, yielding new efficient ways to enhance sleep datasets.Approach.This study covers five DA methods, including repeating minority classes, morphological change, signal segmentation and recombination, dataset-to-dataset transfer, as well as generative adversarial network (GAN). We evaluated these mentioned DA methods by a sleep staging model on two datasets, the Montreal archive of sleep studies (MASS) and Sleep-EDF. We used a classification model with a typical convolutional neural network architecture to evaluate the effectiveness of the mentioned DA approaches. We also conducted a comprehensive analysis of these methods.Main results.The classification results showed that DA methods, especially DA by GAN, significantly improved the total classification performance in comparison with the baseline. The improvement of accuracy, F1 score and Cohen Kappa coefficient range from 0.90% to 3.79%, 0.73% to 3.48%, 2.61% to 5.43% on MASS and 1.36% to 4.79%, 1.47% to 4.23%, 2.22% to 4.04% on Sleep-EDF, respectively. DA methods improved the classification performance in most cases, whereas the performance of class N1 showed a subtle degradation in the F1 scores.Significance.Overall, our study proved that DA approaches are efficient in alleviating CIP lying in sleep staging tasks. Meanwhile, this study provided avenues for further improving the sleep staging accuracy using DA methods.
引用
收藏
页数:14
相关论文
共 39 条
  • [1] 1 T, 2016, 2016 ADV NEURAL INFO, P2234
  • [2] AN IMPROVED ALGORITHM FOR NEURAL-NETWORK CLASSIFICATION OF IMBALANCED TRAINING SETS
    ANAND, R
    MEHROTRA, KG
    MOHAN, CK
    RANKA, S
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (06): : 962 - 969
  • [3] [Anonymous], 2006, SLEEP DISORDERS SLEE
  • [4] [Anonymous], 2016, ARXIV161001683
  • [5] [Anonymous], 2007, DMIN
  • [6] [Anonymous], 2019, IEEE ACCESS
  • [7] Arjovsky Martin, 2017, ARXIV170107875
  • [8] Berry R B, 2012, RULES TERMINOL TECH, V176, P2012
  • [9] Borgwardt K. M., 2007, Adv. Neural Inf. Process. Syst.
  • [10] A systematic study of the class imbalance problem in convolutional neural networks
    Buda, Mateusz
    Maki, Atsuto
    Mazurowski, Maciej A.
    [J]. NEURAL NETWORKS, 2018, 106 : 249 - 259