Ensemble deep learning models for EEG-based auditory attention decoding

被引:0
作者
Zhao, Peng [1 ]
Wang, Ruicong [1 ]
Lin, Zijie [1 ]
Pan, Zexu [2 ]
Li, Haizhou [1 ,2 ]
Zhang, Xueyi [1 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Res Inst Big Data, Sch Data Sci, Shenzhen, Guangdong, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
来源
2024 IEEE 14TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, ISCSLP 2024 | 2024年
基金
中国国家自然科学基金;
关键词
Auditory attention; Electroencephalography; Brain-Computer Interface; Ensemble Learning;
D O I
10.1109/ISCSLP63861.2024.10800199
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Auditory attention decoding (AAD) is gaining traction in brain-computer interface (BCI) research, with a focus on using EEG signals to identify the attended speech. The First Chinese Auditory Attention Decoding Challenge presents two distinct scenarios: audio-only and audio-video conditions. Participating in the challenge, we introduce a novel multi-model ensemble approach designed to enhance the robustness and accuracy of AAD. Specifically, this approach integrates several advanced deep learning architectures, including CA-CNN, ResNet18, PyramidNet, and T-CNN, to effectively capture both temporal and spatial features from EEG data. We evaluate the performance of our ensemble model across two AAD single-subject and cross-subject-under both audio-only and audio-video conditions. Results demonstrate that the ensemble approach achieves relatively high AAD accuracy, with average accuracies of 97.99% and 97.81% for within-subject tasks under the respective conditions. We ranked third in the challenge.
引用
收藏
页码:339 / 343
页数:5
相关论文
共 19 条
[1]  
Cai S., 2024, Asa: An auditory spatial at- tention dataset with multiple speaking locations, DOI 10.5281zenodo.11541114
[2]  
Cai S., 2021, APSIPA Transactions on Signal and Information Processing, V12
[3]  
Cai S., 2024, IEEE T COGNITIVE DEV
[4]   Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition [J].
Cai, Shaofei ;
Li, Liang ;
Han, Xinzhe ;
Huang, Shan ;
Tian, Qi ;
Huang, Qingming .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) :1609-1621
[5]   EEG-based Auditory Attention Detection in Cocktail Party Environment [J].
Cai, Siqi ;
Zhu, Hongxu ;
Schultz, Tanja ;
Li, Haizhou .
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (03)
[6]   Ensemble deep learning in bioinformatics [J].
Cao, Yue ;
Geddes, Thomas Andrew ;
Yang, Jean Yee Hwa ;
Yang, Pengyi .
NATURE MACHINE INTELLIGENCE, 2020, 2 (09) :500-508
[8]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[9]   Auditory attention - focusing the searchlight on sound [J].
Fritz, Jonathan B. ;
Elhilali, Mounya ;
David, Stephen V. ;
Shamma, Shihab A. .
CURRENT OPINION IN NEUROBIOLOGY, 2007, 17 (04) :437-455
[10]   Deep Pyramidal Residual Networks [J].
Han, Dongyoon ;
Kim, Jiwhan ;
Kim, Junmo .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6307-6315