Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions

Cited by: 7
Authors
Wang, Lei [1,2]
Wu, Ed X. [2]
Chen, Fei [1]
Affiliations
[1] Southern University of Science and Technology, Department of Electrical and Electronic Engineering, Shenzhen, China
[2] The University of Hong Kong, Department of Electrical and Electronic Engineering, Hong Kong, China
Source
FRONTIERS IN HUMAN NEUROSCIENCE | 2020, Vol. 14
Funding
National Natural Science Foundation of China
Keywords
EEG; temporal response function (TRF); auditory attention decoding; speech RMS-level segments; signal-to-noise ratio; CORTICAL ENTRAINMENT; NORMAL-HEARING; INTELLIGIBILITY; TRACKING; COMPREHENSION; OSCILLATIONS; RESPONSES; BRAIN; DELTA; THETA;
DOI
10.3389/fnhum.2020.557534
Chinese Library Classification
Q189 [Neuroscience]
Subject Classification Code
071006
Abstract
Through auditory attentional modulation, the attended speech stream can be tracked robustly even in adverse auditory scenarios and can be decoded from electroencephalographic (EEG) data. Speech segmentation based on the relative root-mean-square (RMS) intensity can be used to estimate segmental contributions to perception in noisy conditions, and high-RMS-level segments carry crucial information for speech perception. Hence, this study investigated the effect of high-RMS-level speech segments on auditory attention decoding performance under various signal-to-noise ratio (SNR) conditions. Scalp EEG signals were recorded while subjects attended to one of two speech streams in a mixture narrated concurrently by two Mandarin speakers. The temporal response function (TRF) was used to identify the attended speech from EEG responses tracking the temporal envelope of intact speech and the temporal envelope of high-RMS-level speech segments alone. Decoding performance was then analyzed under various SNR conditions by comparing the correlations of EEG responses with the attended and ignored speech streams. The decoding accuracy obtained with the temporal envelope of high-RMS-level speech segments was not inferior to that obtained with the temporal envelope of intact speech. Cortical activity correlated more strongly with attended than with ignored speech across SNR conditions. These results suggest that EEG recordings corresponding to high-RMS-level speech segments carry crucial information for identifying and tracking attended speech in the presence of background noise. This study also showed that, with auditory attentional modulation, attended speech can be decoded more robustly from neural activity than from behavioral measures across a wide range of SNRs.
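For illustration, the Python sketch below outlines the two computational ideas described in the abstract: marking high-RMS-level segments relative to an utterance-level RMS reference, and labeling the attended speaker by comparing correlations between an EEG-reconstructed envelope (the output of a backward TRF model, commonly of the form s_hat(t) = sum over channels n and lags tau of g(tau, n) * r(t + tau, n)) and the two candidate speech envelopes. This is a minimal sketch, not the authors' pipeline; the function names, the 16-ms frame length, the -10 dB relative threshold, and the 64-Hz envelope rate are all illustrative assumptions.

import numpy as np

def high_rms_mask(envelope, fs, frame_ms=16.0, threshold_db=-10.0):
    """Mark samples in frames whose RMS lies within threshold_db of the
    utterance-level RMS (frame length and threshold are illustrative)."""
    frame = max(1, int(fs * frame_ms / 1000.0))
    n_frames = len(envelope) // frame
    frames = envelope[:n_frames * frame].reshape(n_frames, frame)
    frame_rms = np.sqrt(np.mean(frames ** 2, axis=1) + 1e-12)
    global_rms = np.sqrt(np.mean(envelope ** 2) + 1e-12)
    rel_db = 20.0 * np.log10(frame_rms / global_rms)
    return np.repeat(rel_db >= threshold_db, frame)  # boolean mask per sample

def decode_attention(reconstructed_env, env_a, env_b):
    """Label the attended stream as the one whose envelope correlates more
    strongly with the envelope reconstructed from EEG (backward-TRF output)."""
    r_a = np.corrcoef(reconstructed_env, env_a)[0, 1]
    r_b = np.corrcoef(reconstructed_env, env_b)[0, 1]
    return ("A" if r_a > r_b else "B"), (r_a, r_b)

# Toy usage with synthetic envelopes (replace with real speech envelopes and an
# EEG-reconstructed envelope obtained from a trained backward TRF model).
fs = 64                                              # envelope rate in Hz (assumption)
rng = np.random.default_rng(0)
env_a = np.abs(rng.standard_normal(fs * 60))
env_b = np.abs(rng.standard_normal(fs * 60))
recon = 0.3 * env_a + rng.standard_normal(fs * 60)   # reconstruction biased toward A
mask = high_rms_mask(env_a, fs, frame_ms=125.0)      # segments from the attended stream
label_intact, _ = decode_attention(recon, env_a, env_b)
label_high_rms, _ = decode_attention(recon[:mask.size][mask],
                                     env_a[:mask.size][mask],
                                     env_b[:mask.size][mask])

In the study itself, the segment analysis is defined on the speech signal and the envelope reconstruction comes from a regularized backward TRF trained on EEG; the synthetic data above only indicate where those quantities would enter the attended-versus-ignored comparison.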
Pages: 13