Comparison of Two-Talker Attention Decoding from EEG with Nonlinear Neural Networks and Linear Methods

被引:84
作者
Ciccarelli, Gregory [1 ]
Nolan, Michael [1 ]
Perricone, Joseph [1 ]
Calamia, Paul T. [1 ]
Haro, Stephanie [1 ,2 ]
O'Sullivan, James [3 ]
Mesgarani, Nima [3 ]
Quatieri, Thomas F. [1 ,2 ]
Smalt, Christopher J. [1 ]
机构
[1] MIT, Lincoln Lab, Bioengn Syst & Technol Grp, 244 Wood St, Lexington, MA 02173 USA
[2] Harvard Med Sch, Speech & Hearing Biosci & Technol, Boston, MA 02115 USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
关键词
AUDITORY ATTENTION; ATTENDED SPEECH; HEARING-LOSS; ENVIRONMENT; TRACKING; SPEAKER;
D O I
10.1038/s41598-019-47795-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Auditory attention decoding (AAD) through a brain-computer interface has had a flowering of developments since it was first introduced by Mesgarani and Chang (2012) using electrocorticograph recordings. AAD has been pursued for its potential application to hearing-aid design in which an attention-guided algorithm selects, from multiple competing acoustic sources, which should be enhanced for the listener and which should be suppressed. Traditionally, researchers have separated the AAD problem into two stages: reconstruction of a representation of the attended audio from neural signals, followed by determining the similarity between the candidate audio streams and the reconstruction. Here, we compare the traditional two-stage approach with a novel neural-network architecture that subsumes the explicit similarity step. We compare this new architecture against linear and non-linear (neural-network) baselines using both wet and dry electroencephalogram (EEG) systems. Our results indicate that the new architecture outperforms the baseline linear stimulus-reconstruction method, improving decoding accuracy from 66% to 81% using wet EEG and from 59% to 87% for dry EEG. Also of note was the finding that the dry EEG system can deliver comparable or even better results than the wet, despite the latter having one third as many EEG channels as the former. The 11-subject, wet-electrode AAD dataset for two competing, co-located talkers, the 11-subject, dry-electrode AAD dataset, and our software are available for further validation, experimentation, and modification.
引用
收藏
页数:10
相关论文
共 40 条
  • [1] ABRAMS HB, 2015, HEAR REV, V22, P16
  • [2] Human cortical responses to the speech envelope
    Aiken, Steven J.
    Picton, Terence W.
    [J]. EAR AND HEARING, 2008, 29 (02) : 139 - 157
  • [3] Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling
    Akram, Sahar
    Presacco, Alessandro
    Simon, Jonathan Z.
    Shamma, Shihab A.
    Babadi, Behtash
    [J]. NEUROIMAGE, 2016, 124 : 906 - 917
  • [4] Andrade CC, 2018, AGEING SOC, V38, P2468, DOI [10.1017/s0144686x17000708, 10.1017/S0144686X17000708]
  • [5] [Anonymous], 2018, J ACOUST SOC AM, DOI DOI 10.1121/1.5035691
  • [6] Arlinger S, 2003, INT J AUDIOL, V42, pS17
  • [7] Aroudi A, 2016, INT CONF ACOUST SPEE, P694, DOI 10.1109/ICASSP.2016.7471764
  • [8] Attention selectively modulates cortical entrainment in different regions of the speech spectrum
    Baltzell, Lucas S.
    Horton, Cort
    Shen, Yi
    Richards, Virginia M.
    D'Zmura, Michael
    Srinivasan, Ramesh
    [J]. BRAIN RESEARCH, 2016, 1644 : 203 - 212
  • [9] Auditory-Inspired Speech Envelope Extraction Methods for Improved EEG-Based Auditory Attention Detection in a Cocktail Party Scenario
    Biesmans, Wouter
    Das, Neetha
    Francart, Tom
    Bertrand, Alexander
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2017, 25 (05) : 402 - 412
  • [10] Identifying auditory attention with ear-EEG: cEEGrid versus high-density cap-EEG comparison
    Bleichner, Martin G.
    Mirkovic, Bojana
    Debener, Stefan
    [J]. JOURNAL OF NEURAL ENGINEERING, 2016, 13 (06)