EEG-based Auditory Attention Detection in Cocktail Party Environment

被引:8
作者
Cai, Siqi [1 ]
Zhu, Hongxu [1 ]
Schultz, Tanja [2 ]
Li, Haizhou [3 ,4 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] Univ Bremen, Cognit Syst Lab, Bremen, Germany
[3] Chinese Univ Hong Kong CUHK, Shenzhen Res Inst Big Data, Sch Data Sci, Shenzhen, Peoples R China
[4] Univ Bremen, Machine Listening Lab, Bremen, Germany
关键词
SPATIAL ATTENTION; NEURAL RESPONSES; SPEECH; BRAIN; HEARING; SPEAKER; ALPHA; REPRESENTATION; OSCILLATIONS; NEUROSCIENCE;
D O I
10.1561/116.00000128
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The cocktail party effect refers to a challenging problem in speech perception where one is able to selectively attend to one sound source in a noisy and multi-talk environment. The recent studies in neuroscience and psychoacoustics shed light on how the human brain solves the cocktail party problem, that inspires many computational solutions. With the advent of novel physiological techniques and deep learning algorithms, it is now possible to effectively detect auditory attention based on brain signals. In this paper, we provide a comprehensive overview of the most recent EEG-based auditory attention detection techniques and the methods to evaluate their performance. We examine both statistical and deep learning approaches, exploring their strengths and limitations. Furthermore, we also point out the gaps between the state-of-the-art and the practical needs in real-world applications. We also offer an overview of the available resources for EEG-based auditory attention detection research.
引用
收藏
页数:36
相关论文
共 118 条
[1]  
Accou B, 2021, EUR SIGNAL PR CONF, P1175, DOI [10.23919/Eusipco47968.2020.9287417, 10.23919/eusipco47968.2020.9287417]
[2]   Predicting speech intelligibility from EEG in a non-linear classification paradigm * [J].
Accou, Bernd ;
Monesi, Mohammad Jalilpour ;
Van hamme, Hugo ;
Francart, Tom .
JOURNAL OF NEURAL ENGINEERING, 2021, 18 (06)
[3]   Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise [J].
Ahveninen, Jyrki ;
Haemaelaeinen, Matti ;
Jaaskelainen, Iiro P. ;
Ahlfors, Seppo P. ;
Huang, Samantha ;
Lin, Fa-Hsuan ;
Raij, Tommi ;
Sams, Mikko ;
Vasios, Christos E. ;
Belliveau, John W. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (10) :4182-4187
[4]   Towards reconstructing intelligible speech from the human auditory cortex [J].
Akbari, Hassan ;
Khalighinejad, Bahar ;
Herrero, Jose L. ;
Mehta, Ashesh D. ;
Mesgarani, Nima .
SCIENTIFIC REPORTS, 2019, 9 (1)
[5]   Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling [J].
Akram, Sahar ;
Presacco, Alessandro ;
Simon, Jonathan Z. ;
Shamma, Shihab A. ;
Babadi, Behtash .
NEUROIMAGE, 2016, 124 :906-917
[6]   A Tutorial on Auditory Attention Identification Methods [J].
Alickovic, Emina ;
Lunner, Thomas ;
Gustafsson, Fredrik ;
Ljung, Lennart .
FRONTIERS IN NEUROSCIENCE, 2019, 13
[7]   Hidden hearing loss selectively impairs neural adaptation to loud sound environments [J].
Bakay, Warren Michael Henry ;
Anderson, Lucy Anne ;
Garcia-Lazaro, Jose Alberto ;
McAlpine, David ;
Schaette, Roland .
NATURE COMMUNICATIONS, 2018, 9
[8]   Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG [J].
Bednar, Adam ;
Lalor, Edmund C. .
NEUROIMAGE, 2020, 205
[9]   Different spatio-temporal electroencephalography features drive the successful decoding of binaural and monaural cues for sound localization [J].
Bednar, Adam ;
Boland, Francis M. ;
Lalor, Edmund C. .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2017, 45 (05) :679-689
[10]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828