EEG-BASED DECODING OF AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC

被引:0
作者
Cantisani, Giorgia [1 ]
Essid, Slim [1 ]
Richard, Gael [1 ]
机构
[1] Telecom Paris, Inst Polytech Paris, LTCI, F-75013 Paris, France
来源
2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA) | 2019年
基金
欧盟地平线“2020”;
关键词
Auditory attention decoding; Polyphonic music; EEG; Stimulus reconstruction model; TIMBRE; CORTEX;
D O I
10.1109/waspaa.2019.8937219
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Auditory attention decoding aims at determining which sound source a subject is "focusing on". In this work, we address the problem of EEG-based decoding of auditory attention to a target instrument in realistic polyphonic music. To this end, we exploit a stimulus reconstruction model which was proven to decode successfully the attention to speech in multi-speaker environments. To our knowledge, this model was never applied to musical stimuli for decoding attention. The task we consider here is quite complex as the stimuli used are polyphonic, including duets and trios, and are reproduced using loudspeakers instead of headphones. We consider the decoding of three different audio representations and investigate the influence on the decoding performance of multiple variants of musical stimuli, such as the number and type of instruments in the mixture, the spatial rendering, the music genre and the melody/rhythmical pattern that is played. We obtain promising results, comparable to those obtained on speech data in previous works, and confirm that it is possible to correlate the human brain activity with musically relevant features of the attended source.
引用
收藏
页码:80 / 84
页数:5
相关论文
共 31 条
[1]   Towards reconstructing intelligible speech from the human auditory cortex [J].
Akbari, Hassan ;
Khalighinejad, Bahar ;
Herrero, Jose L. ;
Mehta, Ashesh D. ;
Mesgarani, Nima .
SCIENTIFIC REPORTS, 2019, 9 (1)
[2]  
Akram Sahar., 2014, Advances in Neural Information Processing Systems, P460
[3]   Single-trial analysis and classification of ERP components - A tutorial [J].
Blankertz, Benjamin ;
Lemm, Steven ;
Treder, Matthias ;
Haufe, Stefan ;
Mueller, Klaus-Robert .
NEUROIMAGE, 2011, 56 (02) :814-825
[4]   Neural source dynamics of brain responses to continuous stimuli: Speech processing from acoustics to comprehension [J].
Brodbeck, Christian ;
Presacco, Alessandro ;
Simon, Jonathan Z. .
NEUROIMAGE, 2018, 172 :162-174
[5]   Interactive processing of timbre dimensions: A Garner interference study [J].
Caclin, Anne ;
Giard, Marie-Helene ;
Smith, Bennett K. ;
McAdams, Stephen .
BRAIN RESEARCH, 2007, 1138 :159-170
[6]  
Cong FY, 2012, EUR SIGNAL PR CONF, P494
[7]   The Multivariate Temporal Response Function (mTRF) Toolbox: A MATLAB Toolbox for Relating Neural Signals to Continuous Stimuli [J].
Crosse, Michael J. ;
Di Liberto, Giovanni M. ;
Bednar, Adam ;
Lalor, Edmund C. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2016, 10
[8]   Auditory stream segregation relying on timbre involves left auditory cortex [J].
Deike, S ;
Gaschler-Markefski, B ;
Brechmann, A ;
Scheich, H .
NEUROREPORT, 2004, 15 (09) :1511-1514
[9]  
Ding N., 2012, P NATL ACAD SCI USA, V109, p11 854
[10]   Changes in emotional tone and instrumental timbre are reflected by the mismatch negativity [J].
Goydke, KN ;
Altenmüller, E ;
Möller, J ;
Münte, TF .
COGNITIVE BRAIN RESEARCH, 2004, 21 (03) :351-359