REAL-TIME MULTI-MICROPHONE RECOGNITION OF SIMULTANEOUS SOUNDS IN A ROOM ENVIRONMENT

Cited by: 0
Authors
Chakraborty, Rupayan [1]
Nadeu, Climent [1]
Affiliations
[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain
Source
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013
Keywords
Sound recognition; acoustic event detection; overlapped events; microphone arrays; null-steering beamforming; classification
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Time overlapping of acoustic signals, which so often occurs in real life, is a challenge for current state-of-the-art sound recognition systems. In this work, we propose an approach for detecting, identifying and positioning a set of simultaneous acoustic events in a room environment, using multiple arbitrarily located microphone arrays, and working in real time. Assuming a set of estimated acoustic source positions, the use of a frequency-invariant null-steering beamformer for each position and each array yields a set of signals which show different balances among the various acoustic sources. For each signal, a model-based likelihood computation is carried out to obtain a matrix of likelihood scores. Then a MAP criterion is used to jointly detect the event classes and assign each of them to a given source position. Experimental results with two sources, one of which is speech, and two three-microphone linear arrays are reported, and a comparison with alternative approaches is carried out.
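The joint decision step described in the abstract can be illustrated with a minimal sketch. It is not the authors' formulation: it assumes a hypothetical log-likelihood matrix `loglik[p][c]` (one row per beamformed signal associated with position `p`, one column per event class `c`), assumes the sources belong to distinct classes, and scores each hypothesis as the sum of log-likelihoods plus an optional log-prior. The class names, the numbers, and the function `map_joint_assignment` are invented for illustration.

```python
import itertools
import math


def map_joint_assignment(loglik, log_prior=None):
    """Pick the class-to-position assignment with the highest posterior score.

    loglik[p][c] is an (assumed) log-likelihood of class c for the signal
    associated with source position p. log_prior, if given, maps a tuple of
    class indices to a log-prior; omitting it means a uniform prior.
    Returns (best_classes, best_score), where best_classes[p] is the class
    index assigned to position p.
    """
    num_positions = len(loglik)
    num_classes = len(loglik[0])
    best_classes, best_score = None, -math.inf
    # Enumerate every ordered choice of distinct classes, one per position.
    for classes in itertools.permutations(range(num_classes), num_positions):
        score = sum(loglik[p][c] for p, c in enumerate(classes))
        if log_prior is not None:
            score += log_prior(classes)
        if score > best_score:
            best_classes, best_score = classes, score
    return best_classes, best_score


# Hypothetical example: two positions, three classes (values are made up).
CLASSES = ["speech", "door_knock", "phone_ring"]
loglik = [
    [-10.0, -30.0, -25.0],  # signal associated with position 0
    [-28.0, -12.0, -20.0],  # signal associated with position 1
]
best, score = map_joint_assignment(loglik)
labels = [CLASSES[c] for c in best]
print(labels, score)
```

Exhaustive enumeration is adequate for the two-source setting reported in the abstract; with more sources or classes, the hypothesis space grows quickly and a pruned search would be the more natural choice.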
Pages: 8672-8676
Page count: 5