REAL-TIME MULTI-MICROPHONE RECOGNITION OF SIMULTANEOUS SOUNDS IN A ROOM ENVIRONMENT

Cited by: 0

Authors:

Chakraborty, Rupayan ^{[1]}

Nadeu, Climent ^{[1]}

Affiliations:

[1] Univ Politecn Cataluna, TALP Res Ctr, Dept Signal Theory & Commun, Barcelona, Spain

Source:

2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2013

Keywords:

Sound recognition; acoustic event detection; overlapped events; microphone arrays; null-steering beamforming; classification

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

Time overlapping of acoustic signals, which often occurs in real life, is a challenge for current state-of-the-art sound recognition systems. In this work, we propose an approach for detecting, identifying and positioning a set of simultaneous acoustic events in a room environment, using multiple arbitrarily located microphone arrays and working in real time. Assuming a set of estimated acoustic source positions, the use of a frequency-invariant null-steering beamformer for each position and each array yields a set of signals which show different balances among the various acoustic sources. For each signal, a model-based likelihood computation is carried out to obtain a matrix of likelihood scores. Then a MAP criterion is used to jointly detect the event classes and assign each of them to a given source position. Experimental results with two sources, one of which is speech, and two three-microphone linear arrays are reported, and a comparison with alternative approaches is carried out.
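The final step of the abstract, a joint decision over a matrix of likelihood scores, can be illustrated with a minimal sketch. This is not the authors' formulation, which the abstract does not spell out. It assumes a hypothetical matrix `loglik[p, c]` holding one log-likelihood score per source position `p` and event class `c`, an optional hypothetical class log-prior `log_prior`, and the simplifying assumption that each position receives a different class. The function name `map_assign` is also invented for illustration.

```python
from itertools import permutations

import numpy as np


def map_assign(loglik, log_prior=None):
    """Pick one distinct event class per source position by exhaustive search.

    loglik[p, c]: hypothetical log-likelihood of class c for the signal tied
    to position p. log_prior[c]: hypothetical log-prior of class c; if it is
    omitted, a uniform prior is used and the rule reduces to maximum
    likelihood.
    """
    loglik = np.asarray(loglik, dtype=float)
    n_pos, n_cls = loglik.shape
    if log_prior is None:
        log_prior = np.zeros(n_cls)

    best_score, best_classes = -np.inf, None
    # Assumption: classes are distinct across positions, so candidates are
    # ordered selections of n_pos classes out of n_cls.
    for classes in permutations(range(n_cls), n_pos):
        score = sum(loglik[p, c] + log_prior[c] for p, c in enumerate(classes))
        if score > best_score:
            best_score, best_classes = score, classes
    return best_classes, best_score


# Two positions, three classes: both rows prefer class 0 on their own,
# but the joint decision gives class 0 to position 1 and class 2 to position 0.
scores = [[-1.0, -5.0, -3.0],
          [-0.5, -4.0, -6.0]]
classes, total = map_assign(scores)
print(classes, total)
```

Exhaustive search is affordable here only because the number of sources and classes is small, as in the two-source setting mentioned in the abstract; a larger problem would need a proper assignment solver.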


Pages: 8672-8676

Page count: 5
