Speech enhancement using nonlinear microphone array based on noise adaptive complementary beamforming

被引：0

作者：

Sabuwatari, H ^{[1
]}

Kajita, S

Takeda, K

Itakura, F

机构：

[1] Nagoya Univ, Grad Sch Engn, Dept Informat Elect, Nagoya, Aichi 4648603, Japan

[2] Nagoya Univ, Ctr Informat Media Studies, Nagoya, Aichi 4648603, Japan

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2000年 / E83A卷 / 05期

关键词：

speech enhancement; microphone array; complementary beamforming; noise adaptation; spectral subtraction;

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper describes an improved complementary beamforming microphone array based on the new noise adaptation algorithm. Complementary beamforming is based on two types of beamformers designed to obtain complementary directivity patterns with respect to each other. In this system, during a pause in the target speech, two directivity patterns of the beamformers are adapted to the noise directions of arrival so that the expectation values of each noise power spectrum are minimized in the array output. Using this technique, we can realize the directional nulls for each noise even when the number of sound sources exceeds that of microphones. To evaluate the effectiveness, speech enhancement experiments and speech recognition experiments are performed based on computer simulations with a two-element array and three sound sources under various noise conditions. In comparison with the conventional adaptive beamformer and the conventional spectral subtraction method cascaded with the adaptive beamformer, it is shown that (1) the proposed array improves the signal-to-noise ratio (SNR) of degraded speech by more than 6 dB when the interfering noise is two speakers with the input SNR of below 0 dB, (2) the proposed array improves the SNR by about 2 dB when the interfering noise is bubble noise, and (3) an improvement in the recognition rate of more than 18% is obtained when the interfering noise is two speakers or two overlapped signals of some speakers under the condition that the input SNR is 10 dB.

引用

页码：866 / 876

页数：11

共 20 条

[1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION [J].

BOLL, SF .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02) :113-120

[2] COMPARISON OF PARAMETRIC REPRESENTATIONS FOR MONOSYLLABIC WORD RECOGNITION IN CONTINUOUSLY SPOKEN SENTENCES [J].

DAVIS, SB ;

MERMELSTEIN, P .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (04) :357-366

[3]

Deller Jr J. R., 1993, DISCRETE TIME PROCES

[4] Microphone array systems for hands-free telecommunication [J].

Elko, GW .

SPEECH COMMUNICATION, 1996, 20 (3-4) :229-240

[5] COMPUTER-STEERED MICROPHONE ARRAYS FOR SOUND TRANSDUCTION IN LARGE ROOMS [J].

FLANAGAN, JL ;

JOHNSTON, JD ;

ZAHN, R ;

ELKO, GW .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1985, 78 (05) :1508-1518

[6]

FROST OL, 1972, P IEEE, V60, P928

[7] AN ALTERNATIVE APPROACH TO LINEARLY CONSTRAINED ADAPTIVE BEAMFORMING [J].

GRIFFITHS, LJ ;

JIM, CW .

IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1982, 30 (01) :27-34

[8]

*IEICE, 1988, HDB EL INF COMM ENG, P2220

[9]

Johnson, 1993, ARRAY SIGNAL PROCESS

[10]

Kajita S., 1997, Journal of the Acoustical Society of Japan, V53, P337

← 1 2 →