Wide-band dereverberation method based on multichannel linear prediction using prewhitening filter

被引:5
作者
Okamoto, Takuma [1 ,2 ]
Iwaya, Yukio [1 ,3 ]
Suzuki, Yoiti [1 ,3 ]
机构
[1] Tohoku Univ, Res Inst Elect Commun, Aoba Ku, Sendai, Miyagi 9808577, Japan
[2] Tohoku Univ, Grad Sch Engn, Aoba Ku, Sendai, Miyagi 9808577, Japan
[3] Tohoku Univ, Grad Sch Informat Sci, Aoba Ku, Sendai, Miyagi 9808577, Japan
基金
日本学术振兴会;
关键词
Microphone array signal processing; Dereverberation; LIME algorithm; Colored source signal; Prewhitening filter; IDENTIFICATION;
D O I
10.1016/j.apacoust.2011.07.004
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several dereverberation algorithms have been studied. The sampling frequencies used in conventional studies are typically 8-16 kHz because their main purpose is preprocessing for improving the intelligibility of speech communication and articulation for automatic speech recognition. However, in next-generation communication systems, techniques to analyze and reproduce not only semantic information of sound but also more high-definition components such as spatial information and directivity will be increasingly necessary. To decompose these sound field characteristics with high definition, a dereverberation algorithm that is useful at high sampling frequencies is an important technique to process sound that includes high-frequency spectra such as musical sounds. The LInear-predictive Multichannel Equalization (LIME) algorithm is a promising dereverberation method. Using the LIME algorithm, however, a dereverberation signal cannot be solved at high sampling frequencies when the source signal is colored, such as in the case of speech and sound of musical signals. Because the rank of the correlation matrix calculated from such a colored signal is not full, the characteristic polynomial cannot be calculated precisely. To alleviate this problem, we propose preprocessing of all input signals with filters to whiten their spectra so that this algorithm can function for colored signals at high sampling frequencies. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:50 / 55
页数:6
相关论文
共 20 条
[1]   Precise dereverberation using multichannel linear prediction [J].
Delcroix, Marc ;
Hikichi, Takafumi ;
Miyoshi, Masato .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02) :430-440
[2]   Robust speech dereverberation using multichannel blind deconvolution with spectral subtraction [J].
Furuya, Ken'ichi ;
Kataoka, Akitoshi .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05) :1579-1591
[3]  
Harville David A, 1998, Matrix algebra from a statistician's perspective
[4]  
Hayes M. H., 1996, Statistical digital signal processing and modeling
[5]   Identification of acoustic MIMO systems: Challenges and opportunities [J].
Huang, YT ;
Benesty, J ;
Chen, JD .
SIGNAL PROCESSING, 2006, 86 (06) :1278-1295
[6]   A class of frequency-domain adaptive approaches to blind multichannel identification [J].
Huang, YT ;
Benesty, J .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2003, 51 (01) :11-24
[7]  
Ljung L., 1987, System Identification: Theory for the User
[8]   INVERSE FILTERING OF ROOM ACOUSTICS [J].
MIYOSHI, M ;
KANEDA, Y .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1988, 36 (02) :145-152
[9]  
Moon T.K., 1999, MATH METHODS ALGORIT
[10]   SUBSPACE METHODS FOR THE BLIND IDENTIFICATION OF MULTICHANNEL FIR FILTERS [J].
MOULINES, E ;
DUHAMEL, P ;
CARDOSO, JF ;
MAYRARGUE, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (02) :516-525