Parametric modelling for single-channel blind dereverberation of speech from a moving speaker

被引：5

作者：

Evers, C. ^{[1
]}

Hopgood, J. R. ^{[1
]}

机构：

[1] Univ Edinburgh, Inst Digital Commun, Sch Engn & Elect, Edinburgh EH9 3JL, Midlothian, Scotland

来源：

IET SIGNAL PROCESSING | 2008年 / 2卷 / 02期

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1049/iet-spr:20070046

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Single-channel blind dereverberation for the enhancement of speech acquired in acoustic environments is essential in applications where microphone arrays prove impractical. In many scenarios, the source-sensor geometry is not varying rapidly, but in most applications the geometry is subject to change, for example when a user wishes to move around a room. A previous model-based approach to blind dereverberation by representing the channel as a linear time-varying all-pole filter is extended, in which the parameters of the filter are modelled as a linear combination of known basis functions with unknown weightings. Moreover, an improved block-based time-varying autoregressive model is proposed for the speech signal, which aims to reflect the underlying signal statistics more accurately on both a local and global level. Given these parametric models, their coefficients are estimated using Bayesian inference, so that the channel estimate can then be used for dereverberation. An in-depth discussion is also presented about the applicability of these models to real speech and a real acoustic environment. Results are presented to demonstrate the performance of the Bayesian inference algorithms.

引用

页码：59 / 74

页数：16

共 50 条

[31] Speaker Distance Estimation in Enclosures From Single-Channel Audio
Neri, Michael
Politis, Archontis
Krause, Daniel Aleksander
Carli, Marco
Virtanen, Tuomas
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (2242-2254) : 2242 - 2254
[32] Speaker Counting and Separation From Single-Channel Noisy Mixtures
Chetupalli, Srikanth Raj
Habets, Emanuel A. P.
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1681 - 1692
[33] Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions
Sadjadi, Seyed Omid
Hansen, John H. L.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2138 - 2141
[34] Improved single-channel noise reduction method of speech by blind source separation
Hamid, Mohammad Ekramul
Ogawa, Keita
Fukabayashi, Takeshi
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (03) : 153 - 164
[35] Complex Cepstrum Based Single Channel Speech Dereverberation
Shen Xizhong
Meng Guang
ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 7 - +
[36] Single-Channel Speech Dereverberation Based on Block-wise Weighted Prediction Error and Nonnegative Matrix Factorization
Kwak, Chan Woong
Jeon, Kwang Myung
Park, In Young
Kim, Hong Kook
Lim, Jeong Eun
Park, Ji Hyun
2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
[37] SINGLE-CHANNEL BLIND DEREVERBERATION BASED ON RANK-1 MATRIX LIFTING IN TIME-FREQUENCY DOMAIN
Yohena, Fumiki
Yatabe, Kohei
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 891 - 895
[38] Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation
Zhang Long
Xu Xu
Chen Huang
Chen Jiaxu
Ye Zhongfu
SPEECH COMMUNICATION, 2018, 97 : 1 - 8
[39] Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization
Ueda, Yuma
Wang, Longbiao
Kai, Atsuhiko
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 379 - +
[40] Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization
Ueda, Yuma
Wang, Longbiao
Kai, Atsuhiko
Xiao, Xiong
Chng, Eng Siong
Li, Haizhou
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (02): : 151 - 161

← 1 2 3 4 5 →