Parametric modelling for single-channel blind dereverberation of speech from a moving speaker

被引:5
|
作者
Evers, C. [1 ]
Hopgood, J. R. [1 ]
机构
[1] Univ Edinburgh, Inst Digital Commun, Sch Engn & Elect, Edinburgh EH9 3JL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1049/iet-spr:20070046
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Single-channel blind dereverberation for the enhancement of speech acquired in acoustic environments is essential in applications where microphone arrays prove impractical. In many scenarios, the source-sensor geometry is not varying rapidly, but in most applications the geometry is subject to change, for example when a user wishes to move around a room. A previous model-based approach to blind dereverberation by representing the channel as a linear time-varying all-pole filter is extended, in which the parameters of the filter are modelled as a linear combination of known basis functions with unknown weightings. Moreover, an improved block-based time-varying autoregressive model is proposed for the speech signal, which aims to reflect the underlying signal statistics more accurately on both a local and global level. Given these parametric models, their coefficients are estimated using Bayesian inference, so that the channel estimate can then be used for dereverberation. An in-depth discussion is also presented about the applicability of these models to real speech and a real acoustic environment. Results are presented to demonstrate the performance of the Bayesian inference algorithms.
引用
收藏
页码:59 / 74
页数:16
相关论文
共 50 条
  • [31] Speaker Distance Estimation in Enclosures From Single-Channel Audio
    Neri, Michael
    Politis, Archontis
    Krause, Daniel Aleksander
    Carli, Marco
    Virtanen, Tuomas
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 (2242-2254) : 2242 - 2254
  • [32] Speaker Counting and Separation From Single-Channel Noisy Mixtures
    Chetupalli, Srikanth Raj
    Habets, Emanuel A. P.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1681 - 1692
  • [33] Assessment of Single-Channel Speech Enhancement Techniques for Speaker Identification under Mismatched Conditions
    Sadjadi, Seyed Omid
    Hansen, John H. L.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2138 - 2141
  • [34] Improved single-channel noise reduction method of speech by blind source separation
    Hamid, Mohammad Ekramul
    Ogawa, Keita
    Fukabayashi, Takeshi
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (03) : 153 - 164
  • [35] Complex Cepstrum Based Single Channel Speech Dereverberation
    Shen Xizhong
    Meng Guang
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 7 - +
  • [36] Single-Channel Speech Dereverberation Based on Block-wise Weighted Prediction Error and Nonnegative Matrix Factorization
    Kwak, Chan Woong
    Jeon, Kwang Myung
    Park, In Young
    Kim, Hong Kook
    Lim, Jeong Eun
    Park, Ji Hyun
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [37] SINGLE-CHANNEL BLIND DEREVERBERATION BASED ON RANK-1 MATRIX LIFTING IN TIME-FREQUENCY DOMAIN
    Yohena, Fumiki
    Yatabe, Kohei
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 891 - 895
  • [38] Supervised single-channel speech dereverberation and denoising using a two-stage model based sparse representation
    Zhang Long
    Xu Xu
    Chen Huang
    Chen Jiaxu
    Ye Zhongfu
    SPEECH COMMUNICATION, 2018, 97 : 1 - 8
  • [39] Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization
    Ueda, Yuma
    Wang, Longbiao
    Kai, Atsuhiko
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 379 - +
  • [40] Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization
    Ueda, Yuma
    Wang, Longbiao
    Kai, Atsuhiko
    Xiao, Xiong
    Chng, Eng Siong
    Li, Haizhou
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (02): : 151 - 161