Parametric modelling for single-channel blind dereverberation of speech from a moving speaker

被引:5
|
作者
Evers, C. [1 ]
Hopgood, J. R. [1 ]
机构
[1] Univ Edinburgh, Inst Digital Commun, Sch Engn & Elect, Edinburgh EH9 3JL, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1049/iet-spr:20070046
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Single-channel blind dereverberation for the enhancement of speech acquired in acoustic environments is essential in applications where microphone arrays prove impractical. In many scenarios, the source-sensor geometry is not varying rapidly, but in most applications the geometry is subject to change, for example when a user wishes to move around a room. A previous model-based approach to blind dereverberation by representing the channel as a linear time-varying all-pole filter is extended, in which the parameters of the filter are modelled as a linear combination of known basis functions with unknown weightings. Moreover, an improved block-based time-varying autoregressive model is proposed for the speech signal, which aims to reflect the underlying signal statistics more accurately on both a local and global level. Given these parametric models, their coefficients are estimated using Bayesian inference, so that the channel estimate can then be used for dereverberation. An in-depth discussion is also presented about the applicability of these models to real speech and a real acoustic environment. Results are presented to demonstrate the performance of the Bayesian inference algorithms.
引用
收藏
页码:59 / 74
页数:16
相关论文
共 50 条
  • [41] Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization
    Yuma Ueda
    Longbiao Wang
    Atsuhiko Kai
    Xiong Xiao
    Eng Siong Chng
    Haizhou Li
    Journal of Signal Processing Systems, 2016, 82 : 151 - 161
  • [42] Single-Channel Multitalker Speech Recognition
    Rennie, Steven J.
    Hershey, John R.
    Olsen, Peder A.
    IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 66 - 80
  • [43] Model-based Single-Channel Dereverberation in Noisy Acoustical Environments
    Bao, Xulei
    Zhu, Jie
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 154 - 157
  • [44] SOURCE-AWARE CONTEXT NETWORK FOR SINGLE-CHANNEL MULTI-SPEAKER SPEECH SEPARATION
    Li, Zeng-Xi
    Song, Yan
    Dai, Li-Rong
    McLoughlin, Ian
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 681 - 685
  • [45] End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation
    Zuluaga-Gomez, Juan
    Huang, Zhaocheng
    Niu, Xing
    Paturi, Rohit
    Srinavasan, Sundararajan
    Mathur, Prashant
    Thompson, Brian
    Federico, Marcello
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7255 - 7274
  • [46] Blind C50 estimation from single-channel speech using a convolutional neural network
    Gamper, Hannes
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [47] BINAURAL EXTENSION AND PERFORMANCE OF SINGLE-CHANNEL SPECTRAL SUBTRACTION DEREVERBERATION ALGORITHMS
    Tsilfidis, Alexandros
    Georganti, Eleftheria
    Mourjopoulos, John
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 1737 - 1740
  • [48] Weak Speech Recovery for Single-Channel Speech Enhancement
    Wong, Arthur
    Ming, Kok
    Low, Siow Yong
    2012 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT AND ADVANCED SYSTEMS (ICIAS), VOLS 1-2, 2012, : 627 - 631
  • [49] Soft mask methods for single-channel speaker separation
    Reddy, Aarthi M.
    Raj, Bhiksha
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (06): : 1766 - 1776
  • [50] SINGLE-CHANNEL SPEAKER DISTANCE ESTIMATION IN REVERBERANT ENVIRONMENTS
    Neri, Michael
    Politis, Archontis
    Krause, Daniel
    Carli, Marco
    Virtanen, Tuomas
    2023 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, WASPAA, 2023,