Visually Derived Wiener Filters for Speech Enhancement

Cited by: 57
Authors
Almajai, Ibrahim [1]
Milner, Ben [1]
Affiliations
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011, Vol. 19, No. 6
Keywords
Audio-visual; maximum a posteriori; speech enhancement; Wiener filter; MODELS; NOISE;
DOI
10.1109/TASL.2010.2096212
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The aim of this work is to examine whether visual speech information can be used to enhance audio speech that has been contaminated by noise. First, an analysis of audio and visual speech features is made, which identifies the pair with highest audio-visual correlation. The study also reveals that higher audio-visual correlation exists within individual phoneme sounds rather than globally across all speech. This correlation is exploited in the proposal of a visually derived Wiener filter that obtains clean speech and noise power spectrum statistics from visual speech features. Clean speech statistics are estimated from visual features using a maximum a posteriori framework that is integrated within the states of a network of hidden Markov models to provide phoneme localization. Noise statistics are obtained through a novel audio-visual voice activity detector which utilizes visual speech features to make robust speech/nonspeech classifications. The effectiveness of the visually derived Wiener filter is evaluated subjectively and objectively and is compared with three different audio-only enhancement methods over a range of signal-to-noise ratios.
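As a rough illustration of the filtering step the abstract refers to, the sketch below applies the textbook frequency-domain Wiener gain, G = S / (S + N), where S and N are per-bin clean speech and noise power spectrum estimates. It is not the authors' implementation: in the paper these statistics are obtained from visual features and an audio-visual voice activity detector, whereas here they are simply passed in as arrays. The function names `wiener_gain` and `enhance_frame` and the `floor` parameter are assumptions made for this example.

```python
import numpy as np


def wiener_gain(speech_psd, noise_psd, floor=1e-3):
    """Textbook Wiener gain per frequency bin: S / (S + N).

    speech_psd and noise_psd are assumed to be non-negative power spectrum
    estimates of the same shape. `floor` is a hypothetical lower bound on
    the gain, used here only to avoid zeroing bins completely.
    """
    speech_psd = np.asarray(speech_psd, dtype=float)
    noise_psd = np.asarray(noise_psd, dtype=float)
    denom = speech_psd + noise_psd
    # Leave the gain at 0 wherever both estimates are zero, then apply the floor.
    gain = np.divide(speech_psd, denom, out=np.zeros_like(denom), where=denom > 0)
    return np.maximum(gain, floor)


def enhance_frame(noisy_spectrum, speech_psd, noise_psd):
    """Scale one frame of a noisy (possibly complex) spectrum by the gain."""
    return wiener_gain(speech_psd, noise_psd) * np.asarray(noisy_spectrum)
```

For example, a bin whose speech and noise power estimates are equal receives a gain of 0.5, while a bin estimated to contain mostly speech is passed almost unchanged.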
Pages: 1642-1651
Page count: 10