Visually Derived Wiener Filters for Speech Enhancement

被引:57
|
作者
Almajai, Ibrahim [1 ]
Milner, Ben [1 ]
机构
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
来源
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011年 / 19卷 / 06期
关键词
Audio-visual; maximum a posteriori; speech enhancement; Wiener filter; MODELS; NOISE;
D O I
10.1109/TASL.2010.2096212
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The aim of this work is to examine whether visual speech information can be used to enhance audio speech that has been contaminated by noise. First, an analysis of audio and visual speech features is made, which identifies the pair with highest audio-visual correlation. The study also reveals that higher audio-visual correlation exists within individual phoneme sounds rather than globally across all speech. This correlation is exploited in the proposal of a visually derived Wiener filter that obtains clean speech and noise power spectrum statistics from visual speech features. Clean speech statistics are estimated from visual features using a maximum a posteriori framework that is integrated within the states of a network of hidden Markov models to provide phoneme localization. Noise statistics are obtained through a novel audio-visual voice activity detector which utilizes visual speech features to make robust speech/nonspeech classifications. The effectiveness of the visually derived Wiener filter is evaluated subjectively and objectively and is compared with three different audio-only enhancement methods over a range of signal-to-noise ratios.
引用
收藏
页码:1642 / 1651
页数:10
相关论文
共 50 条
  • [1] Visually-derived wiener filters for speech enhancement
    Almajai, Ibrahim
    Ben Milner
    Darch, Jonathan
    Vaseghi, Saeed
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 585 - +
  • [2] A spectral filtering method based on hybrid wiener filters for speech enhancement
    Ding, Huijun
    Soon, Ing Yann
    Koh, Soo Nee
    Yeo, Chai Kiat
    SPEECH COMMUNICATION, 2009, 51 (03) : 259 - 267
  • [3] Speech intelligibility enhancement: a hybrid wiener approach
    Srinivasarao, V.
    Ghanekar, Umesh
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (03) : 517 - 525
  • [4] New insights on the optimality of parameterized Wiener filters for speech enhancement applications
    Chiea, Rafael Attili
    Costa, Marcio Holsbach
    Barrault, Guillaume
    SPEECH COMMUNICATION, 2019, 109 : 46 - 54
  • [5] Speech enhancement with an adaptive Wiener filter
    Abd El-Fattah, Marwa
    Dessouky, Moawad
    Abbas, Alaa
    Diab, Salaheldin
    El-Rabaie, El-Sayed
    Al-Nuaimy, Waleed
    Alshebeili, Saleh
    Abd El-Samie, Fathi
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 53 - 64
  • [6] SPEECH ENHANCEMENT USING IMPROVED MAP ESTIMATION AND WIENER FILTER
    Chehrehsa, Sarang
    Moir, Tom
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 494 - 498
  • [7] Speech intelligibility enhancement: a hybrid wiener approach
    V. Srinivasarao
    Umesh Ghanekar
    International Journal of Speech Technology, 2020, 23 : 517 - 525
  • [8] Noise Reduction and Speech Enhancement Using Wiener Filter
    Nuha, Hilal H.
    Absa, Ahmad Abo
    2022 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ITS APPLICATIONS (ICODSA), 2022, : 177 - 180
  • [9] Speech Enhancement Based on the Wiener Filter and Wavelet Entropy
    Jiao, Mingke
    Lou, Lin
    Geng, Xiliang
    Wang, Zhongming
    Zhang, Peng
    Liao, Xijiang
    Zhang, Wenyuan
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1956 - 1960
  • [10] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
    Garg, Anil
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675