Visually Derived Wiener Filters for Speech Enhancement

Cited by: 57
Authors
Almajai, Ibrahim [1]
Milner, Ben [1]
Affiliations
[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England
Source
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011, Vol. 19, No. 6
Keywords
Audio-visual; maximum a posteriori; speech enhancement; Wiener filter; MODELS; NOISE;
DOI
10.1109/TASL.2010.2096212
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
The aim of this work is to examine whether visual speech information can be used to enhance audio speech that has been contaminated by noise. First, an analysis of audio and visual speech features is made, which identifies the pair with highest audio-visual correlation. The study also reveals that higher audio-visual correlation exists within individual phoneme sounds rather than globally across all speech. This correlation is exploited in the proposal of a visually derived Wiener filter that obtains clean speech and noise power spectrum statistics from visual speech features. Clean speech statistics are estimated from visual features using a maximum a posteriori framework that is integrated within the states of a network of hidden Markov models to provide phoneme localization. Noise statistics are obtained through a novel audio-visual voice activity detector which utilizes visual speech features to make robust speech/nonspeech classifications. The effectiveness of the visually derived Wiener filter is evaluated subjectively and objectively and is compared with three different audio-only enhancement methods over a range of signal-to-noise ratios.
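As a rough illustration of the filtering step named in the abstract, the sketch below computes a textbook per-bin Wiener gain, H = S / (S + N), from a clean speech power spectrum estimate S and a noise power spectrum estimate N, then applies it to a noisy frame. It is only a sketch under stated assumptions: in the paper these statistics are derived from visual features and an audio-visual voice activity detector, whereas here they are plain hand-written lists. The function names, the `floor` parameter, and the 4-bin values are hypothetical and not taken from the paper.

```python
def wiener_gain(speech_psd, noise_psd, floor=1e-12):
    """Per-bin Wiener gain H = S / (S + N) from power spectrum estimates."""
    if len(speech_psd) != len(noise_psd):
        raise ValueError("power spectra must have the same number of bins")
    # `floor` (an assumption of this sketch) guards against division by zero.
    return [s / max(s + n, floor) for s, n in zip(speech_psd, noise_psd)]


def apply_gain(noisy_spectrum, gain):
    """Scale each complex spectral bin of the noisy frame by its gain."""
    return [g * x for g, x in zip(gain, noisy_spectrum)]


# Hypothetical 4-bin frame; the values are illustrative, not from the paper.
speech_psd = [4.0, 1.0, 0.0, 9.0]   # assumed clean speech power per bin
noise_psd = [4.0, 3.0, 2.0, 1.0]    # assumed noise power per bin
gain = wiener_gain(speech_psd, noise_psd)
enhanced = apply_gain([1 + 1j, 2 + 0j, 0 + 2j, 10 + 0j], gain)
```

Bins where the estimated speech power dominates pass almost unchanged, while bins dominated by noise are attenuated toward zero, which is why the quality of the two power spectrum estimates determines the quality of the enhancement.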
Pages: 1642-1651
Number of pages: 10
Related papers
50 in total
  • [31] Single Channel Speech Enhancement: using Wiener Filtering with Recursive Noise Estimation
    Upadhyay, Navneet
    Jaiswal, Rahul Kumar
    PROCEEDING OF THE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2015), 2016, 84 : 22 - 30
  • [32] A Novel Speech Enhancement Method Using Power Spectra Smooth in Wiener Filtering
    Bao, Feng
    Dou, Hui-jing
    Jia, Mao-shen
    Bao, Chang-chun
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [33] Design of Matrix Wiener Filter for Noise Reduction and Speech Enhancement in Hearing Aids
    Modhave, Nayan
    Karuna, Yepuganti
    Tonde, Sourabh
    2016 IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2016, : 843 - 847
  • [34] Speech Enhancement for Secure Communication Using Coupled Spectral Subtraction and Wiener Filter
    Pardede, Hilman
    Ramli, Kalamullah
    Suryanto, Yohan
    Hayati, Nur
    Presekal, Alfan
    ELECTRONICS, 2019, 8 (08)
  • [35] MODULATION WIENER FILTER FOR IMPROVING SPEECH INTELLIGIBILITY
    Hsu, Chung-Chien
    Cheong, Kah-Meng
    Chien, Jen-Tzung
    Chi, Tai-Shih
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 370 - 374
  • [36] Enhancement of Non-Stationary Speech using Harmonic Chirp Filters
    Norholm, Sidsel Marie
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1755 - 1759
  • [37] Speech enhancement using sub-band cross-correlation compensated Wiener filter combined with harmonic regeneration
    Rao, Ch. V. Rama
    Murthy, M. B. Rama
    Rao, K. Srinivasa
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2012, 66 (06) : 459 - 464
  • [38] A Model-Based Spectral Envelope Wiener Filter for Perceptually Motivated Speech Enhancement
    Hadir, Najib
    Faubel, Friedrich
    Klakow, Dietrich
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 220 - 223
  • [39] Design of Multichannel Wiener Filter for Speech Enhancement in Hearing Aids and Noise Reduction Technique
    Modhave, Nayan
    Karuna, Yepuganti
    Tonde, Sourabh
    PROCEEDINGS OF 2016 ONLINE INTERNATIONAL CONFERENCE ON GREEN ENGINEERING AND TECHNOLOGIES (IC-GET), 2016,
  • [40] Speech Enhancement Based on Combination of Wiener Filter and Subspace Filter
    Xia Yousheng
    Huang Jianwen
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 459 - 463