Visually Derived Wiener Filters for Speech Enhancement

Cited by: 57

Authors:

Almajai, Ibrahim [1]

Milner, Ben [1]

Affiliations:

[1] Univ E Anglia, Sch Comp Sci, Norwich NR4 7TJ, Norfolk, England

Source:

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2011 / Vol. 19 / No. 06

Keywords:

Audio-visual; maximum a posteriori; speech enhancement; Wiener filter; MODELS; NOISE;

DOI:

10.1109/TASL.2010.2096212

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

The aim of this work is to examine whether visual speech information can be used to enhance audio speech that has been contaminated by noise. First, an analysis of audio and visual speech features is made, which identifies the pair with highest audio-visual correlation. The study also reveals that higher audio-visual correlation exists within individual phoneme sounds rather than globally across all speech. This correlation is exploited in the proposal of a visually derived Wiener filter that obtains clean speech and noise power spectrum statistics from visual speech features. Clean speech statistics are estimated from visual features using a maximum a posteriori framework that is integrated within the states of a network of hidden Markov models to provide phoneme localization. Noise statistics are obtained through a novel audio-visual voice activity detector which utilizes visual speech features to make robust speech/nonspeech classifications. The effectiveness of the visually derived Wiener filter is evaluated subjectively and objectively and is compared with three different audio-only enhancement methods over a range of signal-to-noise ratios.
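The abstract describes a Wiener filter built from clean speech and noise power spectrum statistics. The sketch below illustrates only the generic frequency-domain Wiener gain, H = S / (S + N), applied to one noisy spectral frame. It is not the authors' implementation: the visual estimation of the statistics (MAP within HMM states, audio-visual voice activity detection) is not shown, and the function names, the gain floor, and the example values are assumptions made for illustration.

```python
import numpy as np

def wiener_gain(speech_psd, noise_psd, floor=1e-3):
    """Per-bin Wiener gain H = S / (S + N).

    speech_psd and noise_psd are power spectrum estimates for one frame.
    In the paper these come from visual features; here they are simply
    passed in (assumption). `floor` is a hypothetical lower bound on the gain.
    """
    speech_psd = np.asarray(speech_psd, dtype=float)
    noise_psd = np.asarray(noise_psd, dtype=float)
    # Guard against division by zero when both estimates are zero.
    gain = speech_psd / np.maximum(speech_psd + noise_psd, 1e-12)
    return np.maximum(gain, floor)

def enhance_frame(noisy_spectrum, speech_psd, noise_psd):
    """Scale a complex noisy spectrum by the Wiener gain (phase is unchanged)."""
    return wiener_gain(speech_psd, noise_psd) * np.asarray(noisy_spectrum)

# Toy three-bin frame with made-up statistics.
speech_psd = np.array([4.0, 1.0, 0.0])
noise_psd = np.array([1.0, 1.0, 1.0])
noisy = np.array([2.0 + 0.0j, 1.0 + 1.0j, 0.5 + 0.0j])
enhanced = enhance_frame(noisy, speech_psd, noise_psd)
```

Bins where the estimated speech power dominates pass nearly unchanged, while bins dominated by noise are attenuated towards the floor.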

Pages: 1642-1651

Page count: 10

