Role of Phase Estimation in Speech Enhancement

被引：0

作者：

Shannon, Benjamin J. ^{[1
]}

Paliwal, Kuldip K. ^{[1
]}

机构：

[1] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech enhancement; phase; windowing;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Typical speech enhancement algorithms that operate in the Fourier domain only modify the magnitude component. It is commonly understood that the phase component is perceptually unimportant, and thus, it is passed directly to the output. In recent intelligibility experiments, it has been reported that the Short-Time Fourier Transform (STFT) phase spectrum can provide significant intelligibility when estimated using a window function lower in dynamic range than the typical Hamming window. Motivated by this, we investigate the role of the window function for STFT phase estimation in relation to speech enhancement. Using a modified STFT Analysis-Modification-Synthesis (AMS) framework, we show that noise reduction can be achieved by modifying the window function used to estimate the STFT phase spectra. We demonstrate this through spectrogram plots and results from two objective speech quality measures.

引用

页码：1423 / 1426

页数：4

共 50 条

[41] Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
Song, Yanjue
Madhu, Nilesh
[J]. SENSORS, 2023, 23 (14)
[42] Joint Soft Threshold and Statistical Estimation for Speech Enhancement
Van Khanh Mai
Pastor, Dominique
Aissa-El-Bey, Abdeldjalil
[J]. 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC 2018), 2018, : 249 - 253
[43] A filter constructed from estimation of clean speech and noise for speech enhancement in speech recognition systems
Meng Sha
Qin Shenghao
Liu Jia
[J]. 2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1620 - +
[44] DCT-based speech enhancement using a new speech variance estimation
Ou Shifeng
Zhao Xiaohui
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-4, 2006, : 1302 - 1305
[45] NOISE POWER SPECTRUM ESTIMATION BASED ON WEAK SPEECH PROTECTION FOR SPEECH ENHANCEMENT
Feng, Yan
An, Baokun
[J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 484 - 487
[46] Autoregressive parameter estimation for Kalman filtering speech enhancement
You, Chang Huai
Rahardja, Susanto
Koh, Soo Ngee
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 913 - +
[47] PHASE CONTINUITY: LEARNING DERIVATIVES OF PHASE SPECTRUM FOR SPEECH ENHANCEMENT
Kim, Doyeon
Han, Hyewon
Shin, Hyeon-Kyeong
Chung, Soo-Whan
Kang, Hong-Goo
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6942 - 6946
[48] A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios
Krawczyk-Becker, Martin
Gerkmann, Timo
[J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 407 - 416
[49] Eigenvector-Based Speech Mask Estimation for Multi-Channel Speech Enhancement
Pfeifenberger, Lukas
Zoehrer, Matthias
Pernkopf, Franz
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2162 - 2172
[50] Speech Enhancement in Modulation Domain Using Codebook-based Speech and Noise Estimation
Mani, Vidhyasagar
Champagne, Benoit
Zhu, Wei-Ping
[J]. 2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 707 - 711

← 1 2 3 4 5 →