Role of Phase Estimation in Speech Enhancement

被引：0

作者：

Shannon, Benjamin J. ^{[1
]}

Paliwal, Kuldip K. ^{[1
]}

机构：

[1] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

speech enhancement; phase; windowing;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Typical speech enhancement algorithms that operate in the Fourier domain only modify the magnitude component. It is commonly understood that the phase component is perceptually unimportant, and thus, it is passed directly to the output. In recent intelligibility experiments, it has been reported that the Short-Time Fourier Transform (STFT) phase spectrum can provide significant intelligibility when estimated using a window function lower in dynamic range than the typical Hamming window. Motivated by this, we investigate the role of the window function for STFT phase estimation in relation to speech enhancement. Using a modified STFT Analysis-Modification-Synthesis (AMS) framework, we show that noise reduction can be achieved by modifying the window function used to estimate the STFT phase spectra. We demonstrate this through spectrogram plots and results from two objective speech quality measures.

引用

页码：1423 / 1426

页数：4

共 50 条

[31] EXPLOITING THE BASEBAND PHASE STRUCTURE OF THE VOICED SPEECH FOR SPEECH ENHANCEMENT
Patil, Sanjay P.
Gowdy, John N.
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[32] A Dual Stream Generative Adversarial Network with Phase Awareness for Speech Enhancement
Liang, Xintao
Li, Yuhang
Li, Xiaomin
Zhang, Yue
Ding, Youdong
INFORMATION, 2023, 14 (04)
[33] Speech enhancement by spectral magnitude estimation - A unifying approach
Xie, F
VanCompernolle, D
SPEECH COMMUNICATION, 1996, 19 (02) : 89 - 104
[34] An Estimation Theory-based Approach for Speech Enhancement
Ganesh, Mirishkar Sai
Karthik, M. L. N. S.
Patnaik, Bijayananda
2017 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2017,
[35] Auditory Mask Estimation by RPCA for Monaural Speech Enhancement
Shi, Wenhua
Zhang, Xiongwei
Zou, Xia
Han, Wei
Min, Gang
2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 179 - 184
[36] Novel Laplacian Factor Estimation Algorithm for Speech Enhancement
Ou Shifeng
Gao Ying
Wang Xianyun
Zhao Xiaohui
PROCEEDINGS OF 2010 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND INDUSTRIAL ENGINEERING, VOLS I AND II, 2010, : 678 - 682
[37] β-order MMSE spectral amplitude estimation for speech enhancement
You, CH
Koh, SN
Rahardja, S
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 475 - 486
[38] A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION
Nian, Zhaoxu
Tu, Yan-Hui
Du, Jun
Lee, Chin-Hui
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6913 - 6917
[39] Speech enhancement based on AR model parameters estimation
Deng, Feng
Bao, Changchun
SPEECH COMMUNICATION, 2016, 79 : 30 - 46
[40] Simultaneous Speech Detection and Magnitude Squared Spectrum Estimation Approach for Speech Enhancement
Han, Ruirui
Ou, Shifeng
Liu, Wei
Chen, Chen
Zhang, Shuo
PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 281 - 285

← 1 2 3 4 5 →