Role of Phase Estimation in Speech Enhancement

被引:0
作者
Shannon, Benjamin J. [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech enhancement; phase; windowing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical speech enhancement algorithms that operate in the Fourier domain only modify the magnitude component. It is commonly understood that the phase component is perceptually unimportant, and thus, it is passed directly to the output. In recent intelligibility experiments, it has been reported that the Short-Time Fourier Transform (STFT) phase spectrum can provide significant intelligibility when estimated using a window function lower in dynamic range than the typical Hamming window. Motivated by this, we investigate the role of the window function for STFT phase estimation in relation to speech enhancement. Using a modified STFT Analysis-Modification-Synthesis (AMS) framework, we show that noise reduction can be achieved by modifying the window function used to estimate the STFT phase spectra. We demonstrate this through spectrogram plots and results from two objective speech quality measures.
引用
收藏
页码:1423 / 1426
页数:4
相关论文
共 50 条
  • [31] EXPLOITING THE BASEBAND PHASE STRUCTURE OF THE VOICED SPEECH FOR SPEECH ENHANCEMENT
    Patil, Sanjay P.
    Gowdy, John N.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] A Dual Stream Generative Adversarial Network with Phase Awareness for Speech Enhancement
    Liang, Xintao
    Li, Yuhang
    Li, Xiaomin
    Zhang, Yue
    Ding, Youdong
    INFORMATION, 2023, 14 (04)
  • [33] Speech enhancement by spectral magnitude estimation - A unifying approach
    Xie, F
    VanCompernolle, D
    SPEECH COMMUNICATION, 1996, 19 (02) : 89 - 104
  • [34] An Estimation Theory-based Approach for Speech Enhancement
    Ganesh, Mirishkar Sai
    Karthik, M. L. N. S.
    Patnaik, Bijayananda
    2017 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2017,
  • [35] Auditory Mask Estimation by RPCA for Monaural Speech Enhancement
    Shi, Wenhua
    Zhang, Xiongwei
    Zou, Xia
    Han, Wei
    Min, Gang
    2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017), 2017, : 179 - 184
  • [36] Novel Laplacian Factor Estimation Algorithm for Speech Enhancement
    Ou Shifeng
    Gao Ying
    Wang Xianyun
    Zhao Xiaohui
    PROCEEDINGS OF 2010 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND INDUSTRIAL ENGINEERING, VOLS I AND II, 2010, : 678 - 682
  • [37] β-order MMSE spectral amplitude estimation for speech enhancement
    You, CH
    Koh, SN
    Rahardja, S
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (04): : 475 - 486
  • [38] A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION
    Nian, Zhaoxu
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6913 - 6917
  • [39] Speech enhancement based on AR model parameters estimation
    Deng, Feng
    Bao, Changchun
    SPEECH COMMUNICATION, 2016, 79 : 30 - 46
  • [40] Simultaneous Speech Detection and Magnitude Squared Spectrum Estimation Approach for Speech Enhancement
    Han, Ruirui
    Ou, Shifeng
    Liu, Wei
    Chen, Chen
    Zhang, Shuo
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 281 - 285