Role of Phase Estimation in Speech Enhancement

被引:0
作者
Shannon, Benjamin J. [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech enhancement; phase; windowing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical speech enhancement algorithms that operate in the Fourier domain only modify the magnitude component. It is commonly understood that the phase component is perceptually unimportant, and thus, it is passed directly to the output. In recent intelligibility experiments, it has been reported that the Short-Time Fourier Transform (STFT) phase spectrum can provide significant intelligibility when estimated using a window function lower in dynamic range than the typical Hamming window. Motivated by this, we investigate the role of the window function for STFT phase estimation in relation to speech enhancement. Using a modified STFT Analysis-Modification-Synthesis (AMS) framework, we show that noise reduction can be achieved by modifying the window function used to estimate the STFT phase spectra. We demonstrate this through spectrogram plots and results from two objective speech quality measures.
引用
收藏
页码:1423 / 1426
页数:4
相关论文
共 50 条
  • [21] NOISE ESTIMATION WITH LOW COMPLEXITY FOR SPEECH ENHANCEMENT
    Yong, Pei Chee
    Nordholm, Sven
    Dam, Hai Huyen
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 109 - 112
  • [22] Simultaneous detection and estimation approach for speech enhancement
    Abramson, Ari
    Cohen, Israel
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2348 - 2359
  • [23] The Effect of Spectral Estimation on Speech Enhancement Performance
    Charoenruengkit, Werayuth
    Erdoel, Nurguen
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1170 - 1179
  • [24] Investigation on the Band Importance of Phase-aware Speech Enhancement
    Zhang, Zhuohuang
    Williamson, Donald S.
    Shen, Yi
    INTERSPEECH 2022, 2022, : 4651 - 4655
  • [25] A PROBABILISTIC APPROACH FOR PHASE ESTIMATION IN SINGLE-CHANNEL SPEECH ENHANCEMENT USING VON MISES PHASE PRIORS
    Kulmer, Josef
    Mowlaee, Pejman
    Watanabe, Mario Kaoru
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [26] IRM WITH PHASE PARAMETERIZATION FOR SPEECH ENHANCEMENT
    Wang, Xianyun
    Bao, Changchun
    Cheng, Rui
    2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 209 - 213
  • [27] Speech Enhancement Using a Risk Estimation Approach
    Sadasivan, Jishnu
    Seelamantula, Chandra Sekhar
    Muraka, Nagarjuna Reddy
    SPEECH COMMUNICATION, 2020, 116 : 12 - 29
  • [28] A Speech Enhancement Method by Coupling Speech Detection and Spectral Amplitude Estimation
    Deng, Feng
    Bao, Chang-Chun
    Bao, Feng
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3233 - 3237
  • [29] Speech enhancement based on MAP estimation using a variable speech distribution
    Tsukamoto, Yuta
    Kawamura, Arata
    Iiguni, Youji
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2007, E90A (08) : 1587 - 1593
  • [30] EXPLOITING THE BASEBAND PHASE STRUCTURE OF THE VOICED SPEECH FOR SPEECH ENHANCEMENT
    Patil, Sanjay P.
    Gowdy, John N.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,