Role of Phase Estimation in Speech Enhancement

被引:0
作者
Shannon, Benjamin J. [1 ]
Paliwal, Kuldip K. [1 ]
机构
[1] Griffith Univ, Sch Engn, Brisbane, Qld 4111, Australia
来源
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年
关键词
speech enhancement; phase; windowing;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Typical speech enhancement algorithms that operate in the Fourier domain only modify the magnitude component. It is commonly understood that the phase component is perceptually unimportant, and thus, it is passed directly to the output. In recent intelligibility experiments, it has been reported that the Short-Time Fourier Transform (STFT) phase spectrum can provide significant intelligibility when estimated using a window function lower in dynamic range than the typical Hamming window. Motivated by this, we investigate the role of the window function for STFT phase estimation in relation to speech enhancement. Using a modified STFT Analysis-Modification-Synthesis (AMS) framework, we show that noise reduction can be achieved by modifying the window function used to estimate the STFT phase spectra. We demonstrate this through spectrogram plots and results from two objective speech quality measures.
引用
收藏
页码:1423 / 1426
页数:4
相关论文
共 50 条
  • [41] Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
    Song, Yanjue
    Madhu, Nilesh
    [J]. SENSORS, 2023, 23 (14)
  • [42] Joint Soft Threshold and Statistical Estimation for Speech Enhancement
    Van Khanh Mai
    Pastor, Dominique
    Aissa-El-Bey, Abdeldjalil
    [J]. 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL, IMAGE, VIDEO AND COMMUNICATIONS (ISIVC 2018), 2018, : 249 - 253
  • [43] A filter constructed from estimation of clean speech and noise for speech enhancement in speech recognition systems
    Meng Sha
    Qin Shenghao
    Liu Jia
    [J]. 2006 IMACS: MULTICONFERENCE ON COMPUTATIONAL ENGINEERING IN SYSTEMS APPLICATIONS, VOLS 1 AND 2, 2006, : 1620 - +
  • [44] DCT-based speech enhancement using a new speech variance estimation
    Ou Shifeng
    Zhao Xiaohui
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-4, 2006, : 1302 - 1305
  • [45] NOISE POWER SPECTRUM ESTIMATION BASED ON WEAK SPEECH PROTECTION FOR SPEECH ENHANCEMENT
    Feng, Yan
    An, Baokun
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 484 - 487
  • [46] Autoregressive parameter estimation for Kalman filtering speech enhancement
    You, Chang Huai
    Rahardja, Susanto
    Koh, Soo Ngee
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 913 - +
  • [47] PHASE CONTINUITY: LEARNING DERIVATIVES OF PHASE SPECTRUM FOR SPEECH ENHANCEMENT
    Kim, Doyeon
    Han, Hyewon
    Shin, Hyeon-Kyeong
    Chung, Soo-Whan
    Kang, Hong-Goo
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6942 - 6946
  • [48] A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    [J]. LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 407 - 416
  • [49] Eigenvector-Based Speech Mask Estimation for Multi-Channel Speech Enhancement
    Pfeifenberger, Lukas
    Zoehrer, Matthias
    Pernkopf, Franz
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2162 - 2172
  • [50] Speech Enhancement in Modulation Domain Using Codebook-based Speech and Noise Estimation
    Mani, Vidhyasagar
    Champagne, Benoit
    Zhu, Wei-Ping
    [J]. 2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 707 - 711