Single-Channel Speech Enhancement With Phase Reconstruction Based on Phase Distortion Averaging

被引:37
作者
Wakabayashi, Yukoh [1 ]
Fukumori, Takahiro [2 ]
Nakayama, Masato [2 ]
Nishiura, Takanobu [2 ]
Yamashita, Yoichi [2 ]
机构
[1] Ritsumeikan Univ, Grad Sch Informat Sci & Engn, Kusatsu 5258577, Japan
[2] Ritsumeikan Univ, Coll Informat Sci & Engn, Kusatsu 5258577, Japan
基金
日本学术振兴会;
关键词
Phase reconstruction; speech enhancement; phase distortion; harmonic structure; fundamental frequency; SPECTRAL COEFFICIENTS; AMPLITUDE; SUPPRESSION; NOISE;
D O I
10.1109/TASLP.2018.2831632
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech enhancement has been widely investigated for several decades, but by modifying only the amplitude spectrum of a speech signal, ignoring the phase spectrum, which has been regarded as an unimportant feature. However, it was recently reported that the phase spectrum plays an important role in speech quality and intelligibility. In this paper, we propose a phase reconstruction method based on harmonic enhancement using the fundamental frequency and phase distortion feature. This feature is known to show fluctuations in the phase spectrum with respect to time and frequency. We estimate the speech phase spectrum by considering the relationship between harmonic phase spectra. Experimental evaluations indicate that the proposed phase reconstruction method improves speech quality in various noisy environments.
引用
收藏
页码:1559 / 1569
页数:11
相关论文
共 36 条
  • [1] [Anonymous], 2012, P INT WORKSH AC SIGN
  • [2] [Anonymous], 2014, MACHINE LEARNING SIG
  • [3] [Anonymous], P EUR SIGN PROC C
  • [4] [Anonymous], P INT WORKSH AC SIGN
  • [5] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [6] A uniform phase representation for the harmonic model in speech synthesis applications
    Degottex, Gilles
    Erro, Daniel
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 16
  • [7] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [8] Garofolo J. S., 1993, TIMIT ACOUSTIC PHONE
  • [9] Gerkmann T., 2012, IEEE 27th Convention of Electrical Electronics Engineers in Israel (IEEEI), P1
  • [10] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66