HMM-Based Cue Parameters Estimation for Speech Enhancement

Cited by: 0
Authors
Deng, Feng [1]
Bao, Chang-chun [1]
Jia, Mao-shen [1]
Affiliations
[1] Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
Source
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP) | 2016
Funding
National Natural Science Foundation of China;
Keywords
speech enhancement; HMM; cue parameters; priori information; NOISE;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
In this paper, a hidden Markov model (HMM)-based cue parameter estimation method for single-channel speech enhancement is proposed, in which the cue parameters of binaural cue coding (BCC) are applied to a single-channel speech enhancement system. First, the clean speech and noise signals are treated as the left and right channels of a stereo signal, respectively, and the noisy speech is treated as the down-mixed mono signal of the BCC method. The clean cue parameters are extracted from the clean speech and noise data set, and the pre-enhanced cue parameters are extracted from the corresponding noisy speech data set. A cue HMM is then trained offline to capture the a priori information about the clean and pre-enhanced cue parameters. Next, using the trained cue HMM, the clean cue parameters are estimated online from the noisy speech. Finally, following the synthesis principle of BCC cue parameters, a speech estimator is constructed to enhance the noisy speech. Test results show that the proposed method outperforms the reference methods in terms of segmental signal-to-noise ratio (SNR), log spectral distortion, and PESQ.
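The stereo analogy in the abstract can be illustrated with a minimal sketch. The abstract does not say which cue parameters are used or how the speech estimator is built, so everything below is an assumption: the cue is taken to be a per-band level difference between the "left" (speech) and "right" (noise) channels, the two channels are assumed uncorrelated so that their powers add in the down-mix, and the function names are hypothetical. The cue is computed here from known speech and noise powers; in the described method it would instead be estimated from the noisy speech by the trained cue HMM.

```python
import numpy as np


def level_difference_db(speech_power, noise_power, eps=1e-12):
    """Per-band level difference in dB between the 'left' (speech)
    and 'right' (noise) channels. Assumed cue, not taken from the paper."""
    return 10.0 * np.log10((speech_power + eps) / (noise_power + eps))


def gain_from_level_difference(cld_db):
    """Amplitude gain that maps the down-mix to the 'left' channel,
    assuming uncorrelated channels (down-mix power = sum of powers)."""
    ratio = 10.0 ** (cld_db / 10.0)
    return np.sqrt(ratio / (1.0 + ratio))


# Toy per-band powers (illustrative values only).
speech_power = np.array([4.0, 1.0, 0.25])
noise_power = np.array([1.0, 1.0, 1.0])
noisy_power = speech_power + noise_power  # down-mixed "mono" signal

# Oracle cue; the described method would estimate this from noisy speech.
cld_db = level_difference_db(speech_power, noise_power)
gain = gain_from_level_difference(cld_db)

# Applying the gain to the down-mix recovers the speech power per band.
estimated_speech_power = gain ** 2 * noisy_power
print(estimated_speech_power)
```

Under these assumptions the gain has the same form as a Wiener-style spectral weighting, which is why a well-estimated level cue is sufficient to separate the speech from the down-mix in this toy setting.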
Pages: 4