F0 estimation of speech based on IRAPT using WLP-based TV-CAR analysis

被引:0
|
作者
Shan, Wei [1 ]
Funaki, Keiichi [2 ]
机构
[1] Univ Ryukyus, Grad Sch Engn & Sci, Nishihara, Okinawa, Japan
[2] Univ Ryukyus, C&N Ctr, Nishihara, Okinawa, Japan
关键词
F-0; estimation; IRAPT; WLP; complex analysis; analytic signal; LINEAR PREDICTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fundamental frequency (F-0) estimation plays an important role in speech processing such as speech coding, synthesis, recognition and so on. Although a present F-0 estimation method performs well under clean condition, the performance deteriorates significantly in noisy environment. For this reason robust F-0 estimation against additive noise is demanded. We have previously proposed F-0 estimation methods based on Time-Varying Complex AR (TV-CAR) analysis whose criterion is the weighted correlation of the complex residual obtained by the TV-CAR analysis, sum of the harmonics for the complex residual spectrum, or so on. On the other hand, E. Azarov et al. have proposed an improved method of RAPT (Robust Algorithm for Pitch Tracking) using an instantaneous harmonics that is called IRAPT (Instantaneous RAPT). The IRAPT can perform better estimation than RAPT. Since IRAPT uses band-limited analytic signal to obtain harmonic frequencies, the complex residual signal obtained by the TV-CAR analysis can also be applied to the IRAPT. In this paper, novel F-0 estimation method using the instantaneous frequency based on the robust WLP (Weighted Linear Prediction) TV-CAR residual is proposed and evaluated.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] NEURAL-NETWORK-BASED F0 TEXT-TO-SPEECH SYNTHESIZER FOR MANDARINE
    HWANG, SH
    CHEN, SH
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (06): : 384 - 390
  • [42] JOINT MODELLING OF VOICING LABEL AND CONTINUOUS F0 FOR HMM BASED SPEECH SYNTHESIS
    Yu, K.
    Young, S.
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4572 - 4575
  • [43] Soft context clustering for F0 modeling in HMM-based speech synthesis
    Soheil Khorram
    Hossein Sameti
    Simon King
    EURASIP Journal on Advances in Signal Processing, 2015
  • [44] Development and Perceptual Evaluation of Amplitude-Based F0 Control in Electrolarynx Speech
    Saikachi, Yoko
    Stevens, Kenneth N.
    Hillman, Robert E.
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2009, 52 (05): : 1360 - 1369
  • [45] Soft context clustering for F0 modeling in HMM-based speech synthesis
    Khorram, Soheil
    Sameti, Hossein
    King, Simon
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2015,
  • [46] MULTI-LAYER F0 MODELING FOR HMM-BASED SPEECH SYNTHESIS
    Wang, Cheng-Cheng
    Ling, Zhen-Hua
    Zhang, Bu-Fan
    Dai, Li-Rong
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 129 - 132
  • [47] SINGING VOICE ANALYSIS AND EDITING BASED ON MU TUALLY DEPENDENT F0 ESTIMATION AND SOURCE SEPARATION
    Ikemiya, Yukara
    Yoshii, Kazuyoshi
    Itoyama, Katsutoshi
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 574 - 578
  • [48] F0 Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition
    Wang, Xiaoyun
    Lu, Xugang
    Kawai, Hisashi
    Yamamoto, Seiichi
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 973 - +
  • [49] HMM-BASED EXPRESSIVE SPEECH SYNTHESIS BASED ON PHRASE-LEVEL F0 CONTEXT LABELING
    Maeno, Yu
    Nose, Takashi
    Kobayashi, Takao
    Koriyama, Tomoki
    Ijima, Yusuke
    Nakajima, Hideharu
    Mizuno, Hideyuki
    Yoshioka, Osamu
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7859 - 7863
  • [50] Hear Your Face: Face-based voice conversion with F0 estimation
    Lee, Jaejun
    Oh, Yoori
    Hwang, Injune
    Lee, Kyogu
    INTERSPEECH 2024, 2024, : 4378 - 4382