Robust F0 estimation using ELS-based robust complex speech analysis

被引:0
|
作者
Funaki, Keiichi [1 ]
Kinjo, Tatsuhiko [2 ]
机构
[1] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 9030213, Japan
[2] Toyota Commun Syst CO LTD, Higashi Ku, Nagoya, Aichi 4610005, Japan
关键词
F0; estimation; analytic signal; ELS (Extended Least Square); robust complex speech analysis; IRS filtered speech;
D O I
10.1093/ietfec/e91-a.3.868
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Complex speech analysis for an analytic speech signal can accurately estimate the spectrum in low frequencies since the analytic signal provides spectrum only over positive frequencies. The remarkable feature makes it possible to realize more accurate F0 estimation using complex residual signal extracted by complex-valued speech analysis. We have already proposed F0 estimation using complex LPC residual, in which the autocorrelation function weighted by AMDF was adopted as the criterion. The method adopted MMSE-based complex LPC analysis and it has been reported that it can estimate more accurate F0 for IRS filtered speech corrupted by white Gauss noise although it can not work better for the IRS filtered speech corrupted by pink noise. In this paper, robust complex speech analysis based on ELS (Extended Least Square) method is introduced in order to overcome the drawback. The experimental results for additive white Gauss or pink noise demonstrate that the proposed algorithm based on robust ELS-based complex AR analysis can perform better than other methods.
引用
收藏
页码:868 / 871
页数:4
相关论文
共 50 条
  • [21] F0, LPC, and MFCC Analysis for Emotion Recognition Based on Speech
    Teixeira, Felipe L.
    Teixeira, Joao Paulo
    Soares, Salviano F. P.
    Pio Abreu, J. L.
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2022, 2022, 1754 : 389 - 404
  • [22] F0 Estimation Using Empirical Mode Decomposition and Complex Cepstrum Analysis in Reverberant Environments
    Boonkla, Surasak
    Unoki, Masashi
    Wutiwiwatchai, Chai
    Makhanov, Stanislav S.
    2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 980 - 986
  • [23] F0 generation in a text-to-speech system using a database of natural F0 patterns
    da Silva, CH
    Nagle, EJ
    Runstein, F
    Violaro, F
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 213 - 218
  • [24] Robust F0 Estimation Based on Log-Time Scale Autocorrelation and Its Application to Mandarin Tone Recognition
    Kida, Yusuke
    Sakai, Masaru
    Masuko, Takashi
    Kawamura, Akinori
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2931 - 2934
  • [25] Combining Atom Decomposition of the F0 Track and HMM-based Phonological Phrase Modelling for Robust Stress Detection in Speech
    Szaszak, Gyorgy
    Tundik, Mate Akos
    Gerazov, Branislav
    Gjoreski, Aleksandar
    SPEECH AND COMPUTER, 2016, 9811 : 165 - 173
  • [26] F0 analysis for Japanese conversational speech synthesis
    Nakajima, Hideharu
    Sagisaka, Yoshinori
    2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 137 - +
  • [27] Effects of F0 Estimation Algorithms on Ultrasound- Based Silent Speech Interfaces
    Dai, Pengyu
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 47 - 51
  • [28] On a robust ASR based on complex AR speech analysis
    Higa, Keita
    Funaki, Keiichi
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1232 - 1235
  • [29] Multi-Microphone Periodicity Function for Robust F0 Estimation in Real Noisy and Reverberant Environments
    Flego, Federico
    Omologo, Maurizio
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2146 - 2149
  • [30] Noise robust F0 determination and epoch-marking algorithms
    Kotnik, Bojan
    Hoege, Harald
    Kacic, Zdravko
    SIGNAL PROCESSING, 2009, 89 (12) : 2555 - 2569