Robust F0 estimation using ELS-based robust complex speech analysis

被引:0
|
作者
Funaki, Keiichi [1 ]
Kinjo, Tatsuhiko [2 ]
机构
[1] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 9030213, Japan
[2] Toyota Commun Syst CO LTD, Higashi Ku, Nagoya, Aichi 4610005, Japan
关键词
F0; estimation; analytic signal; ELS (Extended Least Square); robust complex speech analysis; IRS filtered speech;
D O I
10.1093/ietfec/e91-a.3.868
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Complex speech analysis for an analytic speech signal can accurately estimate the spectrum in low frequencies since the analytic signal provides spectrum only over positive frequencies. The remarkable feature makes it possible to realize more accurate F0 estimation using complex residual signal extracted by complex-valued speech analysis. We have already proposed F0 estimation using complex LPC residual, in which the autocorrelation function weighted by AMDF was adopted as the criterion. The method adopted MMSE-based complex LPC analysis and it has been reported that it can estimate more accurate F0 for IRS filtered speech corrupted by white Gauss noise although it can not work better for the IRS filtered speech corrupted by pink noise. In this paper, robust complex speech analysis based on ELS (Extended Least Square) method is introduced in order to overcome the drawback. The experimental results for additive white Gauss or pink noise demonstrate that the proposed algorithm based on robust ELS-based complex AR analysis can perform better than other methods.
引用
收藏
页码:868 / 871
页数:4
相关论文
共 50 条
  • [41] Review of F0 modelling and generation in HMM based speech synthesis
    Yu, Kai
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 599 - 604
  • [42] Extraction of important sentences for speech summarization based on an F0 model
    Inoue, Akira
    Yamashita, Yoichi
    Acoustical Science and Technology, 2003, 24 (01) : 35 - 37
  • [43] Improving F0 Prediction Using Bidirectional Associative Memories and Syllable-Level F0 Features for HMM-based Mandarin Speech Synthesis
    Gao, Li
    Ling, Zhen-Hua
    Chen, Ling-Hui
    Dai, Li-Rong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 275 - 279
  • [44] On an improved F0 estimation based on l2-norm regularized TV-CAR speech analysis using pre-filter
    Funaki, Keiichi
    IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
  • [45] Robust F-0 and jitter estimation in pathological voices
    Vieira, MN
    McInnes, FR
    Jack, MA
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 745 - 748
  • [46] Evaluation of a noise-robust multi-stream speaker verification method using F0 information
    Asami, Taichi
    Iwano, Koji
    Furui, Sadaoki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 549 - 557
  • [47] A Method for Automatically Estimating F0 Model Parameters and A Speech Re-Synthesis Tool Using F0 Model and STRAIGHT
    Sato, Shota
    Kimura, Taro
    Horiuchi, Yasuo
    Nishida, Masafumi
    Kuroiwa, Shingo
    Ichikawa, Akira
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 545 - +
  • [48] Robust ASR Based on ETSI Advanced Front-End Using Complex Speech Analysis
    Higa, Keita
    Funaki, Keiichi
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (11): : 2211 - 2219
  • [49] Combining F0 and non-negative constraint robust principal component analysis for singing voice separation
    Li, Feng
    Akagi, Masato
    SIGNAL PROCESSING, 2020, 170
  • [50] ON AN IMPROVED F0 ESTIMATION BASED ON l2-NORM REGULARIZED TV-CAR SPEECH
    Funaki, Keiichi
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 932 - 938