Robust F0 estimation using ELS-based robust complex speech analysis

被引：0

作者：

Funaki, Keiichi ^{[1
]}

Kinjo, Tatsuhiko ^{[2
]}

机构：

[1] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 9030213, Japan

[2] Toyota Commun Syst CO LTD, Higashi Ku, Nagoya, Aichi 4610005, Japan

来源：

IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES | 2008年 / E91A卷 / 03期

关键词：

F0; estimation; analytic signal; ELS (Extended Least Square); robust complex speech analysis; IRS filtered speech;

D O I：

10.1093/ietfec/e91-a.3.868

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Complex speech analysis for an analytic speech signal can accurately estimate the spectrum in low frequencies since the analytic signal provides spectrum only over positive frequencies. The remarkable feature makes it possible to realize more accurate F0 estimation using complex residual signal extracted by complex-valued speech analysis. We have already proposed F0 estimation using complex LPC residual, in which the autocorrelation function weighted by AMDF was adopted as the criterion. The method adopted MMSE-based complex LPC analysis and it has been reported that it can estimate more accurate F0 for IRS filtered speech corrupted by white Gauss noise although it can not work better for the IRS filtered speech corrupted by pink noise. In this paper, robust complex speech analysis based on ELS (Extended Least Square) method is introduced in order to overcome the drawback. The experimental results for additive white Gauss or pink noise demonstrate that the proposed algorithm based on robust ELS-based complex AR analysis can perform better than other methods.

引用

页码：868 / 871

页数：4

共 50 条

[41] Review of F0 modelling and generation in HMM based speech synthesis
Yu, Kai
PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 599 - 604
[42] Extraction of important sentences for speech summarization based on an F0 model
Inoue, Akira
Yamashita, Yoichi
Acoustical Science and Technology, 2003, 24 (01) : 35 - 37
[43] Improving F0 Prediction Using Bidirectional Associative Memories and Syllable-Level F0 Features for HMM-based Mandarin Speech Synthesis
Gao, Li
Ling, Zhen-Hua
Chen, Ling-Hui
Dai, Li-Rong
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 275 - 279
[44] On an improved F0 estimation based on l2-norm regularized TV-CAR speech analysis using pre-filter
Funaki, Keiichi
IECON 2021 - 47TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2021,
[45] Robust F-0 and jitter estimation in pathological voices
Vieira, MN
McInnes, FR
Jack, MA
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 745 - 748
[46] Evaluation of a noise-robust multi-stream speaker verification method using F0 information
Asami, Taichi
Iwano, Koji
Furui, Sadaoki
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 549 - 557
[47] A Method for Automatically Estimating F0 Model Parameters and A Speech Re-Synthesis Tool Using F0 Model and STRAIGHT
Sato, Shota
Kimura, Taro
Horiuchi, Yasuo
Nishida, Masafumi
Kuroiwa, Shingo
Ichikawa, Akira
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 545 - +
[48] Robust ASR Based on ETSI Advanced Front-End Using Complex Speech Analysis
Higa, Keita
Funaki, Keiichi
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2015, E98A (11): : 2211 - 2219
[49] Combining F0 and non-negative constraint robust principal component analysis for singing voice separation
Li, Feng
Akagi, Masato
SIGNAL PROCESSING, 2020, 170
[50] ON AN IMPROVED F0 ESTIMATION BASED ON l2-NORM REGULARIZED TV-CAR SPEECH
Funaki, Keiichi
2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 932 - 938

← 1 2 3 4 5 →