Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency

Cited by: 0
Authors
Arifianto, D [1]
Tanaka, T [1]
Masuko, T [1]
Kobayashi, T [1]
Affiliation
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
Keywords
instantaneous frequency amplitude spectrum; harmonicity measure; fundamental frequency estimation
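DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract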
Borrowing the notion of instantaneous frequency developed in the context of time-frequency signal analysis, we introduce an instantaneous frequency amplitude spectrum (IFAS) for estimating the fundamental frequency of speech signals in both noiseless and adverse environments. We define a harmonicity measure as a quantity that indicates the degree of periodic regularity in the IFAS and that differs substantially between periodic signals and noise-like waveforms. The harmonicity measure is applied to estimate the existence of a fundamental frequency. We provide experimental examples to demonstrate the general applicability of the harmonicity measure and apply the proposed procedure to Japanese continuous speech signals. The results show that the proposed method outperforms conventional methods both with and without noise.
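The abstract does not specify how the IFAS or the harmonicity measure is computed, so the following is only a minimal sketch of the underlying notion of instantaneous frequency, not the authors' method. It assumes the common analytic-signal definition (phase derivative of the analytic signal), and the function names, the 200 Hz test tone, and the 8 kHz sampling rate are all hypothetical choices made for illustration. The spread of the instantaneous frequency is printed only to show, loosely, how a periodic signal and a noise-like waveform can differ; it is not the harmonicity measure of the paper.

```python
import numpy as np


def analytic_signal(x):
    """Analytic signal via the FFT: keep DC, double positive bins, zero negative bins."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0          # Nyquist bin is kept once for even lengths
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)


def instantaneous_frequency(x, fs):
    """Instantaneous frequency in Hz, from the sample-to-sample phase difference."""
    phase = np.unwrap(np.angle(analytic_signal(x)))
    return np.diff(phase) * fs / (2.0 * np.pi)


# Hypothetical test signals (assumptions, not data from the paper).
fs = 8000                                  # sampling rate in Hz
t = np.arange(fs) / fs                     # one second of samples
tone = np.cos(2.0 * np.pi * 200.0 * t)     # periodic signal at 200 Hz
noise = np.random.default_rng(0).standard_normal(fs)  # noise-like waveform

if_tone = instantaneous_frequency(tone, fs)
if_noise = instantaneous_frequency(noise, fs)

median_if = float(np.median(if_tone))
spread_tone = float(np.std(if_tone))
spread_noise = float(np.std(if_noise))

print("median instantaneous frequency of tone (Hz):", median_if)
print("spread for tone:", spread_tone, "spread for noise:", spread_noise)
```

For the pure tone the instantaneous frequency stays close to the tone frequency, whereas for the noise it fluctuates widely from sample to sample. Speech has many harmonics, so a practical estimator would likely work per frequency band or per short-time frame rather than on the full-band signal as done here.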
Pages: 2812-2820
Page count: 9
Related papers
50 in total
  • [1] Robust F0 estimation using ELS-based robust complex speech analysis
    Funaki, Keiichi
    Kinjo, Tatsuhiko
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (03) : 868 - 871
  • [2] On a Robust F0 Estimation of Speech based on IRAPT using Robust TV-CAR Analysis
    Hotta, Kazushi
    Funaki, Keiichi
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [3] ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION
    Kurth, Frank
    Cornaggia-Urrigshardt, Alessia
    Urrigshardt, Sebastian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [4] F0 estimation of noisy speech based on complex speech analysis
    Kinjo, Tatsuhiko
    Funaki, Keiichi
    2006 IEEE 12TH DIGITAL SIGNAL PROCESSING WORKSHOP & 4TH IEEE SIGNAL PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, 2006, : 434 - 437
  • [5] F0 Estimation of Speech Using SRH Based on TV-CAR Speech Analysis
    Funaki, Keiichi
    Higa, Takehito
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (11) : 2187 - 2190
  • [6] Robust F0 estimation based on complex LPC analysis for IRS filtered noisy speech
    Funaki, Keiichi
    Kinjo, Tatsuhiko
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2007, E90A (08) : 1579 - 1586
  • [7] Noise robust speech recognition using F0 contour information
    Iwano, K
    Seki, T
    Furui, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05) : 1102 - 1109
  • [8] F0 ESTIMATION FOR NOISY SPEECH BASED ON EXPLORING LOCAL TIME-FREQUENCY SEGMENT
    Wang, Dongmei
    Hansen, John H. L.
    Tobey, Emily
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [9] F0 Contour Estimation using ELS-based Robust Time-Varying Complex Speech Analysis
    Funaki, Keiichi
    2011 IEEE DIGITAL SIGNAL PROCESSING WORKSHOP AND IEEE SIGNAL PROCESSING EDUCATION WORKSHOP (DSP/SPE), 2011, : 313 - 316
  • [10] F0 ESTIMATION USING SRH BASED ON TV-CAR SPEECH ANALYSIS
    Funaki, Keiichi
    Higa, Takehito
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2777 - 2781