Noise robust F0 determination and epoch-marking algorithms

Cited by: 8
Authors
Kotnik, Bojan [1]
Hoege, Harald [2]
Kacic, Zdravko [3]
Affiliations
[1] ULTRA Doo, Res Ctr Maribor, SI-2000 Maribor, Slovenia
[2] Siemens AG, Corp Technol, Profess Speech Proc, D-81739 Munich, Germany
[3] Univ Maribor, Fac Elect Engn & Comp Sci, SI-2000 Maribor, Slovenia
Keywords
Fundamental frequency; Glottal closure instant; Epoch marking; Voicing detection; Artificial neural network; FUNDAMENTAL-FREQUENCY ESTIMATION; PITCH DETERMINATION; EXTRACTION; SPEECH;
DOI
10.1016/j.sigpro.2009.04.017
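Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code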
0808 ; 0809 ;
Abstract
This paper presents a combined pitch frequency (F0) determination and epoch (pitch period) marking procedure, CPDMA, that uses merged normalized forward-backward correlation. The algorithm consists of several processing steps: preprocessing of the input speech signal, voicing detection using artificial neural networks, an F0 determination stage based on normalized correlation, F0 contour postprocessing applying partial Viterbi traceback, and finally, epoch (or pitch period) marking. To evaluate the proposed CPDMA procedure against other algorithms, a manually segmented PDA/PMA reference database based on the real-life SPEECON Spanish speech database was created. A set of criteria was proposed to objectively and compactly evaluate the performance of any PDA/PMA or voicing detection algorithm. The performance of the proposed CPDMA was compared with that of the well-known and publicly available PRAAT toolkit. In both PDA and PMA performance, CPDMA significantly outperformed the PRAAT toolkit in all of its considered configurations: autocorrelation method (PRAAT_AC), cross-correlation method (PRAAT_CC), SHS (PRAAT_SHS), and point process (PRAAT_PP). The superior noise robustness of CPDMA is achieved at the expense of a more complex algorithm, which leads to a worse real-time factor than PRAAT. (C) 2009 Elsevier B.V. All rights reserved.
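The abstract does not give the correlation formula, so the following is only a minimal sketch of the general idea of scoring candidate pitch lags with a forward and a backward normalized correlation. The function names, the search range (60 to 400 Hz), and the choice to merge the two scores by simple averaging are assumptions made for illustration and are not taken from the paper. The voicing detection, Viterbi postprocessing, and epoch marking stages are not covered.

```python
import numpy as np


def normalized_correlation(a, b):
    """Normalized correlation of two equal-length segments (0.0 if either has no energy)."""
    denom = np.sqrt(np.dot(a, a) * np.dot(b, b))
    return float(np.dot(a, b) / denom) if denom > 0 else 0.0


def estimate_f0(signal, center, fs, f0_min=60.0, f0_max=400.0):
    """Pick the lag that maximizes the averaged forward/backward correlation.

    Hypothetical helper: averaging is an assumed way of merging the two
    scores, not the merging rule used by CPDMA.
    """
    signal = np.asarray(signal, dtype=float)
    lag_min = int(fs / f0_max)
    lag_max = int(fs / f0_min)
    win = lag_max  # analysis window length in samples (assumed)
    if center - lag_max < 0 or center + lag_max + win > len(signal):
        raise ValueError("not enough samples around the analysis point")

    ref = signal[center:center + win]
    best_lag, best_score = lag_min, -1.0
    for lag in range(lag_min, lag_max + 1):
        forward = signal[center + lag:center + lag + win]
        backward = signal[center - lag:center - lag + win]
        score = 0.5 * (normalized_correlation(ref, forward)
                       + normalized_correlation(ref, backward))
        if score > best_score:
            best_lag, best_score = lag, score
    return fs / best_lag, best_score
```

Calling `estimate_f0(x, center, fs)` on a frame returns an F0 candidate in Hz together with its correlation score. In a full system the score would typically feed a voicing decision and a contour tracker rather than being used directly.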
Pages: 2555 - 2569
Number of pages: 15
Related papers
50 in total
  • [21] Comparative evaluations of robust and accurate F0 estimates in reverberant environments
    Unoki, Masashi
    Hosorogiya, Toshihiro
    Ishimoto, Yuichi
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4569 - +
  • [22] Process πp → ππN at high energies and moderate momenta transferred to the nucleon and the determination of parameters of the f0(980) and f0(1300)
    V. V. Anisovich
    A. V. Sarantsev
    Physics of Atomic Nuclei, 2003, 66 : 928 - 940
  • [23] Process πp → ππN at high energies and moderate momenta transferred to the nucleon and the determination of parameters of the f0(980) and f0(1300)
    Anisovich, VV
    Sarantsev, AV
    PHYSICS OF ATOMIC NUCLEI, 2003, 66 (05) : 928 - 940
  • [24] Determination of f0-σ mixing angle through Bs0→ J/Ψ f0(980)(σ) decays
    Li, Jing-Wu
    Du, Dong-Sheng
    Lu, Cai-Dian
    EUROPEAN PHYSICAL JOURNAL C, 2012, 72 (11) : 1 - 8
  • [25] Robust F0 estimation using ELS-based robust complex speech analysis
    Funaki, Keiichi
    Kinjo, Tatsuhiko
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (03) : 868 - 871
  • [26] ROBUST F0 ESTIMATION IN NOISY SPEECH SIGNALS USING SHIFT AUTOCORRELATION
    Kurth, Frank
    Cornaggia-Urrigshardt, Alessia
    Urrigshardt, Sebastian
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [27] On a Robust F0 Estimation of Speech based on IRAPT using Robust TV-CAR Analysis
    Hotta, Kazushi
    Funaki, Keiichi
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [28] Effects of F0 Estimation Algorithms on Ultrasound-Based Silent Speech Interfaces
    Dai, Pengyu
    Al-Radhi, Mohammed Salah
    Csapo, Tamas Gabor
    2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 47 - 51
  • [29] Error Evaluation of an F0-Adaptive Spectral Envelope Estimator in Robustness against the Additive Noise and F0 Error
    Morise, Masanori
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (07) : 1405 - 1408
  • [30] Detection and F0 discrimination of harmonic complex tones in the presence of competing tones or noise
    Micheyl, Christophe
    Bernstein, Joshua G. W.
    Oxenham, Andrew J.
    Journal of the Acoustical Society of America, 2006, 120 (03) : 1493 - 1505