ROBUST FUNDAMENTAL FREQUENCY ESTIMATION IN COLOURED NOISE

被引:0
作者
Jaramillo, Alfredo Esquivel [1 ]
Jakobsson, Andreas [2 ]
Nielsen, Jesper Kjaer [1 ]
Christensen, Mads Graesboll [1 ]
机构
[1] Aalborg Univ, CREATE, Audio Anal Lab, Aalborg, Denmark
[2] Lund Univ, Dept Math Stat, Lund, Sweden
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
基金
瑞典研究理事会;
关键词
fundamental frequency; coloured noise; maximum likelihood; pre-whitening; least-squares; LCMV filter; MAXIMUM-LIKELIHOOD; EFFICIENT; SIGNALS;
D O I
10.1109/icassp40776.2020.9053018
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most parametric fundamental frequency estimators make the implicit assumption that any corrupting noise is additive, white Gaussian. Under this assumption, the maximum likelihood (ML) and the least squares estimators are the same, and statistically efficient. However, in the coloured noise case, the estimators differ, and the spectral shape of the corrupting noise should be taken into account. To allow for this, we here propose two schemes that refine the noise statistics and parameter estimates in an iterative manner, one of them based on an approximate ML solution and the other one based on removing the periodic signal obtained from a linearly constrained minimum variance (LCMV) filter. Evaluations on real speech data indicate that the iteration steps improve the estimation accuracy, therefore offering improvement over traditional non-parametric fundamental frequency methods in most of the evaluated scenarios.
引用
收藏
页码:741 / 745
页数:5
相关论文
共 30 条
  • [1] [Anonymous], 2009, SYNTHESIS LECT SPEEC
  • [2] Chu W, 2009, INT CONF ACOUST SPEE, P3969, DOI 10.1109/ICASSP.2009.4960497
  • [3] YIN, a fundamental frequency estimator for speech and music
    de Cheveigné, A
    Kawahara, H
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) : 1917 - 1930
  • [4] Drugman T, 2011, 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, P1984
  • [5] Elvander F., 2019, ARXIV191007016
  • [6] Unbiased MMSE-Based Noise Power Estimation With Low Complexity and Low Tracking Delay
    Gerkmann, Timo
    Hendriks, Richard C.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (04): : 1383 - 1393
  • [7] Hogg AOT, 2019, INT CONF ACOUST SPEE, P5826, DOI 10.1109/ICASSP.2019.8682924
  • [8] Jaramillo A. E., 2019, 2019 27 EUR SIGN PRO
  • [9] Jaramillo AE, 2019, INT CONF ACOUST SPEE, P6495, DOI 10.1109/ICASSP.2019.8683653
  • [10] Jaramillo AE, 2018, EUR SIGNAL PR CONF, P2325, DOI 10.23919/EUSIPCO.2018.8553512