Automatic glottal inverse filtering with the Markov chain Monte Carlo method

Cited by: 8
Authors
Auvinen, Harri [1]
Raitio, Tuomo [2]
Airaksinen, Manu [2]
Siltanen, Samuli [1]
Story, Brad H. [3]
Alku, Paavo [2]
Affiliations
[1] Univ Helsinki, Dept Math & Stat, Helsinki, Finland
[2] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[3] Univ Arizona, Dept Speech & Hearing Sci, Tucson, AZ 85721 USA
Funding
Academy of Finland;
Keywords
Glottal inverse filtering; Markov chain Monte Carlo; JOINT ESTIMATION; GIBBS SAMPLER; VOICE SOURCE; VOCAL-TRACT; SPEECH; QUALITY; SYSTEM; MODEL; FLOW;
DOI
10.1016/j.csl.2013.09.004
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a new glottal inverse filtering (GIF) method that utilizes a Markov chain Monte Carlo (MCMC) algorithm. First, initial estimates of the vocal tract and glottal flow are obtained with an existing GIF method, iterative adaptive inverse filtering (IAIF). Simultaneously, the initially estimated glottal flow is synthesized using the Rosenberg-Klatt (RK) model and filtered with the estimated vocal tract filter to create a synthetic speech frame. In the MCMC estimation process, the first few poles of the initial vocal tract model and the RK excitation parameter are refined in order to minimize the error between the synthetic and original speech signals in the time and frequency domains. MCMC approximates the posterior distribution of the parameters, and the final estimate of the vocal tract is found by averaging the parameter values of the Markov chain. Experiments with synthetic vowels produced by a physical modeling approach show that the MCMC-based GIF method gives more accurate results than two known reference methods. © 2013 Elsevier Ltd. All rights reserved.
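The refinement idea in the abstract (start from an initial estimate, sample parameters with MCMC so that a synthetic signal matches the original, then average the chain) can be illustrated with a toy sketch. The code below is not the authors' algorithm: it assumes a single two-pole resonance in place of a full vocal tract model, an impulse in place of the RK excitation, a time-domain error only, a flat prior over stable poles, and a random-walk Metropolis sampler. All function names, step sizes, and the noise level are hypothetical choices made for illustration.

```python
import math
import random


def two_pole_response(radius, angle, n_samples=64):
    """Impulse response of the all-pole filter
    1 / (1 - 2 r cos(theta) z^-1 + r^2 z^-2), a toy stand-in for a vocal tract."""
    a1 = -2.0 * radius * math.cos(angle)
    a2 = radius * radius
    y = []
    for n in range(n_samples):
        x = 1.0 if n == 0 else 0.0          # impulse excitation (assumption)
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(x - a1 * y1 - a2 * y2)
    return y


def log_posterior(params, target, noise_std):
    """Gaussian log-likelihood of the time-domain error under a flat prior
    restricted to stable poles in the upper half-plane."""
    radius, angle = params
    if not (0.0 < radius < 1.0 and 0.0 < angle < math.pi):
        return -math.inf
    synth = two_pole_response(radius, angle, len(target))
    sq_err = sum((s - t) ** 2 for s, t in zip(synth, target))
    return -0.5 * sq_err / noise_std ** 2


def metropolis(target, init, n_iter=4000, burn_in=1000,
               step=0.01, noise_std=0.1, seed=0):
    """Random-walk Metropolis; returns (posterior mean, post-burn-in chain)."""
    rng = random.Random(seed)
    current = list(init)
    current_lp = log_posterior(current, target, noise_std)
    chain = []
    for _ in range(n_iter):
        proposal = [p + rng.gauss(0.0, step) for p in current]
        lp = log_posterior(proposal, target, noise_std)
        delta = lp - current_lp
        # Accept with probability min(1, exp(delta)).
        if delta >= 0.0 or rng.random() < math.exp(delta):
            current, current_lp = proposal, lp
        chain.append(tuple(current))
    kept = chain[burn_in:]
    mean = tuple(sum(s[i] for s in kept) / len(kept) for i in range(2))
    return mean, kept


if __name__ == "__main__":
    # "Original" frame from known parameters; the initial guess plays the
    # role of a rough estimate from a conventional method.
    target = two_pole_response(0.9, 0.6)
    estimate, _ = metropolis(target, init=(0.85, 0.65))
    print("posterior-mean (radius, angle):", estimate)
```

Averaging the retained samples gives a point estimate, as the abstract describes, while the spread of the chain indicates how tightly the data constrain each parameter.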
Pages: 1139-1155
Page count: 17