ML Estimation of a Stochastic Linear System with the EM Algorithm and Its Application to Speech Recognition

Cited by: 146
Authors
Digalakis, V. [1]
Rohlicek, J. R. [2]
Ostendorf, M. [3]
Affiliations
[1] SRI Int, Menlo Pk, CA 94025 USA
[2] BBN Labs Inc, Cambridge, MA 02138 USA
[3] Boston Univ, Boston, MA 02215 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1993 / Vol. 1 / No. 4
DOI
10.1109/89.242489
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper we present a nontraditional approach to the problem of estimating the parameters of a stochastic linear system. The method is based on the Expectation-Maximization algorithm and can be considered as the continuous analog of the Baum-Welch estimation algorithm for hidden Markov models. We use the algorithm for training the parameters of a dynamical system model that we propose for better representing the spectral dynamics of speech for recognition. We assume that the observed feature vectors of a phone segment are the output of a stochastic linear dynamical system, and we show how the evolution of the dynamics as a function of the segment length can be modeled using alternative assumptions. We show on a phoneme classification task using the TIMIT database that our approach is the first effective use of an explicit model for statistical dependence between frames of speech.
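For orientation, a stochastic linear system of the kind the abstract refers to is commonly written in state-space form. The sketch below uses generic textbook notation that is assumed here and is not taken from the paper: the symbols F, H, Q, R, x_k, y_k and the index range are illustrative. It shows the model and, as one example of an EM step, the standard closed-form update for the state-transition matrix.

```latex
% Assumed notation (not from the paper): x_k is the hidden state,
% y_k the observed feature vector at frame k, Y = (y_0, ..., y_N).
\begin{align}
x_{k+1} &= F x_k + w_k, & w_k &\sim \mathcal{N}(0, Q) \\
y_k     &= H x_k + v_k, & v_k &\sim \mathcal{N}(0, R)
\end{align}
% E-step: with the current parameters (F, H, Q, R), compute the smoothed
% moments E[x_k | Y], E[x_k x_k^T | Y], and E[x_{k+1} x_k^T | Y]
% using a fixed-interval (Kalman-type) smoother.
% M-step (state-transition matrix only, shown as an example):
\begin{equation}
\hat{F} = \Bigl(\sum_{k=0}^{N-1} E\bigl[x_{k+1} x_k^{\top} \mid Y\bigr]\Bigr)
          \Bigl(\sum_{k=0}^{N-1} E\bigl[x_k x_k^{\top} \mid Y\bigr]\Bigr)^{-1}
\end{equation}
```

In this generic form, the hidden continuous state x_k plays the role that the discrete state plays in a hidden Markov model, which is one way to read the abstract's comparison with Baum-Welch estimation.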
Pages: 431-442
Page count: 12
Related papers
50 results in total
  • [41] OFDM channel estimation by a linear EM-map algorithm
    Ocloo, J. M. Mamfoumbi
    Alberge, Florence
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 3779 - 3782
  • [42] PSO Algorithm for Exact Stochastic ML Estimation of DOA for Incoherent Signals
    Chen, Haihua
    Li, Shibao
    Liu, Jianhang
    Suzuki, Masakiyo
    [J]. 2015 15TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT), 2015, : 189 - 192
  • [43] Hybrid genetic algorithm and its application in linear system identification
    Xia, Xiu-Yu
    Zhou, Ji-Liu
    [J]. Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2005, 37 (01): : 104 - 107
  • [44] ML ESTIMATION IN THE POISSON BINOMIAL-DISTRIBUTION WITH GROUPED DATA VIA THE EM ALGORITHM
    ADAMIDIS, K
    LOUKAS, S
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1993, 45 (1-2) : 33 - 39
  • [45] ALGORITHM FOR OPTIMAL SOLUTION OF LINEAR INEQUALITIES AND ITS APPLICATION TO PATTERN-RECOGNITION
    WARMACK, RE
    GONZALEZ, RC
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1973, C 22 (12) : 1065 - 1075
  • [46] Application of EM Algorithm in Problems of Pattern Recognition on Satellite Images
    Akinin, Maxim V.
    Akinina, Alexandra V.
    Sokolov, Alexey V.
    Tarasov, Andrey S.
    [J]. 2017 6TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2017, : 175 - 178
  • [47] Bark wavelet transform of speech and its application in speech recognition
    Fu, Qiang
    Yi, Kechu
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2000, 28 (10): : 102 - 105
  • [48] Speech recognition and its application in voice-based robot control system
    Luo, ZZ
    Zhao, JB
    [J]. PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON INTELLIGENT MECHATRONICS AND AUTOMATION, 2004, : 960 - 963
  • [49] Syllable Similarity and Its Application in Speech Recognition
    Li Honglian
    Pan Jianjun
    Fan Jing
    [J]. ICWMMN 2010, PROCEEDINGS, 2010, : 302 - 306
  • [50] Duration and its application in continuous speech recognition
    ZHAO Qingwei XIAO Xi WANG Zuoying LU Dajin (Department of Electronic Engineering
    [J]. Chinese Journal of Acoustics, 2000, (03) : 259 - 269