Probabilistic Linear Discriminant Analysis for Acoustic Modeling

被引:7
作者
Lu, Liang [1 ]
Renals, Steve [1 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Acoustic modeling; automatic speech recognition; probabilistic linear discriminant analysis; GAUSSIAN MIXTURE-MODELS; HIDDEN MARKOV-MODELS; COVARIANCE MATRICES; NEURAL-NETWORKS; SPEECH; RECOGNITION;
D O I
10.1109/LSP.2014.2313410
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose a new acoustic modeling approach for automatic speech recognition based on probabilistic linear discriminant analysis (PLDA), which is used to model the state density function for the standard hidden Markov models (HMMs). Unlike the conventional Gaussian mixture models (GMMs) where the correlations are weakly modelled by using the diagonal covariance matrices, PLDA captures the correlations of feature vector in subspaces without vastly expanding the model. It also allows the usage of high dimensional feature input, and therefore is more flexible to make use of different type of acoustic features. We performed the preliminary experiments on the Switchboard corpus, and demonstrated the feasibility of this acoustic model.
引用
收藏
页码:702 / 706
页数:5
相关论文
共 50 条
  • [21] Unsupervised emotion recognition algorithm based on improved deep belief model in combination with probabilistic linear discriminant analysis
    Xiao, Ying
    Wang, Deyan
    Hou, Ligong
    PERSONAL AND UBIQUITOUS COMPUTING, 2019, 23 (3-4) : 553 - 562
  • [22] LINEAR DISCRIMINANT ANALYSIS WITH FEW TRAINING DATA
    Markopoulos, Panos P.
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 4626 - 4630
  • [23] A subset method for improving Linear Discriminant Analysis
    Yao, Chao
    Lu, Zhaoyang
    Li, Jing
    Xu, Yamei
    Han, Jungong
    NEUROCOMPUTING, 2014, 138 : 310 - 315
  • [24] Cross Local Gabor Binary Pattern Descriptor with Probabilistic Linear Discriminant Analysis for Pose-Invariant Face Recognition
    Jami, SantoshKumar
    Chalamala, Srinivasa Rao
    Kakkirala, Krishna Rao
    2017 19TH UKSIM-AMSS INTERNATIONAL CONFERENCE ON MATHEMATICAL MODELLING & COMPUTER SIMULATION (UKSIM), 2017, : 39 - 44
  • [25] Acoustic Modeling With Hierarchical Reservoirs
    Triefenbach, Fabian
    Jalalvand, Azarakhsh
    Demuynck, Kris
    Martens, Jean-Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2439 - 2450
  • [26] Bearing Performance Degradation Assessment Using Linear Discriminant Analysis and Coupled HMM
    Liu, T.
    Chen, J.
    Zhou, X. N.
    Xiao, W. B.
    25TH INTERNATIONAL CONGRESS ON CONDITION MONITORING AND DIAGNOSTIC ENGINEERING (COMADEM 2012), 2012, 364
  • [27] Linear Collaborative Discriminant Regression and Cepstra Features for Hindi Speech Recognition
    Patil, U. G.
    Shirbahadurkar, S. D.
    Paithane, A. N.
    JOURNAL OF ENGINEERING RESEARCH, 2019, 7 (04): : 96 - 114
  • [28] Multi-View Linear Discriminant Analysis Network
    Hu, Peng
    Peng, Dezhong
    Sang, Yongsheng
    Xiang, Yong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5352 - 5365
  • [29] Spectral classification by generative adversarial linear discriminant analysis
    Cao, Ziyi
    Zhang, Shijie
    Liu, Youlin
    Smith, Casey J.
    Sherman, Alex M.
    Hwang, Yechan
    Simpson, Garth J.
    ANALYTICA CHIMICA ACTA, 2023, 1261
  • [30] Varying coefficient linear discriminant analysis for dynamic data
    Bao, Yajie
    Liu, Yuyang
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (02): : 5378 - 5436