Probabilistic Linear Discriminant Analysis for Acoustic Modeling

被引:7
作者
Lu, Liang [1 ]
Renals, Steve [1 ]
机构
[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Acoustic modeling; automatic speech recognition; probabilistic linear discriminant analysis; GAUSSIAN MIXTURE-MODELS; HIDDEN MARKOV-MODELS; COVARIANCE MATRICES; NEURAL-NETWORKS; SPEECH; RECOGNITION;
D O I
10.1109/LSP.2014.2313410
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this letter, we propose a new acoustic modeling approach for automatic speech recognition based on probabilistic linear discriminant analysis (PLDA), which is used to model the state density function for the standard hidden Markov models (HMMs). Unlike the conventional Gaussian mixture models (GMMs) where the correlations are weakly modelled by using the diagonal covariance matrices, PLDA captures the correlations of feature vector in subspaces without vastly expanding the model. It also allows the usage of high dimensional feature input, and therefore is more flexible to make use of different type of acoustic features. We performed the preliminary experiments on the Switchboard corpus, and demonstrated the feasibility of this acoustic model.
引用
收藏
页码:702 / 706
页数:5
相关论文
共 50 条
  • [31] Bird Sounds Classification Using Linear Discriminant Analysis
    Ramashini, Murugiaya
    Abas, Pg Emeroylariffion
    Grafe, Ulmar
    De Silva, Liyanage C.
    2019 4TH INTERNATIONAL CONFERENCE AND WORKSHOPS ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE): THRIVING TECHNOLOGIES, 2019,
  • [32] Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis
    Bandos, Tatyana V.
    Bruzzone, Lorenzo
    Camps-Valls, Gustavo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03): : 862 - 873
  • [33] Face Recognition with Linear Discriminant Analysis and Neural Networks
    Fatahi, Sepide
    Zadkhosh, Ehsan
    Chalechale, Abdollah
    2013 FIRST IRANIAN CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (PRIA), 2013,
  • [34] An Improved method of Two Stage Linear Discriminant Analysis
    Chen, Yarui
    Tao, Xin
    Xiong, Congcong
    Yang, Jucheng
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (03): : 1243 - 1263
  • [35] Multimodal Linear Discriminant Analysis via Structural Sparsity
    Zhang, Yu
    Jiang, Yuan
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3448 - 3454
  • [36] Tumor classification based on orthogonal linear discriminant analysis
    Wang, Huiya
    Zhang, Shanwen
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2014, 24 (01) : 1399 - 1405
  • [37] Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework
    Razavi, Marzieh
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    SPEECH COMMUNICATION, 2016, 80 : 1 - 21
  • [38] Linear discriminant trees
    Yildiz, OT
    Alpaydin, E
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (03) : 323 - 353
  • [39] Probabilistic Acoustic Volume Analysis for Speech Affected by Depression
    Cummins, Nicholas
    Sethu, Vidhyasaharan
    Epps, Julien
    Krajewski, Jarek
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1238 - 1242
  • [40] Generalized robust linear discriminant analysis for jointly sparse learning
    Zhu, Yufei
    Lai, Zhihui
    Gao, Can
    Kong, Heng
    APPLIED INTELLIGENCE, 2024, 54 (19) : 9508 - 9523