Probabilistic Linear Discriminant Analysis for Acoustic Modeling

被引：7

作者：

Lu, Liang ^{[1
]}

Renals, Steve ^{[1
]}

机构：

[1] Univ Edinburgh, Edinburgh, Midlothian, Scotland

来源：

IEEE SIGNAL PROCESSING LETTERS | 2014年 / 21卷 / 06期

基金：

英国工程与自然科学研究理事会;

关键词：

Acoustic modeling; automatic speech recognition; probabilistic linear discriminant analysis; GAUSSIAN MIXTURE-MODELS; HIDDEN MARKOV-MODELS; COVARIANCE MATRICES; NEURAL-NETWORKS; SPEECH; RECOGNITION;

D O I：

10.1109/LSP.2014.2313410

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this letter, we propose a new acoustic modeling approach for automatic speech recognition based on probabilistic linear discriminant analysis (PLDA), which is used to model the state density function for the standard hidden Markov models (HMMs). Unlike the conventional Gaussian mixture models (GMMs) where the correlations are weakly modelled by using the diagonal covariance matrices, PLDA captures the correlations of feature vector in subspaces without vastly expanding the model. It also allows the usage of high dimensional feature input, and therefore is more flexible to make use of different type of acoustic features. We performed the preliminary experiments on the Switchboard corpus, and demonstrated the feasibility of this acoustic model.

引用

页码：702 / 706

页数：5

共 50 条

[31] Bird Sounds Classification Using Linear Discriminant Analysis
Ramashini, Murugiaya
Abas, Pg Emeroylariffion
Grafe, Ulmar
De Silva, Liyanage C.
2019 4TH INTERNATIONAL CONFERENCE AND WORKSHOPS ON RECENT ADVANCES AND INNOVATIONS IN ENGINEERING (ICRAIE): THRIVING TECHNOLOGIES, 2019,
[32] Classification of Hyperspectral Images With Regularized Linear Discriminant Analysis
Bandos, Tatyana V.
Bruzzone, Lorenzo
Camps-Valls, Gustavo
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (03): : 862 - 873
[33] Face Recognition with Linear Discriminant Analysis and Neural Networks
Fatahi, Sepide
Zadkhosh, Ehsan
Chalechale, Abdollah
2013 FIRST IRANIAN CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS (PRIA), 2013,
[34] An Improved method of Two Stage Linear Discriminant Analysis
Chen, Yarui
Tao, Xin
Xiong, Congcong
Yang, Jucheng
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (03): : 1243 - 1263
[35] Multimodal Linear Discriminant Analysis via Structural Sparsity
Zhang, Yu
Jiang, Yuan
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3448 - 3454
[36] Tumor classification based on orthogonal linear discriminant analysis
Wang, Huiya
Zhang, Shanwen
BIO-MEDICAL MATERIALS AND ENGINEERING, 2014, 24 (01) : 1399 - 1405
[37] Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework
Razavi, Marzieh
Rasipuram, Ramya
Magimai-Doss, Mathew
SPEECH COMMUNICATION, 2016, 80 : 1 - 21
[38] Linear discriminant trees
Yildiz, OT
Alpaydin, E
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2005, 19 (03) : 323 - 353
[39] Probabilistic Acoustic Volume Analysis for Speech Affected by Depression
Cummins, Nicholas
Sethu, Vidhyasaharan
Epps, Julien
Krajewski, Jarek
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1238 - 1242
[40] Generalized robust linear discriminant analysis for jointly sparse learning
Zhu, Yufei
Lai, Zhihui
Gao, Can
Kong, Heng
APPLIED INTELLIGENCE, 2024, 54 (19) : 9508 - 9523

← 1 2 3 4 5 →