Ameliorated language modelling for lecture speech recognition of Indian English

被引:0
作者
Phull, Disha Kaur [1 ]
Kumar, G. Bharadwaja [1 ]
机构
[1] VIT Univ, Vandalur Kelambakkam Rd, Madras 600127, Tamil Nadu, India
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2018年 / 43卷 / 12期
关键词
Adapted language model; Indian English; information retrieval; lecture recognition; speech recognition; ADAPTATION;
D O I
10.1007/s12046-018-0976-x
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A great amount of research is growing towards the automatic transcription of lectures that consist of numerous information and knowledge that could be helpful to the educational systems and institutes. In large vocabulary speech recognition, language model plays a paramount role in reducing the humongous search space. However, language modelling is very brittle when moving from one domain to another or when moving from read speech to spontaneous speech. Also, lecture speech recognition will have some of the characteristics of spontaneous speech. Hence, it is very challenging to build the language model for this task. In this paper, a judicious approach to adapt the language model in a way where the language model will be in close proximity to the topic spoken in the lecture speech has been depicted. The evaluation of the language model is devised using the proposed approach with the existing language models such as CMU Sphinx, Gigaword and HUB-4. We observed the results analysis that the language models devised from the proposed approach outperform from the existing language models in terms of word error rate, perplexity and out of vocabulary rate. Analysis shows that the presented two-phase approach has resulted in an average decrease of the word error rate to be approximately 14% and the perplexity is decreased by half on average.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] [Anonymous], 2018, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
  • [2] Bird S, 2006, P ACL 2004 INT POST, P63
  • [3] Latent semantics in language models
    Brychcin, Tomas
    Konopik, Miloslav
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 33 (01) : 88 - 108
  • [4] Leveraging relevance cues for language modeling in speech recognition
    Chen, Berlin
    Chen, Kuan-Yu
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2013, 49 (04) : 807 - 816
  • [5] An empirical study of smoothing techniques for language modeling
    Chen, SF
    Goodman, J
    [J]. COMPUTER SPEECH AND LANGUAGE, 1999, 13 (04) : 359 - 394
  • [6] Topic identification techniques applied to dynamic language model adaptation for automatic speech recognition
    Echeverry-Correa, J. D.
    Ferreiros-Lopez, J.
    Coucheiro-Limeres, A.
    Cordoba, R.
    Montero, J. M.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (01) : 101 - 112
  • [7] Eickhoff C, 2016, ACM SIGWEB NEWSLETTE, P4
  • [8] Glass J., 2004, P WORKSH INT APPR SP, P9
  • [9] Goldberg Yoav., 2017, Synthesis Lectures on Human Language Technologies, DOI 10.2200/S00762ED1V01Y201703HLT037
  • [10] Unsupervised language model adaptation using LDA-based mixture models and latent semantic marginals
    Haidar, Md. Akmal
    O'Shaughnessy, Douglas
    [J]. COMPUTER SPEECH AND LANGUAGE, 2015, 29 (01) : 20 - 31