A Comparative Study of Khasi Speech Recognition Systems with Recurrent Neural Network-Based Language Model

Authors
Deepajothi, S. [1]
Rao, Vuda Sreenivasa [2]
Ambhika, C. [3]
Mandala, Vishwanadham [4]
Rao, R. V. V. N. Bheema [5]
Kumar, Shailendra [6]
Gera, Venkateswara Rao [7]
Nagaraju, D. [8]
Affiliations
[1] SRM Inst Sci & Technol, Dept Comp Technol, Kattankulathur 603203, Tamil Nadu, India
[2] Koneru Lakshmaiah Educ Fdn, Dept Comp Sci & Engn, Vaddeswaram 522302, Andhra Pradesh, India
[3] RMD Engn Coll, Dept AIML, Rsm Nagar, Kavarapetai, India
[4] Indiana Univ, Bloomington, IN USA
[5] Aditya Coll Engn & Technol, Dept Informat Technol, Surampalem, India
[6] Integral Univ Lucknow, Dept ECE, Lucknow 226026, Uttar Pradesh, India
[7] Kallam Haranadhareddy Inst Technol, Dept ECE, Guntur, India
[8] Sri Venkatesa Perumal Coll Engn & Technol, Dept CSE, Puttur, Andhra Pradesh, India
Keywords
Hidden Markov model; Language model; Perceptual linear prediction; Gaussian mixture model; Acoustic model;
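DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract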
This paper presents a comparative analysis of Khasi speech recognition systems that use a recurrent neural network-based language model (RNN-LM). Several acoustic models (AMs) are developed and evaluated to identify the best-performing configuration. The RNN-LM is observed to outperform the other, traditional models. A continuous speech database is collected from recordings and processed with the WaveSurfer tool. The approach yields a reduction in word error rate (WER) in the range of 2.8-3.8% for the major speech data and 2.4-3.5% for the minor speech data. Additionally, two acoustic features are compared, and the experimental results show that Mel frequency cepstral coefficients (MFCC) perform better than perceptual linear prediction (PLP).
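For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of how WER is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the record does not describe the scoring tool the authors used.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: tokenization is plain whitespace splitting, which is
    an assumption and not necessarily what the paper's scoring setup does.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)
```

With this definition, a hypothesis that substitutes one word and drops another from a four-word reference scores 2 / 4 = 0.5, and an exact match scores 0.0.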
Pages: 1296 - 1305
Number of pages: 10