HindiSpeech-Net: a deep learning based robust automatic speech recognition system for Hindi language

被引:4
作者
Sharma, Usha [1 ]
Om, Hari [1 ]
Mishra, A. N. [2 ]
机构
[1] Indian Inst Technol ISM Dhanbad, Dept Comp Sci & Engn, Dhanbad 826004, Bihar, India
[2] Krishna Engn Coll, Ghaziabad 201001, India
关键词
1D-CNN; Convolutional neural network; Hindi language; Deep learning; Speech recognition; FEATURE-EXTRACTION;
D O I
10.1007/s11042-022-14019-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic Speech Recognition (ASR) has become one of the major research areas over the past decade and gained a lot of interest. Their system implementation, adaptation to different languages and robustness in the performance are still some of the major challenges. Hindi is one of the most widely spoken languages in the world but it is a complex and resource-constraint language. Thus, speech recognition and classification systems need to be developed for Hindi language to spread the technology and to explore more communication means. But due to its language complexity than other languages and lack of standard databases, it is quite challenging to develop such systems. Deep learning is extensively used in different research fields and has proven its prominence to a broader extent. In this paper, a seven-layer 1D-convolutional neural network HindiSpeech-Net has been proposed to recognise different speech samples of the Hindi language in the respective category. A large dataset of 2400 speech samples in the Hindi language is collected in ten different classes in real-world conditions which is further accompanied by signal filtering and augmentation to enhance the dataset for making a robust model and avoid overfitting. The collected dataset is divided into training, validation and test set which were evaluated in different performance parameters. The trained HindiSpeech-Net model achieved an accuracy of 92.92% on the test set. The proposed framework is computationally less expensive, works in real-time and is suitable for implementation in embedded systems.
引用
收藏
页码:16173 / 16193
页数:21
相关论文
共 50 条
  • [1] HindiSpeech-Net: a deep learning based robust automatic speech recognition system for Hindi language
    Usha Sharma
    Hari Om
    A. N. Mishra
    Multimedia Tools and Applications, 2023, 82 : 16173 - 16193
  • [2] Automatic Speech Recognition Method Based on Deep Learning Approaches for Uzbek Language
    Mukhamadiyev, Abdinabi
    Khujayarov, Ilyos
    Djuraev, Oybek
    Cho, Jinsoo
    SENSORS, 2022, 22 (10)
  • [3] Customized deep learning based Turkish automatic speech recognition system supported by language model
    Gormez, Yasin
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [4] A Highly Efficient Distributed Deep Learning System For Automatic Speech Recognition
    Zhang, Wei
    Cui, Xiaodong
    Finkler, Ulrich
    Saon, George
    Kayi, Abdullah
    Buyuktosunoglu, Alper
    Kingsbury, Brian
    Kung, David
    Picheny, Michael
    INTERSPEECH 2019, 2019, : 2628 - 2632
  • [5] Speech Recognition and System Controlling using Hindi Language
    Rathor, Sandeep
    Jadon, R. S.
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [6] Deep-Learning-Based BCI for Automatic Imagined Speech Recognition Using SPWVD
    Kamble, Ashwin
    Ghare, Pradnya H.
    Kumar, Vinay
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [7] Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System
    Shahamiri, Seyed Reza
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 852 - 861
  • [8] DISCRIMINATIVE PIECEWISE LINEAR TRANSFORMATION BASED ON DEEP LEARNING FOR NOISE ROBUST AUTOMATIC SPEECH RECOGNITION
    Kashiwagi, Yosuke
    Saito, Daisuke
    Minematsu, Nobuaki
    Hirose, Keikichi
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 350 - 355
  • [9] Improving Deep Learning based Automatic Speech Recognition for Gujarati
    Raval, Deepang
    Pathak, Vyom
    Patel, Muktan
    Bhatt, Brijesh
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (03)
  • [10] Deep Learning for Robust Automatic Modulation Recognition Method for IoT Applications
    Zhang, Tingping
    Shuai, Cong
    Zhou, Yaru
    IEEE ACCESS, 2020, 8 (08): : 117689 - 117697