An Optimal Method for Speech Recognition Based on Neural Network

被引:0
作者
Ishak, Mohamad Khairi [1 ]
Madsen, Dag oivind [2 ]
Al-Zahrani, Fahad Ahmed [3 ]
机构
[1] Univ Sains Malaysia, Sch Elect & Elect Engn, Nibong Tebal 14300, Malaysia
[2] Univ South Eastern Norway, Bredalsveien 14, N-3511 Honefoss, Norway
[3] Umm Al Qura Univ, Comp Engn Dept, Mecca 24381, Saudi Arabia
关键词
Machine learning; neural networks; speech recognition; signal processing; learning process; fluency and accuracy; IMAGES;
D O I
10.32604/iasc.2023.033971
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural language processing technologies have become more widely available in recent years, making them more useful in everyday situations. Machine learning systems that employ accessible datasets and corporate work to serve the whole spectrum of problems addressed in computational linguistics have lately yielded a number of promising breakthroughs. These methods were particularly advantageous for regional languages, as they were provided with cutting-edge language processing tools as soon as the requisite corporate information was generated. The bulk of modern people are unconcerned about the importance of reading. Reading aloud, on the other hand, is an effective technique for nourishing feelings as well as a necessary skill in the learning process. This paper proposed a novel approach for speech recognition based on neural networks. The attention mechanism is first utilized to determine the speech accuracy and fluency assessments, with the spectrum map as the feature extraction input. To increase phoneme identification accuracy, reading precision, for example, employs a new type of deep speech. It makes use of the exportchapter tool, which provides a corpus, as well as the TensorFlow framework in the experimental setting. The experimental findings reveal that the suggested model can more effectively assess spoken speech accuracy and reading fluency than the old model, and its evaluation model's score outcomes are more accurate.
引用
收藏
页码:1951 / 1961
页数:11
相关论文
共 36 条
  • [1] Aggarwal A., 2022, SENSORS-BASEL, V22, P1
  • [2] Badshah A., 2017, IEEE INT C PLATFORM, P53
  • [3] A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms
    Byun, Sung-Woo
    Lee, Seok-Pil
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 15
  • [4] Understanding the Association between Musical Sophistication and Well-Being in Music Students
    Cara, Michel A.
    Lobos, Constanza
    Varas, Mario
    Torres, Oscar
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (07)
  • [5] Deep learning-based facial emotion recognition for human-computer interaction applications
    Chowdary, M. Kalpana
    Nguyen, Tu N.
    Hemanth, D. Jude
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32) : 23311 - 23328
  • [6] Ezzat Souraya, 2012, International Journal of Computer Information Systems and Industrial Management Applications, V4, P619
  • [7] Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network
    Farooq, Misbah
    Hussain, Fawad
    Baloch, Naveed Khan
    Raja, Fawad Riasat
    Yu, Heejung
    Zikria, Yousaf Bin
    [J]. SENSORS, 2020, 20 (21) : 1 - 18
  • [8] Gerosa M, 2006, INT CONF ACOUST SPEE, P393
  • [9] Highly accurate children's speech recognition for interactive reading tutors using subword units
    Hagen, Andreas
    Pellom, Bryan
    Cole, Ronald
    [J]. SPEECH COMMUNICATION, 2007, 49 (12) : 861 - 873
  • [10] Parameters Compressing in Deep Learning
    He, Shiming
    Li, Zhuozhou
    Tang, Yangning
    Liao, Zhuofan
    Li, Feng
    Lim, Se-Jung
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 62 (01): : 321 - 336