An Optimal Method for Speech Recognition Based on Neural Network

被引：0

作者：

Ishak, Mohamad Khairi ^{[1
]}

Madsen, Dag oivind ^{[2
]}

Al-Zahrani, Fahad Ahmed ^{[3
]}

机构：

[1] Univ Sains Malaysia, Sch Elect & Elect Engn, Nibong Tebal 14300, Malaysia

[2] Univ South Eastern Norway, Bredalsveien 14, N-3511 Honefoss, Norway

[3] Umm Al Qura Univ, Comp Engn Dept, Mecca 24381, Saudi Arabia

来源：

INTELLIGENT AUTOMATION AND SOFT COMPUTING | 2023年 / 36卷 / 02期

关键词：

Machine learning; neural networks; speech recognition; signal processing; learning process; fluency and accuracy; IMAGES;

D O I：

10.32604/iasc.2023.033971

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Natural language processing technologies have become more widely available in recent years, making them more useful in everyday situations. Machine learning systems that employ accessible datasets and corporate work to serve the whole spectrum of problems addressed in computational linguistics have lately yielded a number of promising breakthroughs. These methods were particularly advantageous for regional languages, as they were provided with cutting-edge language processing tools as soon as the requisite corporate information was generated. The bulk of modern people are unconcerned about the importance of reading. Reading aloud, on the other hand, is an effective technique for nourishing feelings as well as a necessary skill in the learning process. This paper proposed a novel approach for speech recognition based on neural networks. The attention mechanism is first utilized to determine the speech accuracy and fluency assessments, with the spectrum map as the feature extraction input. To increase phoneme identification accuracy, reading precision, for example, employs a new type of deep speech. It makes use of the exportchapter tool, which provides a corpus, as well as the TensorFlow framework in the experimental setting. The experimental findings reveal that the suggested model can more effectively assess spoken speech accuracy and reading fluency than the old model, and its evaluation model's score outcomes are more accurate.

引用

页码：1951 / 1961

页数：11

共 36 条

[1] Aggarwal A., 2022, SENSORS-BASEL, V22, P1
[2] Badshah A., 2017, IEEE INT C PLATFORM, P53
[3] A Study on a Speech Emotion Recognition System with Effective Acoustic Features Using Deep Learning Algorithms
Byun, Sung-Woo
Lee, Seok-Pil
[J]. APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 15
[4] Understanding the Association between Musical Sophistication and Well-Being in Music Students
Cara, Michel A.
Lobos, Constanza
Varas, Mario
Torres, Oscar
[J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (07)
[5] Deep learning-based facial emotion recognition for human-computer interaction applications
Chowdary, M. Kalpana
Nguyen, Tu N.
Hemanth, D. Jude
[J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32) : 23311 - 23328
[6] Ezzat Souraya, 2012, International Journal of Computer Information Systems and Industrial Management Applications, V4, P619
[7] Impact of Feature Selection Algorithm on Speech Emotion Recognition Using Deep Convolutional Neural Network
Farooq, Misbah
Hussain, Fawad
Baloch, Naveed Khan
Raja, Fawad Riasat
Yu, Heejung
Zikria, Yousaf Bin
[J]. SENSORS, 2020, 20 (21) : 1 - 18
[8] Gerosa M, 2006, INT CONF ACOUST SPEE, P393
[9] Highly accurate children's speech recognition for interactive reading tutors using subword units
Hagen, Andreas
Pellom, Bryan
Cole, Ronald
[J]. SPEECH COMMUNICATION, 2007, 49 (12) : 861 - 873
[10] Parameters Compressing in Deep Learning
He, Shiming
Li, Zhuozhou
Tang, Yangning
Liao, Zhuofan
Li, Feng
Lim, Se-Jung
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2020, 62 (01): : 321 - 336

← 1 2 3 4 →