Value leadership of large language models based on speech recognition and emotion classification algorithms

Authors:

Liu, Huifang [1]

Ye, Yunfeng [2]

Affiliations:

[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China

[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China

Source:

INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION | 2024 / Vol. 24 / No. 04

Keywords:

speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading

DOI:

10.1504/IJBIC.2024.142563

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

An integrated algorithm was proposed to address speech recognition and emotion classification in natural language processing. The algorithm was designed to convert speech accurately into text and then perform sentiment analysis on that text. In simulation experiments, the model's loss was about 0.35 on the training set and about 0.99 on the validation set. After feature extraction, the model's accuracy, gain rate, echo value, and F1 value improved to 0.87, 0.88, 0.88, and 0.88, respectively, a significant improvement. Compared with other similar models, the proposed model achieved a higher overall recognition rate, particularly for anger (90.25%), fear (89.78%), and disgust (90.11%). These results indicate that the model can better understand and generate emotional language expressions and can better support natural language understanding.

Pages: 201-211

Page count: 12
