Value leadership of large language models based on speech recognition and emotion classification algorithms

被引：0

作者：

Liu, Huifang ^{[1
]}

Ye, Yunfeng ^{[2
]}

机构：

[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China

[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China

来源：

INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION | 2024年 / 24卷 / 04期

关键词：

speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading;

D O I：

10.1504/IJBIC.2024.142563

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An integrated algorithm was proposed to better address the issues of speech recognition and emotion classification in the natural language processing. This algorithm was committed to accurately converting speech information into text form and conducting sentiment analysis on it. The simulation experiment results showed that the loss value of the model in the training set was about 0.35, and the loss value in the validation set was about 0.99. After feature extraction, the accuracy, gain rate, echo value, and F1 value of the model were improved to 0.87, 0.88, 0.88, and 0.88, respectively, showing significant improvement. Compared with other similar models, the proposed model had a higher overall recognition rate, especially in emotions such as anger (90.25%), fear (89.78%), and disgust (90.11%). The above results show that this model can better understand and generate emotional language expressions and provide better services for natural language understanding.

引用

页码：201 / 211

页数：12

共 50 条

[41] Multimodal Emotion Recognition Based on Facial Expressions, Speech, and EEG
Pan, Jiahui
Fang, Weijie
Zhang, Zhihang
Chen, Bingzhi
Zhang, Zheng
Wang, Shuihua
IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 396 - 403
[42] A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
Li, Yangze
Wang, Xiong
Cao, Songjun
Zhang, Yike
Ma, Long
Xie, Lei
INTERSPEECH 2024, 2024, : 1905 - 1909
[43] Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
Shen, JL
IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1998, 145 (05): : 309 - 315
[44] Design pattern recognition: a study of large language models
Pandey, Sushant Kumar
Chand, Sivajeet
Horkoff, Jennifer
Staron, Miroslaw
Ochodek, Miroslaw
Durisic, Darko
EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
[45] Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
Liu, Hong
Lv, Zhaobiao
Ou, Zhijian
Zhao, Wenbo
Xiao, Qing
INTERSPEECH 2023, 2023, : 476 - 480
[46] Building DNN acoustic models for large vocabulary speech recognition
Maas, Andrew L.
Qi, Peng
Xie, Ziang
Hannun, Awni Y.
Lengerich, Christopher T.
Jurafsky, Daniel
Ng, Andrew Y.
COMPUTER SPEECH AND LANGUAGE, 2017, 41 : 195 - 213
[47] A speech recognition algorithm based on the features of Croatian language
Peic, R
PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 613 - 618
[48] Multimodal Food Image Classification with Large Language Models
Kim, Jun-Hwa
Kim, Nam-Ho
Jo, Donghyeok
Won, Chee Sun
ELECTRONICS, 2024, 13 (22)
[49] End-to-End Large Vocabulary Speech Recognition for the Serbian Language
Popovic, Branislav
Pakoci, Edvin
Pekar, Darko
SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 343 - 352
[50] Investigating Large Language Models' Perception of Emotion Using Appraisal Theory
Yongsatianchot, Nutchanon
Torshizi, Parisa Ghanad
Marsella, Stacy
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,

← 1 2 3 4 5 →