Value leadership of large language models based on speech recognition and emotion classification algorithms

Cited by: 0
Authors
Liu, Huifang [1]
Ye, Yunfeng [2]
Affiliations
[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China
[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China
Keywords
speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading
DOI
10.1504/IJBIC.2024.142563
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An integrated algorithm was proposed to better address speech recognition and emotion classification in natural language processing. The algorithm aims to convert speech accurately into text and to perform sentiment analysis on the result. In simulation experiments, the model's loss was about 0.35 on the training set and about 0.99 on the validation set. After feature extraction, the accuracy, gain rate, echo value, and F1 value of the model improved to 0.87, 0.88, 0.88, and 0.88, respectively. Compared with other similar models, the proposed model had a higher overall recognition rate, especially for emotions such as anger (90.25%), fear (89.78%), and disgust (90.11%). These results show that the model can better understand and generate emotional language expressions and provide better services for natural language understanding.
Pages: 201-211
Number of pages: 12