Value leadership of large language models based on speech recognition and emotion classification algorithms

被引:0
|
作者
Liu, Huifang [1 ]
Ye, Yunfeng [2 ]
机构
[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China
[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China
关键词
speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading;
D O I
10.1504/IJBIC.2024.142563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An integrated algorithm was proposed to better address the issues of speech recognition and emotion classification in the natural language processing. This algorithm was committed to accurately converting speech information into text form and conducting sentiment analysis on it. The simulation experiment results showed that the loss value of the model in the training set was about 0.35, and the loss value in the validation set was about 0.99. After feature extraction, the accuracy, gain rate, echo value, and F1 value of the model were improved to 0.87, 0.88, 0.88, and 0.88, respectively, showing significant improvement. Compared with other similar models, the proposed model had a higher overall recognition rate, especially in emotions such as anger (90.25%), fear (89.78%), and disgust (90.11%). The above results show that this model can better understand and generate emotional language expressions and provide better services for natural language understanding.
引用
收藏
页码:201 / 211
页数:12
相关论文
共 50 条
  • [41] Multimodal Emotion Recognition Based on Facial Expressions, Speech, and EEG
    Pan, Jiahui
    Fang, Weijie
    Zhang, Zhihang
    Chen, Bingzhi
    Zhang, Zheng
    Wang, Shuihua
    IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY, 2024, 5 : 396 - 403
  • [42] A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
    Li, Yangze
    Wang, Xiong
    Cao, Songjun
    Zhang, Yike
    Ma, Long
    Xie, Lei
    INTERSPEECH 2024, 2024, : 1905 - 1909
  • [43] Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model
    Shen, JL
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1998, 145 (05): : 309 - 315
  • [44] Design pattern recognition: a study of large language models
    Pandey, Sushant Kumar
    Chand, Sivajeet
    Horkoff, Jennifer
    Staron, Miroslaw
    Ochodek, Miroslaw
    Durisic, Darko
    EMPIRICAL SOFTWARE ENGINEERING, 2025, 30 (03)
  • [45] Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
    Liu, Hong
    Lv, Zhaobiao
    Ou, Zhijian
    Zhao, Wenbo
    Xiao, Qing
    INTERSPEECH 2023, 2023, : 476 - 480
  • [46] Building DNN acoustic models for large vocabulary speech recognition
    Maas, Andrew L.
    Qi, Peng
    Xie, Ziang
    Hannun, Awni Y.
    Lengerich, Christopher T.
    Jurafsky, Daniel
    Ng, Andrew Y.
    COMPUTER SPEECH AND LANGUAGE, 2017, 41 : 195 - 213
  • [47] A speech recognition algorithm based on the features of Croatian language
    Peic, R
    PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 613 - 618
  • [48] Multimodal Food Image Classification with Large Language Models
    Kim, Jun-Hwa
    Kim, Nam-Ho
    Jo, Donghyeok
    Won, Chee Sun
    ELECTRONICS, 2024, 13 (22)
  • [49] End-to-End Large Vocabulary Speech Recognition for the Serbian Language
    Popovic, Branislav
    Pakoci, Edvin
    Pekar, Darko
    SPEECH AND COMPUTER, SPECOM 2017, 2017, 10458 : 343 - 352
  • [50] Investigating Large Language Models' Perception of Emotion Using Appraisal Theory
    Yongsatianchot, Nutchanon
    Torshizi, Parisa Ghanad
    Marsella, Stacy
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,