Value leadership of large language models based on speech recognition and emotion classification algorithms

被引:0
|
作者
Liu, Huifang [1 ]
Ye, Yunfeng [2 ]
机构
[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China
[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China
关键词
speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading;
D O I
10.1504/IJBIC.2024.142563
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An integrated algorithm was proposed to better address the issues of speech recognition and emotion classification in the natural language processing. This algorithm was committed to accurately converting speech information into text form and conducting sentiment analysis on it. The simulation experiment results showed that the loss value of the model in the training set was about 0.35, and the loss value in the validation set was about 0.99. After feature extraction, the accuracy, gain rate, echo value, and F1 value of the model were improved to 0.87, 0.88, 0.88, and 0.88, respectively, showing significant improvement. Compared with other similar models, the proposed model had a higher overall recognition rate, especially in emotions such as anger (90.25%), fear (89.78%), and disgust (90.11%). The above results show that this model can better understand and generate emotional language expressions and provide better services for natural language understanding.
引用
收藏
页码:201 / 211
页数:12
相关论文
共 50 条
  • [21] BAYESIAN TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION
    Xue, Boyang
    Yu, Jianwei
    Xu, Junhao
    Liu, Shansong
    Hu, Shoukang
    Ye, Zi
    Geng, Mengzhe
    Liu, Xunying
    Meng, Helen
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7378 - 7382
  • [22] Alzheimer's disease recognition from spontaneous speech using large language models
    Bang, Jeong-Uk
    Han, Seung-Hoon
    Kang, Byung-Ok
    ETRI JOURNAL, 2024, 46 (01) : 96 - 105
  • [23] Speech Emotion Recognition Based on Henan Dialect
    Cheng, Zichen
    Li, Yan
    Jiu, Mengfei
    Ge, Jiangwei
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, VOL. 1, 2022, 878 : 498 - 505
  • [24] Optional English speech teaching method based on recognition emotion mining and deep learning algorithms
    Zhang, Xinyu
    Li, Hui
    Wang, Na
    Shi, Ruolin
    Engineering Intelligent Systems, 2019, 27 (03): : 141 - 150
  • [25] Factors in Emotion Recognition With Deep Learning Models Using Speech and Text on Multiple Corpora
    Braunschweiler, Norbert
    Doddipatla, Rama
    Keizer, Simon
    Stoyanchev, Svetlana
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 722 - 726
  • [26] Dual Language Models for Code Switched Speech Recognition
    Garg, Saurabh
    Parekh, Tanmay
    Jyothi, Preethi
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2598 - 2602
  • [27] Data augmentation based on large language models for radiological report classification
    Collado-Montanez, Jaime
    Martin-Valdivia, Maria-Teresa
    Martinez-Camara, Eugenio
    KNOWLEDGE-BASED SYSTEMS, 2025, 308
  • [28] Zero-Shot Classification of Art With Large Language Models
    Tojima, Tatsuya
    Yoshida, Mitsuo
    IEEE ACCESS, 2025, 13 : 17426 - 17439
  • [29] Speech Recognition Algorithms-Based Cough Recognition System
    Barkani, Fatima
    Hamidi, Mohamed
    Zealouk, Ouissam
    Satori, Hassan
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (12) : 49 - 61
  • [30] Multimodal Embeddings From Language Models for Emotion Recognition in the Wild
    Tseng, Shao-Yen
    Narayanan, Shrikanth
    Georgiou, Panayiotis
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 608 - 612