Value leadership of large language models based on speech recognition and emotion classification algorithms

Authors
Liu, Huifang [1]
Ye, Yunfeng [2]
Affiliations
[1] Dongguan City Univ, Sch Marxism, Dongguan 523419, Peoples R China
[2] Dongguan City Univ, Brand Ctr, Dongguan 523419, Peoples R China
Keywords
speech recognition; SR; emotion classification algorithm; ECA; large language model; LLM; natural language processing; machine reading
DOI
10.1504/IJBIC.2024.142563
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An integrated algorithm was proposed to address speech recognition and emotion classification in natural language processing. The algorithm aims to convert speech accurately into text and then perform sentiment analysis on that text. In simulation experiments, the model's loss was about 0.35 on the training set and about 0.99 on the validation set. After feature extraction, the accuracy, gain rate, echo value, and F1 value of the model improved to 0.87, 0.88, 0.88, and 0.88, respectively, which the authors describe as a significant improvement. Compared with other similar models, the proposed model had a higher overall recognition rate, especially for emotions such as anger (90.25%), fear (89.78%), and disgust (90.11%). These results indicate that the model can better understand and generate emotional language expressions and can provide better support for natural language understanding.
Pages: 201-211
Number of pages: 12