Context Unlocks Emotions: Text-based Emotion Classification Dataset Auditing with Large Language Models

被引:1
|
作者
Yang, Daniel [1 ]
Kommineni, Aditya [1 ]
Alshehri, Mohammad [1 ,2 ]
Mohanty, Nilamadhab [1 ]
Modi, Vedant [1 ]
Gratch, Jonathan [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Saudi Aramco, Dhahran, Saudi Arabia
来源
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, ACII | 2023年
基金
美国国家科学基金会;
关键词
emotion classification; natural language processing; large language models; prompting;
D O I
10.1109/ACIIW59127.2023.10388131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The lack of contextual information in text data can make the annotation process of text-based emotion classification datasets challenging. As a result, such datasets often contain labels that fail to consider all the relevant emotions in the vocabulary. This misalignment between text inputs and labels can degrade the performance of machine learning models trained on top of them. As re-annotating entire datasets is a costly and time-consuming task that cannot be done at scale, we propose to use the expressive capabilities of large language models to synthesize additional context for input text to increase its alignment with the annotated emotional labels. In this work, we propose a formal definition of textual context to motivate a prompting strategy to enhance such contextual information. We provide both human and empirical evaluation to demonstrate the efficacy of the enhanced context. Our method improves alignment between inputs and their human-annotated labels from both an empirical and human-evaluated standpoint.
引用
收藏
页数:8
相关论文
共 41 条
  • [31] Artificial intelligence orchestration for text-based ultrasonic simulation via self-review by multi-large language model agents
    Soyeon Kim
    Yonggyun Yu
    Hogeon Seo
    Scientific Reports, 15 (1)
  • [32] Comparing human text classification performance and explainability with large language and machine learning models using eye-tracking
    Venkatesh, Jeevithashree Divya
    Jaiswal, Aparajita
    Nanda, Gaurav
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [33] Automatic Genre Identification for Robust Enrichment of Massive Text Collections: Investigation of Classification Methods in the Era of Large Language Models
    Kuzman, Taja
    Mozetic, Igor
    Ljubesic, Nikola
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (03): : 1149 - 1175
  • [34] Vision-Enabled Large Language and Deep Learning Models for Image-Based Emotion Recognition
    Nadeem, Mohammad
    Sohail, Shahab Saquib
    Javed, Laeeba
    Anwer, Faisal
    Saudagar, Abdul Khader Jilani
    Muhammad, Khan
    COGNITIVE COMPUTATION, 2024, 16 (05) : 2566 - 2579
  • [35] Pre-Trained Transformer-Based Models for Text Classification Using Low-Resourced Ewe Language
    Agbesi, Victor Kwaku
    Chen, Wenyu
    Yussif, Sophyani Banaamwini
    Hossin, Md Altab
    Ukwuoma, Chiagoziem C.
    Kuadey, Noble A.
    Agbesi, Colin Collinson
    Samee, Nagwan Abdel
    Jamjoom, Mona M.
    Al-antari, Mugahed A.
    SYSTEMS, 2024, 12 (01):
  • [36] Improving the Accuracy of Text-to-SQL Tools Based on Large Language Models for Real-World Relational Databases
    Coelho, Gustavo M. C.
    Nascimento, Eduardo R. S.
    Izquierdo, Yenier T.
    Garcia, Grettel M.
    Feijo, Lucas
    Lemos, Melissa
    Garcia, Robinson L. S.
    de Oliveira, Aiko R.
    Pinheiro, Joao P.
    Casanova, Marco A.
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT I, DEXA 2024, 2024, 14910 : 93 - 107
  • [37] NLP4ReF: Requirements Classification and Forecasting: From Model-Based Design to Large Language Models
    Peer, Jordan
    Mordecai, Yaniv
    Reich, Yoram
    2024 IEEE AEROSPACE CONFERENCE, 2024,
  • [38] Applying automatic text-based detection of deceptive language to police reports: Extracting behavioral patterns from a multi-step classification model to understand how we lie to the police
    Quijano-Sanchez, Lara
    Liberatore, Federico
    Camacho-Collados, Jose
    Camacho-Collados, Miguel
    KNOWLEDGE-BASED SYSTEMS, 2018, 149 : 155 - 168
  • [39] Veracity-Oriented Context-Aware Large Language Models-Based Prompting Optimization for Fake News Detection
    Jin, Weiqiang
    Gao, Yang
    Tao, Tao
    Wang, Xiujun
    Wang, Ningwei
    Wu, Baohai
    Zhao, Biao
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2025, 2025 (01)
  • [40] Exploring the ability of emerging large language models to detect cyberbullying in social posts through new prompt-based classification approaches
    Cirillo, Stefano
    Desiato, Domenico
    Polese, Giuseppe
    Solimando, Giandomenico
    Sugumaran, Vijayan
    Sundaramurthy, Shanmugam
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)