Context Unlocks Emotions: Text-based Emotion Classification Dataset Auditing with Large Language Models

被引:1
|
作者
Yang, Daniel [1 ]
Kommineni, Aditya [1 ]
Alshehri, Mohammad [1 ,2 ]
Mohanty, Nilamadhab [1 ]
Modi, Vedant [1 ]
Gratch, Jonathan [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Saudi Aramco, Dhahran, Saudi Arabia
来源
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, ACII | 2023年
基金
美国国家科学基金会;
关键词
emotion classification; natural language processing; large language models; prompting;
D O I
10.1109/ACIIW59127.2023.10388131
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The lack of contextual information in text data can make the annotation process of text-based emotion classification datasets challenging. As a result, such datasets often contain labels that fail to consider all the relevant emotions in the vocabulary. This misalignment between text inputs and labels can degrade the performance of machine learning models trained on top of them. As re-annotating entire datasets is a costly and time-consuming task that cannot be done at scale, we propose to use the expressive capabilities of large language models to synthesize additional context for input text to increase its alignment with the annotated emotional labels. In this work, we propose a formal definition of textual context to motivate a prompting strategy to enhance such contextual information. We provide both human and empirical evaluation to demonstrate the efficacy of the enhanced context. Our method improves alignment between inputs and their human-annotated labels from both an empirical and human-evaluated standpoint.
引用
收藏
页数:8
相关论文
共 41 条
  • [41] The added value of including thyroid nodule features into large language models for automatic ACR TI-RADS classification based on ultrasound reports
    Lopez-ubeda, Pilar
    Martin-Noguerol, Teodoro
    Ruiz-Vinuesa, Alba
    Luna, Antonio
    JAPANESE JOURNAL OF RADIOLOGY, 2025, 43 (04) : 593 - 602