EmoLabel: Semi-Automatic Methodology for Emotion Annotation of Social Media Text

被引：9

作者：

Canales, Lea ^{[1
]}

Daelemans, Walter ^{[2
]}

Boldrini, Ester ^{[1
]}

Martinez-Barco, Patricio ^{[1
]}

机构：

[1] Univ Alicante, Dept Software & Comp Syst, Alicante 03690, Spain

[2] Univ Antwerp, CLiPS Res Ctr, B-2000 Antwerp, Belgium

来源：

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2022年 / 13卷 / 02期

关键词：

Task analysis; Erbium; Manuals; Twitter; Emotion recognition; Data mining; Natural language processing; sentiment analysis; textual emotion recognition; corpora annotation; social media text; CLASSIFICATION; RECOGNITION;

D O I：

10.1109/TAFFC.2019.2927564

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The exponential growth of the amount of subjective information on the Web 2.0. has caused an increasing interest from researchers willing to develop methods to extract emotion data from these new sources. One of the most important challenges in textual emotion detection is the gathering of data with emotion labels because of the subjectivity of assigning these labels. Basing on this rationale, the main objective of our research is to contribute to the resolution of this important challenge. This is tackled by proposing EmoLabel: a semi-automatic methodology based on pre-annotation, which consists of two main phases: (1) an automatic process to pre-annotate the unlabelled English sentences; and (2) a manual process of refinement where human annotators determine which is the dominant emotion. Our objective is to assess the influence of this automatic pre-annotation method on manual emotion annotation from two points of view: agreement and time needed for annotation. The evaluation performed demonstrates the benefits of pre-annotation processes since the results on annotation time show a gain of near 20 percent when the pre-annotation process is applied (Pre-ML) without reducing annotator performance. Moreover, the benefits of pre-annotation are higher in those contributors whose performance is low (inaccurate annotators).

引用

页码：579 / 591

页数：13

共 56 条

[1]

Al-Saqqa S, 2018, INT CONF COMP SCI, P136, DOI 10.1109/CSIT.2018.8486405

[2]

Alm CeciliaOvesdotter., 2005, Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, P579

[3]

Aman S., 2008, Proceedings of the Third International Joint Conference on Natural Language Processing, P296

[4]

Aman S, 2007, LECT NOTES ARTIF INT, V4629, P196

[5] Using EmotiBlog to annotate and analyse subjectivity in the new textual genres [J].

Boldrini, Ester ;

Balahur, Alexandra ;

Martinez-Barco, Patricio ;

Montoyo, Andres .

DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 25 (03) :603-634

[6]

Cambria E, 2018, AAAI CONF ARTIF INTE, P1795

[7]

Cambria E, 2015, AAAI CONF ARTIF INTE, P508

[8]

Canales L., 2017, IEEE T AFFECT COMPUT, DOI [10.1109/TAFFC.2017.2764770, DOI 10.1109/TAFFC.2017.2764770]

[9]

Canales Lea., 2017, P RECENT ADV NATURAL, V4-6, P157, DOI [10.26615/978-954-452-049-6_022, DOI 10.26615/978-954-452-049-6_022]

[10]

Chaffar S, 2011, LECT NOTES ARTIF INT, V6657, P62, DOI 10.1007/978-3-642-21043-3_8

← 1 2 3 4 5 6 →