A BERT based dual-channel explainable text emotion recognition system

被引：67

作者：

Kumar, Puneet ^{[1
]}

Raman, Balasubramanian ^{[1
]}

机构：

[1] Indian Inst Technol Roorkee, Dept Comp Sci & Engn, Roorkee, Uttar Pradesh, India

来源：

NEURAL NETWORKS | 2022年 / 150卷

关键词：

Emotion recognition; Natural language processing; Explainable AI; Deep neural network explainability; MODEL;

D O I：

10.1016/j.neunet.2022.03.017

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, a novel dual-channel system for multi-class text emotion recognition has been proposed, and a novel technique to explain its training & predictions has been developed. The architecture of the proposed system contains the embedding module, dual-channel module, emotion classification module, and explainability module. The embedding module extracts the textual features from the input sentences in the form of embedding vectors using the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model. Then the embedding vectors are fed as the inputs to the dual-channel network containing two network channels made up of convolutional neural network (CNN) and bidirectional long short term memory (BiLSTM) network. The intuition behind using CNN and BiLSTM in both the channels was to harness the goodness of the convolutional layer for feature extraction and the BiLSTM layer to extract text's order and sequence-related information. The outputs of both channels are in the form of embedding vectors which are concatenated and fed to the emotion classification module. The proposed system's architecture has been determined by thorough ablation studies, and a framework has been developed to discuss its computational cost. The emotion classification module learns and projects the emotion embeddings on a hyperplane in the form of clusters. The proposed explainability technique explains the training and predictions of the proposed system by analyzing the inter & intra-cluster distances and the intersection of these clusters. The proposed approach's consistent accuracy, precision, recall, and F1 score results for ISEAR, Aman, AffectiveText, and EmotionLines datasets, ensure its applicability to diverse texts.(C)& nbsp;& nbsp;2022 Elsevier Ltd. All rights reserved.

引用

页码：392 / 407

页数：16

共 75 条

[51] EmoSenticSpace: A novel framework for affective common-sense reasoning [J].

Poria, Soujanya ;

Gelbukh, Alexander ;

Cambria, Erik ;

Hussain, Amir ;

Huang, Guang-Bin .

KNOWLEDGE-BASED SYSTEMS, 2014, 69 :108-123

[52]

Rathnayaka P, 2019, ARXIV PREPRINT ARXIV

[53] "Why Should I Trust You?" Explaining the Predictions of Any Classifier [J].

Ribeiro, Marco Tulio ;

Singh, Sameer ;

Guestrin, Carlos .

KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :1135-1144

[54] A model for sentiment and emotion analysis of unstructured social media text [J].

Rout, Jitendra Kumar ;

Choo, Kim-Kwang Raymond ;

Dash, Amiya Kumar ;

Bakshi, Sambit ;

Jena, Sanjay Kumar ;

Williams, Karen L. .

ELECTRONIC COMMERCE RESEARCH, 2018, 18 (01) :181-199

[55] A CIRCUMPLEX MODEL OF AFFECT [J].

RUSSELL, JA .

JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1980, 39 (06) :1161-1178

[56] EVIDENCE FOR UNIVERSALITY AND CULTURAL VARIATION OF DIFFERENTIAL EMOTION RESPONSE PATTERNING [J].

SCHERER, KR ;

WALLBOTT, HG .

JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1994, 66 (02) :310-328

[57]

Seal Dibyendu, 2020, Information and Communication Technology for Sustainable Development. Proceedings of ICT4SD 2018. Advances in Intelligent Systems and Computing (AISC 933), P423, DOI 10.1007/978-981-13-7166-0_42

[58] Multimodal approaches for emotion recognition: A survey [J].

Sebe, N ;

Cohen, I ;

Gevers, T ;

Huang, TS .

INTERNET IMAGING VI, 2005, 5670 :56-67

[59]

Seyeditabari A., 2019, ARXIV PREPRINT ARXIV

[60]

Seyeditabari A., 2018, ARXIV PREPRINT ARXIV

← 1 2 3 4 5 6 7 8 →