Sentiment Classification Algorithm Based on Multi-Modal Social Media Text Information

被引：14

作者：

Xuanyuan, Minzheng ^{[1
,2
]}

Xiao, Le ^{[1
,2
]}

Duan, Mengshi ^{[1
,2
]}

机构：

[1] Henan Univ Technol, Zhengzhou 450052, Peoples R China

[2] Minist Educ, Key Lab Grain Informat Proc & Control, Zhengzhou 450001, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Social networking (online); Training; Feature extraction; Data models; Classification algorithms; Recurrent neural networks; Logic gates; UCRNN; sentiment classification; public opinion analysis; natural language processing; deep neural network; social media; multi-modal;

D O I：

10.1109/ACCESS.2021.3061450

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The issue of sentiment classification in short-term and small-scale data scenarios is considered in this paper. It is a hot topic because the text sentiment classification task in the public opinion analysis scene has two characteristics: short time and small data scale. Existing work focused on improving the accuracy at the cost of data and training time, without considering scenarios where time and data are lacked. The most commonly used method to solve the problem of small data scale is to use multi-modal information such as pictures, sounds and videos, which will lead to unbearable training time. The shorter training time determines that the classification model is generally selected as a deep neural network with fewer layers, such as TextCNN, TextRNN, and so on. However, such models are limited by the structure and have a low classification accuracy. In order to solve both short-term and small-scale data problems, a common information user attribute on social media is added to the model as multimodal information, which includes twelve attributes such as user age, location, and posting time. This paper proposed a sentiment classification algorithm based on multi-modal social media text information. The algorithm makes use of parallel convolutional neural networks (CNN) and recurrent neural network (RNN) to process text information and user attributes respectively, and combines the feature vectors of the two models for classification, which is called User attributes Convolutional and Recurrent Neural Network (UCRNN). The addition of user attributes can improve accuracy, and the CNN network used to extract user attributes features has fewer parameters, which proves that the algorithm can achieve high accuracy under short-term and small-scale data. Experiments verify that the training time of this model is slightly less than TextRNN. The classification accuracy can reach 90.2%, which is the state-of-the-art in the field of short-term and small-scale data sentiment classification.

引用

页码：33410 / 33418

页数：9

共 20 条

[1]

[Anonymous], Conf. Comput. Vis. (ICCV)

[2] Novel OGBEE-based feature selection and feature-level fusion with MLP neural network for social media multimodal sentiment analysis [J].

Bairavel, S. ;

Krishnamurthy, M. .

SOFT COMPUTING, 2020, 24 (24) :18431-18445

[3]

Boillot A, 2013, PLOS ONE, V8, DOI [10.1371/journal.pone.0052708, 10.1371/journal.pone.0064417]

[4]

Devlin J., 2018, arXiv:1810.04805

[5]

Joulin Armand, 2016, ARXIV160701759

[6]

Kim Y., 2014, arXiv

[7]

고대건, 2017, IEIE Transactions on Smart Processing & Computing, V6, P53, DOI 10.5573/IEIESPC.2017.6.1.053

[8] Hybrid context enriched deep learning model for fine-grained sentiment analysis in textual and visual semiotic modality social data [J].

Kumar, Akshi ;

Srinivasan, Kathiravan ;

Cheng Wen-Huang ;

Zomaya, Albert Y. .

INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (01)

[9]

Li F., 2016, ARXIV160807720

[10] Reasoning human emotional responses from large-scale social and public media [J].

Li, Xianghua ;

Wang, Zhen ;

Gao, Chao ;

Shi, Lei .

APPLIED MATHEMATICS AND COMPUTATION, 2017, 310 :182-193

← 1 2 →