Attention-Based CNN and Bi-LSTM Model Based on TF-IDF and GloVe Word Embedding for Sentiment Analysis

被引:62
作者
Kamyab, Marjan [1 ]
Liu, Guohua [1 ]
Adjeisah, Michael [2 ]
机构
[1] Donghua Univ, Sch Comp Sci & Technol, Shanghai 201620, Peoples R China
[2] Zhejiang Normal Univ, Coll Math & Comp Sci, Jinhua 321004, Zhejiang, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 23期
关键词
deep learning; CNN; Bi-LSTM; attention mechanism; social media sentiment analysis; TF-IDF;
D O I
10.3390/app112311255
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Sentiment analysis (SA) detects people's opinions from text engaging natural language processing (NLP) techniques. Recent research has shown that deep learning models, i.e., Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), and Transformer-based provide promising results for recognizing sentiment. Nonetheless, CNN has the advantage of extracting high-level features by using convolutional and max-pooling layers; it cannot efficiently learn a sequence of correlations. At the same time, Bidirectional RNN uses two RNN directions to improve extracting long-term dependencies. However, it cannot extract local features in parallel, and Transformer-based like Bidirectional Encoder Representations from Transformers (BERT) are the computational resources needed to fine-tune, facing an overfitting problem on small datasets. This paper proposes a novel attention-based model that utilizes CNNs with LSTM (named ACL-SA). First, it applies a preprocessor to enhance the data quality and employ term frequency-inverse document frequency (TF-IDF) feature weighting and pre-trained Glove word embedding approaches to extract meaningful information from textual data. In addition, it utilizes CNN's max-pooling to extract contextual features and reduce feature dimensionality. Moreover, it uses an integrated bidirectional LSTM to capture long-term dependencies. Furthermore, it applies the attention mechanism at the CNN's output layer to emphasize each word's attention level. To avoid overfitting, the Guasiannoise and GuasianDroupout are adopted as regularization. The model's robustness is evaluated on four English standard datasets, i.e., Sentiment140, US-airline, Sentiment140-MV, SA4A with various performance matrices, and compared efficiency with existing baseline models and approaches. The experiment results show that the proposed method significantly outperforms the state-of-the-art models.
引用
收藏
页数:17
相关论文
共 55 条
[1]   Deep learning-based sentiment classification of evaluative text based on Multi-feature fusion [J].
Abdi, Asad ;
Shamsuddin, Siti Mariyam ;
Hasan, Shafaatunnur ;
Piran, Jalil .
INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) :1245-1259
[2]   Sentiment analysis through recurrent variants latterly on convolutional neural network of Twitter [J].
Abid, Fazeel ;
Alam, Muhammad ;
Yasir, Muhammad ;
Li, Chen .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 95 :292-308
[3]   Enhancing deep learning sentiment analysis with ensemble techniques in social applications [J].
Araque, Oscar ;
Corcuera-Platas, Ignacio ;
Sanchez-Rada, J. Fernando ;
Iglesias, Carlos A. .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 77 :236-246
[4]  
Arun C., 2021, MATER TODAY-PROC
[5]   ABCDM: An Attention-based Bidirectional CNN-RNN Deep Model for sentiment analysis [J].
Basiri, Mohammad Ehsan ;
Nemati, Shahla ;
Abdar, Moloud ;
Cambria, Erik ;
Acharya, U. Rajendra .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 115 :279-294
[6]   Understanding Emotions in Text Using Deep Learning and Big Data [J].
Chatterjee, Ankush ;
Gupta, Umang ;
Chinnakotla, Manoj Kumar ;
Srikanth, Radhakrishnan ;
Galley, Michel ;
Agrawal, Puneet .
COMPUTERS IN HUMAN BEHAVIOR, 2019, 93 :309-317
[7]   Selecting publication keywords for domain analysis in bibliometrics: A comparison of three methods [J].
Chen, Guo ;
Xiao, Lu .
JOURNAL OF INFORMETRICS, 2016, 10 (01) :212-223
[8]   AI-Skin: Skin disease recognition based on self-learning and wide data collection through a closed-loop framework [J].
Chen, Min ;
Zhou, Ping ;
Wu, Di ;
Hu, Long ;
Hassan, Mohammad Mehedi ;
Alamri, Atif .
INFORMATION FUSION, 2020, 54 :1-9
[9]  
Chen X, 2014, INT CONF SEMANT, P49, DOI 10.1109/SKG.2014.20
[10]   Sentiment Analysis Based on Deep Learning: A Comparative Study [J].
Dang, Nhan Cach ;
Moreno-Garcia, Maria N. ;
De la Prieta, Fernando .
ELECTRONICS, 2020, 9 (03)