KEAHT: A Knowledge-Enriched Attention-Based Hybrid Transformer Model for Social Sentiment Analysis

Cited by: 18
Authors
Tiwari, Dimple [1]
Nagpal, Bharti [2]
Affiliations
[1] Ambedkar Inst Adv Commun Technol & Res GGSIPU, New Delhi, India
[2] Ambedkar Inst Adv Commun Technol & Res, NSUT East Campus, New Delhi, India
Keywords
COVID-19; vaccine; Indian farmer protest; Bidirectional encoder representation from transformer (BERT); Latent Dirichlet Allocation (LDA); Lexicon approach; Social networks; WORD EMBEDDINGS; CNN; MACHINE; LSTM;
DOI
10.1007/s00354-022-00182-2
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Social media has emerged as an influential platform that allows people to share their views on global and local issues. Sentiment analysis can process these massive volumes of unstructured reviews and convert them into meaningful opinions. COVID-19 emerged as an enormous worldwide challenge that harmed humankind both physically and financially. Meanwhile, farmers' protests against three pieces of legislation passed by the Indian government drew worldwide attention. Hence, an artificial intelligence-based sentiment model is needed to suggest the right direction in responding to such outbreaks. Although Deep Neural Networks (DNNs) have gained popularity in sentiment analysis applications, they still suffer from sequential training, a high-dimensional feature space, and equal distribution of importance across features. In addition, inaccurate polarity scoring and utility-based topic modeling are other challenging aspects of sentiment analysis. This motivates us to propose a Knowledge-Enriched Attention-based Hybrid Transformer (KEAHT) model, enriched with explicit knowledge from Latent Dirichlet Allocation (LDA) topic modeling and a lexicalized domain ontology. A pre-trained Bidirectional Encoder Representation from Transformer (BERT) is employed so that the model can be trained on a minimal training corpus. BERT provides an attention mechanism and can solve complex text problems accurately. A comparative study against existing baselines and recent hybrid models affirms the credibility of the proposed KEAHT in the field of Natural Language Processing (NLP). The model highlights the role of artificial intelligence in handling a global pandemic and a democratic dispute within a country. Furthermore, two benchmark datasets, namely "COVID-19-Vaccine-Labelled-Tweets" and "Indian-Farmer-Protest-Labelled-Tweets", are constructed to help future researchers outline the essential facts associated with the outbreaks.
Pages: 1165-1202
Page count: 38