Heuristic-Assisted BERT for Twitter Sentiment Analysis

被引:11
作者
Yenduri, Gokul [1 ]
Rajakumar, B. R. [2 ]
Praghash, K. [3 ]
Binu, D. [2 ]
机构
[1] Vignan S Univ, Vlits, CSE Dept, Guntur, Andhra Pradesh, India
[2] Resbee Info Technol Private Ltd, Opposite Govt Hosp, 2nd Floor,Rathi Pl, Thuckalay 629175, Tamil Nadu, India
[3] Koneru Lakshmaiah Educ Fdn, Dept Elect & Commun Engn, Vaddeswaram, AP, India
关键词
Sentiment analysis; Twitter data; BERT; tokenization; PA-CUP model; ALGORITHM; NETWORKS;
D O I
10.1142/S1469026821500152
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The identification of opinions and sentiments from tweets is termed as "Twitter Sentiment Analysis (TSA)". The major process of TSA is to determine the sentiment or polarity of the tweet and then classifying them into a negative or positive tweet. There are several methods introduced for carrying out TSA, however, it remains to be challenging due to slang words, modern accents, grammatical and spelling mistakes, and other issues that could not be solved by existing techniques. This work develops a novel customized BERT-oriented sentiment classification that encompasses two main phases: pre-processing and tokenization, and a "Customized Bidirectional Encoder Representations from Transformers (BERT)"-based classification. At first, the gathered raw tweets are pre-processed under stop-word removal, stemming and blank space removal. After pre-processing, the semantic words are obtained, from which the meaningful words (tokens) are extracted in the tokenization phase. Consequently, these extracted tokens are classified via optimized BERT, where biases and weight are tuned optimally by Particle-Assisted Circle Updating Position (PA-CUP). Moreover, the maximal sequence length of the BERT encoder is updated using standard PA-CUP. Finally, the performance analysis is carried out to substantiate the enhancement of the proposed model.
引用
收藏
页数:18
相关论文
共 44 条
[1]  
Ahmad S., 2019, HUM CENTRIC COMPUT I
[2]  
Ahuja Ravinder, 2019, Procedia Computer Science, V152, P341, DOI 10.1016/j.procs.2019.05.008
[3]  
AlBalushi F.M., 2019, J NETWORKING COMMUNI, V2, P37
[4]   Twitter sentiment analysis with a deep neural network: An enhanced approach using user behavioral information [J].
Alharbi, Ahmed Sulaiman M. ;
de Doncker, Elise .
COGNITIVE SYSTEMS RESEARCH, 2019, 54 :50-61
[5]   Using the contextual language model BERT for multi-criteria classification of scientific articles [J].
Ambalavanan, Ashwin Karthik ;
Devarakonda, Murthy V. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 112
[6]  
Angiani Giulio., 2016, KDWEB
[7]   A novel metaheuristic method for solving constrained engineering optimization problems: Crow search algorithm [J].
Askarzadeh, Alireza .
COMPUTERS & STRUCTURES, 2016, 169 :1-12
[8]   Threshold Prediction for Segmenting Tumour from Brain MRI Scans [J].
Beno, M. Marsaline ;
Valarmathi, I. R. ;
Swamy, S. M. ;
Rajakumar, B. R. .
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2014, 24 (02) :129-137
[9]   Optimization using lion algorithm: a biological inspiration from lion’s social behavior [J].
Boothalingam R. .
Evolutionary Intelligence, 2018, 11 (1-2) :31-52
[10]   Application of Sentiment Analysis to Language Learning [J].
Chen, Mai-Hua ;
Chen, Wei-Fan ;
Ku, Lun-Wei .
IEEE ACCESS, 2018, 6 :24433-24442