Position-context additive transformer-based model for classifying text data on social media

被引:0
作者
Abd-Elaziz, M. M. [1 ]
El-Rashidy, Nora [2 ]
Abou Elfetouh, Ahmed [1 ]
El-Bakry, Hazem M. [1 ]
机构
[1] Mansoura Univ, Fac Comp & Informat Sci, Informat Syst Dept, Mansoura, Egypt
[2] Kaferelshikh Univ, Fac Artificial Intelligence, Machine Learning & Informat Retrieval Dept, Kafr Al Sheikh, Egypt
关键词
Social media; Transformer-based model; Word embedding; Bi-LSTM network; Additive attention; NEURAL-NETWORKS;
D O I
10.1038/s41598-025-90738-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, the continuous increase in the growth of text data on social media has been a major reason to rely on the pre-training method to develop new text classification models specially transformer-based models that have proven worthwhile in most natural language processing tasks. This paper introduces a new Position-Context Additive transformer-based model (PCA model) that consists of two-phases to increase the accuracy of text classification tasks on social media. Phase I aims to develop a new way to extract text characteristics by paying attention to the position and context of each word in the input layer. This is done by integrating the improved word embedding method (the position) with the developed Bi-LSTM network to increase the focus on the connection of each word with the other words around it (the context). As for phase II, it focuses on the development of a transformer-based model based primarily on improving the additive attention mechanism. The PCA model has been tested for the implementation of the classification of health-related social media texts in 6 data sets. Results showed that performance accuracy was improved by an increase in F1-Score between 0.2 and 10.2% in five datasets compared to the best published results. On the other hand, the performance of PCA model was compared with three transformer-based models that proved high accuracy in classifying texts, and experiments also showed that PCA model overcame the other models in 4 datasets to achieve an improvement in F1-score between 0.1 and 2.1%. The results also led us to conclude a direct correlation between the volume of training data and the accuracy of performance as the increase in the volume of training data positively affects F1-Score improvement.
引用
收藏
页数:11
相关论文
共 39 条
[31]   Generating Plausible and Context-Appropriate Comments on Social Media Posts: A Large Language Model-Based Approach [J].
Ha, Taehyun .
IEEE ACCESS, 2024, 12 :161545-161556
[32]   A topic model based framework for identifying the distribution of demand for relief supplies using social media data [J].
Zhang, Ting ;
Shen, Shi ;
Cheng, Changxiu ;
Su, Kai ;
Zhang, Xiangxue .
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2021, 35 (11) :2216-2237
[33]   Text-in-Image Enhanced Self-Supervised Alignment Model for Aspect-Based Multimodal Sentiment Analysis on Social Media [J].
Zhao, Xuefeng ;
Wang, Yuxiang ;
Zhong, Zhaoman .
SENSORS, 2025, 25 (08)
[34]   Rapid estimation of an earthquake impact area using a spatial logistic growth model based on social media data [J].
Wang, Yandong ;
Ruan, Shisi ;
Wang, Teng ;
Qiao, Mengling .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2019, 12 (11) :1265-1284
[35]   RETRACTED: Intelligent city emergency intelligence perception model based on social media big data (Retracted Article) [J].
Xiong, Guibin .
JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (Suppl 1) :69-70
[36]   Building a model-based personalised recommendation approach for tourist attractions from geotagged social media data [J].
Sun, Xiaoyu ;
Huang, Zhou ;
Peng, Xia ;
Chen, Yiran ;
Liu, Yu .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2019, 12 (06) :661-678
[37]   New Approach of Measuring Human Personality Traits Using Ontology-Based Model from Social Media Data [J].
Alamsyah, Andry ;
Dudija, Nidya ;
Widiyanesti, Sri .
INFORMATION, 2021, 12 (10)
[38]   A new emergency management dynamic value assessment model based on social media data: a multiphase decision-making perspective [J].
Shan, Siqing ;
Liu, Xiaohui ;
Wei, Yigang ;
Xu, Lida ;
Zhang, Baishang ;
Yu, Lei .
ENTERPRISE INFORMATION SYSTEMS, 2020, 14 (05) :680-709
[39]   The study on the dissemination of waste sorting policies on social media and the public's feedback attitudes: a text analysis based on comment data of policies in 46 key cities in China [J].
Chen, Liangkun ;
Huang, Lexin ;
Ma, Wanqi ;
Ma, Suwei ;
Li, Yuhang .
FRONTIERS IN ENVIRONMENTAL SCIENCE, 2025, 13