A stacking ensemble approach for identification of informative tweets on twitter data

被引:4
作者
Dasari S.K. [1 ,3 ]
Gorla S. [2 ]
Prasad Reddy P.V.G.D. [3 ]
机构
[1] Department of Computer Science and Engineering, GMR Institute of Technology, GMRIT Rajam and Research Scholar, Andhra University, Visakhapatnam
[2] Department of Computer Science and Engineering, GITAM Deemed to Be University, Visakhapatnam
[3] Department of Computer Science and Systems Engineering, Andhra University, Visakhapatnam
关键词
CrisisNLP; embedding; Machine Learning; Stacking; Tweets; Vectorization;
D O I
10.1007/s41870-023-01316-5
中图分类号
学科分类号
摘要
In the Present era, social media information plays an impact on our daily activities. Accurate media information identification is also challenging because of fake or spam information. Social media may receive this information via Twitter, Facebook, Youtube, LinkedIn, etc. In the context of natural disasters, people share their opinions via social media. Identifying such tweets which are informative or non-informative via these social media is a challenging issue. This work focuses on when a natural disaster happens and how social media information has been categorized, such as informative or non-informative. In this paper, we collected seven datasets from various disaster events from crisisNLP and categorized the tweet information using the stacking ensemble approach. In the proposed work, we considered ensemble stacking approach, in which for base learner we chosen the top five best-performed machine learning models where as for meta-learner we chosen the logistic regression. We tested our stacking model performance with matrices, namely accuracy, precision, recall, and F1 score. We also compared our approach results with other state-of-the-art models. © 2023, The Author(s), under exclusive licence to Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:2651 / 2662
页数:11
相关论文
共 32 条
[1]  
Imran M., Elbassuoni S., Castillo C., Diaz F., Meier P., Practical extraction of disaster-relevant information from social media, Proceedings of the 22Nd International Conference on World Wide Web, pp. 1021-1024, (2013)
[2]  
Rudra K., Ghosh S., Ganguly N., Goyal P., Ghosh S., Extracting situational information from microblogs during disaster events: A classification-summarization approach, Proceedings of the 24Th ACM International on Conference on Information and Knowledge Management, pp. 583-592, (2015)
[3]  
Imran M., Elbassuoni S., Castillo C., Diaz F., Meier P., Extracting information nuggets from disaster-Related messages in social media, Iscram, 201, 3, pp. 791-801, (2013)
[4]  
Alam F., Ofli F., Imran M., Crisismmd: Multimodal twitter datasets from natural disasters, Twelfth International AAAI Conference on Web and Social Media, (2018)
[5]  
Alam F., Ofli F., Imran M., Aupetit M., A Twitter Tale of Three Hurricanes: Harvey, Irma, and Maria. Arxiv, 1805, (2018)
[6]  
Imran M., Mitra P., Castillo C., Twitter as a Lifeline: Human-Annotated Twitter Corpora for NLP of Crisis-Related Messages. Arxiv, 1605, (2016)
[7]  
Alam F., Joty S., Imran M., Domain Adaptation with Adversarial Training and Graph Embeddings. Arxiv, 1805, (2018)
[8]  
Nguyen D.T., Ofli F., Imran M., Mitra P., Damage assessment from social media imagery data during disasters, Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017, pp. 569-576, (2017)
[9]  
Madichetty S., Sridevi M., A neural-based approach for detecting the situational information from Twitter during disaster, IEEE Trans Comput Soc Syst, 8, 4, pp. 870-880, (2021)
[10]  
Madichetty S., Sridevi M., A novel method for identifying the damage assessment tweets during disaster, Fut Gen Comput Syst, 116, pp. 440-454, (2021)