Distributed Sentiment Analysis for Geo-Tagged Twitter Data

Times cited: 0
Authors
Zengin, Muhammed Said [1]
Arslan, Rabia [1]
Akgun, Mehmet Burak [1]
Affiliations
[1] TOBB Ekon & Teknol Univ, Bilgisayar Muhendisligi Bolumu, Ankara, Turkey
Source
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022
Keywords
Big data; distributed data processing; sentiment analysis; BERT
DOI
10.1109/SIU55565.2022.9864702
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
The ever-increasing frequency of sharing on social media makes these platforms one of the primary sources of data for computational social science studies. Likewise, analyzing large-scale social media datasets is crucial for governments as well as companies. However, as the amount of data grows, deriving insights from it with artificial-intelligence-based models becomes increasingly demanding in terms of processing power, and hardware requirements can rise dramatically when the insights are needed under real-time or near-real-time constraints. In this study, we developed a distributed sentiment analysis model that uses a large social media dataset. 16 million tweets were collected and grouped by originating city. The sentiment analysis model was produced by fine-tuning the pre-trained BERT model. Apache Spark, a distributed big data analytics engine, is used to execute the trained model in a distributed fashion. For evaluation, the prediction time on a single compute unit is compared with the distributed prediction time. The sentiment analysis model was executed separately for each of the data groups corresponding to the 81 provinces. The dataset of 16 million tweets used in this study, the Turkish sentiment analysis model produced, the distributed prediction code developed for Apache Spark, and all results of the study are available at https://distributed-sentiment-analysis.github.io/.
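The per-province prediction scheme described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: a keyword rule stands in for the fine-tuned BERT classifier, a standard-library thread pool stands in for Apache Spark executors, and every name below (predict_sentiment, group_by_province, distributed_predict, SAMPLE_TWEETS) is hypothetical. It only shows the shape of the idea: group tweets by province, run the model on each group independently, and check that the parallel result matches the single-unit result.

```python
from collections import Counter, defaultdict
from concurrent.futures import ThreadPoolExecutor

# Assumption: a toy keyword rule stands in for the fine-tuned BERT model.
POSITIVE = {"good", "great", "love"}
NEGATIVE = {"bad", "awful", "hate"}


def predict_sentiment(text):
    """Label one tweet as positive, negative, or neutral (toy rule)."""
    words = set(text.lower().split())
    score = len(words & POSITIVE) - len(words & NEGATIVE)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def group_by_province(tweets):
    """Group (province, text) pairs into {province: [text, ...]}."""
    groups = defaultdict(list)
    for province, text in tweets:
        groups[province].append(text)
    return dict(groups)


def predict_group(item):
    """Run the model over one province's tweets and count the labels."""
    province, texts = item
    return province, Counter(predict_sentiment(t) for t in texts)


def sequential_predict(tweets):
    """Baseline: process every province group on a single worker."""
    groups = group_by_province(tweets)
    return dict(predict_group(item) for item in groups.items())


def distributed_predict(tweets, workers=4):
    """Process province groups in parallel (thread pool as a Spark stand-in)."""
    groups = group_by_province(tweets)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(predict_group, groups.items()))


# Hypothetical sample data, not taken from the published dataset.
SAMPLE_TWEETS = [
    ("Ankara", "great weather today"),
    ("Ankara", "traffic is awful"),
    ("Izmir", "love the sea"),
    ("Izmir", "ordinary day"),
]

if __name__ == "__main__":
    for province, counts in distributed_predict(SAMPLE_TWEETS).items():
        print(province, dict(counts))
```

Because each province group is processed independently, the groups can be assigned to separate workers with no coordination beyond collecting the per-group counts. Timing sequential_predict against distributed_predict on a large input would mirror the single-unit versus distributed comparison the abstract mentions, although a thread pool running a toy rule will not reproduce the speedups of a real cluster.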
Pages: 4