Distributed Sentiment Analysis for Geo-Tagged Twitter Data

被引:0
|
作者
Zengin, Muhammed Said [1 ]
Arslan, Rabia [1 ]
Akgun, Mehmet Burak [1 ]
机构
[1] TOBB Ekon & Teknol Univ, Bilgisayar Muhendisligi Bolumu, Ankara, Turkey
来源
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年
关键词
Big data; distributed data processing; sentiment analysis; BERT;
D O I
10.1109/SIU55565.2022.9864702
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ever-increasing frequency of sharing on social media makes these platforms one of the primary sources of data for computational social science studies. Similarly, examining and analyzing large scale social media data-sets is crucial for governments as well as companies. However, as the amount of data increases, insights that need to be derived from the data using artificial intelligence based models becomes more and more demanding in terms of processing power. In fact, hardware requirements might dramatically increase if the insights are needed under real-time or near-real time constraints. In this study, we developed a distributed sentiment analysis model that utilizes a large social media data-set. 16 million tweets have been collected and grouped by the originating city. The sentiment analysis model was produced by fine-tuning the pre-trained BERT model. Distributed big data analytics engine, Apache Spark, is used to execute the trained model in a distributed fashion. For evaluation purposes, the prediction time on a single compute unit is compared with the distributed prediction time. Sentiment analysis model has been executed separately for each of the data-groups corresponding to 81 provinces. The data-set containing 16 million tweets used in this study, the Turkish sentiment analysis model produced, the distributed prediction code developed for Apache Spark and all the results of the study can be accessed from the address https://distributed-sentiment-analysis.github.io/.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Sentiment Analysis for Twitter Data in the Hindi Language
    Madan, Anjum
    Ghose, Udayan
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 784 - 789
  • [22] A Comprehensive Survey on Sentiment Analysis in Twitter Data
    Krishnan, Hema
    Elayidom, M. Sudheep
    Santhanakrishnan, T.
    INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2022, 13 (05)
  • [23] Sentiment Analysis of Twitter Data: A Hybrid Approach
    Srivastava, Ankit
    Singh, Vijendra
    Drall, Gurdeep Singh
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2019, 14 (02) : 1 - 16
  • [24] Sentiment Analysis on Twitter Data using Apache Spark Framework
    Elzayady, Hossam
    Badran, Khaled M.
    Salama, Gouda I.
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 171 - 176
  • [25] Event Based Sentiment Analysis of Twitter Data
    Patil, Mamta
    Chavan, H. K.
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2018), 2018, : 1041 - 1054
  • [26] A Review of Techniques for Sentiment Analysis Of Twitter Data
    Bhuta, Sagar
    Doshi, Avit
    Doshi, Uehit
    Narvekar, Meera
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 583 - 591
  • [27] Sentiment Analysis on COVID-19 Twitter Data: A Sentiment Timeline
    Karagkiozidou, Makrina
    Koukaras, Paraskevas
    Tjortjis, Christos
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2022, PART II, 2022, 647 : 350 - 359
  • [28] Sentiment mapping: point pattern analysis of sentiment classified Twitter data
    Camacho, Ken
    Portelli, Raechel
    Shortridge, Ashton
    Takahashi, Bruno
    CARTOGRAPHY AND GEOGRAPHIC INFORMATION SCIENCE, 2021, 48 (03) : 241 - 257
  • [29] Sentiment Analysis of Twitter Data Using NLP Models: A Comprehensive Review
    Albladi, Aish
    Islam, Minarul
    Seals, Cheryl
    IEEE ACCESS, 2025, 13 : 30444 - 30468
  • [30] Sentiment Drift Detection and Analysis in Real Time Twitter Data Streams
    Susi E.
    Shanthi A.P.
    Computer Systems Science and Engineering, 2023, 45 (03): : 3231 - 3246