Distributed Sentiment Analysis for Geo-Tagged Twitter Data

被引:0
|
作者
Zengin, Muhammed Said [1 ]
Arslan, Rabia [1 ]
Akgun, Mehmet Burak [1 ]
机构
[1] TOBB Ekon & Teknol Univ, Bilgisayar Muhendisligi Bolumu, Ankara, Turkey
来源
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年
关键词
Big data; distributed data processing; sentiment analysis; BERT;
D O I
10.1109/SIU55565.2022.9864702
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ever-increasing frequency of sharing on social media makes these platforms one of the primary sources of data for computational social science studies. Similarly, examining and analyzing large scale social media data-sets is crucial for governments as well as companies. However, as the amount of data increases, insights that need to be derived from the data using artificial intelligence based models becomes more and more demanding in terms of processing power. In fact, hardware requirements might dramatically increase if the insights are needed under real-time or near-real time constraints. In this study, we developed a distributed sentiment analysis model that utilizes a large social media data-set. 16 million tweets have been collected and grouped by the originating city. The sentiment analysis model was produced by fine-tuning the pre-trained BERT model. Distributed big data analytics engine, Apache Spark, is used to execute the trained model in a distributed fashion. For evaluation purposes, the prediction time on a single compute unit is compared with the distributed prediction time. Sentiment analysis model has been executed separately for each of the data-groups corresponding to 81 provinces. The data-set containing 16 million tweets used in this study, the Turkish sentiment analysis model produced, the distributed prediction code developed for Apache Spark and all the results of the study can be accessed from the address https://distributed-sentiment-analysis.github.io/.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Evaluating Geo-Tagged Twitter Data to Analyze Tourist Flows in Styria, Austria
    Scholz, Johannes
    Jeznik, Janja
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (11)
  • [2] Sentiment Analysis by Fusing Text and Location Features of Geo-Tagged Tweets
    Lim, Wei Lun
    Ho, Chiung Ching
    Ting, Choo-Yee
    IEEE ACCESS, 2020, 8 : 181014 - 181027
  • [3] Using Geo-Tagged Sentiment to Better Understand Social Interactions
    Vivanco, Elizabeth
    Palanca, Javier
    del Val, Elena
    Rebollo, Miguel
    Botti, Vicent
    ADVANCES IN PRACTICAL APPLICATIONS OF CYBER-PHYSICAL MULTI-AGENT SYSTEMS: THE PAAMS COLLECTION, PAAMS 2017, 2017, 10349 : 369 - 372
  • [4] Sentiment Analysis On Twitter Data Using Distributed Architecture
    Karhan, Zebra
    Soysaldi, Meryem
    Ozben, Yagiz Ozgenc
    Kilic, Erdal
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 357 - 360
  • [5] Reading the urban socio-spatial network through space syntax and geo-tagged Twitter data
    Iranmanesh, Aminreza
    Atun, Resmiye Alpar
    JOURNAL OF URBAN DESIGN, 2020, 25 (06) : 738 - 757
  • [6] Geo-Tagged Social Media Data as a Proxy for Urban Mobility
    Qian, Cheng
    Kats, Philipp
    Malinchik, Sergey
    Hoffman, Mark
    Kettler, Brian
    Kontokosta, Constantine
    Sobolevsky, Stanislav
    ADVANCES IN CROSS-CULTURAL DECISION MAKING, (AHFE 2017), 2018, 610 : 29 - 40
  • [7] Urban magnetism through the lens of geo-tagged photography
    Paldino, Silvia
    Bojic, Iva
    Sobolevsky, Stanislav
    Ratti, Carlo
    Gonzalez, Marta C.
    EPJ DATA SCIENCE, 2015, 4 (01): : 1 - 17
  • [8] Analysis of the performance and robustness of methods to detect base locations of individuals with geo-tagged social media data
    Liu, Zhewei
    Zhang, Anshu
    Yao, Yepeng
    Shi, Wenzhong
    Huang, Xiao
    Shen, Xiaoqi
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2021, 35 (03) : 609 - 627
  • [9] Sentiment Analysis of Twitter Data
    Desai, Radhi D.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 114 - 117
  • [10] Urban magnetism through the lens of geo-tagged photography
    Silvia Paldino
    Iva Bojic
    Stanislav Sobolevsky
    Carlo Ratti
    Marta C González
    EPJ Data Science, 4