Using geolocated tweets for characterization of Twitter in Portugal and the Portuguese administrative regions

被引:5
|
作者
Brogueira, Gaspar [1 ,2 ]
Batista, Fernando [1 ,2 ]
Carvalho, Joao Paulo [1 ,3 ]
机构
[1] INESC ID, Lisbon, Portugal
[2] Inst Univ Lisboa, ISCTE, Lisbon, Portugal
[3] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
关键词
Twitter; Geolocated tweets; Portuguese tweets; Portuguese districts; Twitter data analysis;
D O I
10.1007/s13278-016-0347-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The information published by the millions of public social network users is an important source of knowledge that can be used in academic, socioeconomic or demographic studies (distribution of male and female population, age, marital status, birth), lifestyle analysis (interests, hobbies, social habits) or be used to study online behavior (time spent online, interaction with friends or discussion about brands, products or politics). This work uses a database of about 27 million Portuguese geolocated tweets, produced in Portugal by 97.8 K users during a 1-year period, to extract information about the behavior of the geolocated Portuguese Twitter community and show that with this information it is possible to extract overall indicators such as: the daily periods of increased activity per region; prediction of regions where the concentration of the population is higher or lower in certain periods of the year; how do regional habitants feel about life; or what is talked about in each region. We also analyze the behavior of the geolocated Portuguese Twitter users based on the tweeted contents, and find indications that their behavior differs in certain relevant aspect from other Twitter communities, hypothesizing that this is in part due to the abnormal high percentage of young teenagers in the community. Finally, we present a small case study on Portuguese tourism in the Algarve region. To the best of our knowledge, this work is the first study that shows geolocated Portuguese users' behavior in Twitter focusing on geographic regional use.
引用
收藏
页数:20
相关论文
共 34 条
  • [21] Who Tweets in the United Kingdom? Profiling the Twitter Population Using the British Social Attitudes Survey 2015
    Sloan, Luke
    SOCIAL MEDIA + SOCIETY, 2017, 3 (01):
  • [22] What are IBD Patients Talking About on Twitter? Using Natural Language Understanding to Investigate Patients’ Tweets
    Stemmer M.
    Parmet Y.
    Ravid G.
    SN Computer Science, 4 (4)
  • [23] Identifying Health-Related Discussions of Cannabis Use on Twitter by Using a Medical Dictionary: Content Analysis of Tweets
    Allem, Jon-Patrick
    Majmundar, Anuja
    Dormanesh, Allison
    Donaldson, Scott, I
    JMIR FORMATIVE RESEARCH, 2022, 6 (02)
  • [24] Who are there: Discover Twitter users and tweets for target area using mention relationship strength and local tweet ratio
    Liu, Yimin
    Luo, Xiangyang
    Zhang, Meng
    Tao, Zhiyuan
    Liu, Fenlin
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2022, 199
  • [25] Inferences about spatiotemporal variation in dengue virus transmission are sensitive to assumptions about human mobility: a case study using geolocated tweets from Lahore, Pakistan
    Moritz U. G. Kraemer
    D. Bisanzio
    R. C. Reiner
    R. Zakar
    J. B. Hawkins
    C. C. Freifeld
    D. L. Smith
    S. I. Hay
    J. S. Brownstein
    T. Alex Perkins
    EPJ Data Science, 7
  • [26] Twitter users' coping behaviors during the COVID-19 lockdown: an analysis of tweets using mixed methods
    Mittal, Ruchi
    Ahmed, Wasim
    Mittal, Amit
    Aggarwal, Ishan
    INFORMATION DISCOVERY AND DELIVERY, 2021, 49 (03) : 193 - 202
  • [27] A Framework for User Characterization based on Tweets Using Machine Learning Algorithms
    Zahra, Kinza
    Azam, Farooque
    Butt, Wasi Haider
    Ilyas, Fauqia
    PROCEEDINGS OF 2018 VII INTERNATIONAL CONFERENCE ON NETWORK, COMMUNICATION AND COMPUTING (ICNCC 2018), 2018, : 11 - 16
  • [28] Inferences about spatiotemporal variation in dengue virus transmission are sensitive to assumptions about human mobility: a case study using geolocated tweets from Lahore, Pakistan
    Kraemer, Moritz U. G.
    Bisanzio, D.
    Reiner, R. C.
    Zakar, R.
    Hawkins, J. B.
    Freifeld, C. C.
    Smith, D. L.
    Hay, S. I.
    Brownstein, J. S.
    Perkins, T. Alex
    EPJ DATA SCIENCE, 2018, 7
  • [29] Automated Detection of Vaping-Related Tweets on Twitter During the 2019 EVALI Outbreak Using Machine Learning Classification
    Ren, Yang
    Wu, Dezhi
    Singh, Avineet
    Kasson, Erin
    Huang, Ming
    Cavazos-Rehg, Patricia
    FRONTIERS IN BIG DATA, 2022, 5
  • [30] Semantic Characterization of Tweets Using Topic Models: A Use Case in the Entertainment Domain
    Garcia-Silva, Andres
    Rodriguez-Doncel, Victor
    Corcho, Oscar
    INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2013, 9 (03) : 1 - 13