Temporary Topic Models in Social Sciences: A Study on STM

被引:0
作者
Kurnaz, Ahmet [1 ]
Unver, H. Akin [2 ]
机构
[1] Canakkale Onsekiz Mart Univ, Siyaset Bilimi & Kamu Yonetimi Bolumu, Canakkale, Turkey
[2] Ozyegin Univ, Uluslararasi Iliskiler Bolumu, Istanbul, Turkey
来源
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU | 2022年
关键词
topic models; STM; content analysis; text mining; social media; TWITTER; TEXT;
D O I
10.1109/SIU55565.2022.9864923
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Topic models are rapidly becoming popular in social sciences. However, researchers should pay attention to some critical steps while using these models. The format and content of the textual data, language, existence of covariates, and preprocessing steps are the most crucial elements of a topic model analysis. This study inspects the effect of various datasets and preprocessing steps on Structural Topic Models (STM). Results shows that preprocessing, which depends on the research question, profoundly affects the model performance. Besides, the existence of multilingual data weakens the topic quality. Also, the algorithm performance is different among long and short texts. Last, the potential usage of covariates in the model enhances its functionality in social science.
引用
收藏
页数:4
相关论文
共 24 条
  • [1] Local stakeholders' narratives about large-scale urban development: The Zhejiang Hangzhou Future Sci-Tech City
    Anzoise, Valentina
    Slanzi, Debora
    Poli, Irene
    [J]. URBAN STUDIES, 2020, 57 (03) : 655 - 671
  • [2] Chuang J., 2014, Computer-Assisted Content Analysis: Topic Models for Exploring Multiple Subjective Interpretations, P9
  • [3] The Evolution of Topic Modeling
    Churchill, Rob
    Singh, Lisa
    [J]. ACM COMPUTING SURVEYS, 2022, 54 (10S)
  • [4] May it please the twitterverse: The use of Twitter by state high court judges
    Curry, Todd A.
    Fix, Michael P.
    [J]. JOURNAL OF INFORMATION TECHNOLOGY & POLITICS, 2019, 16 (04) : 379 - 393
  • [5] Eles PT, 2016, 2016 INTERNATIONAL CONFERENCE ON MILITARY COMMUNICATIONS AND INFORMATION SYSTEMS (ICMCIS)
  • [6] Measuring Indirect External Threat: A Time Series Sentiment Analysis of NATO Resolutions
    Fay, Erik M.
    [J]. 2020 IEEE 7TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2020), 2020, : 767 - 768
  • [7] Collective sense-making in times of crisis: Connecting terror management theory with Twitter user reactions to the Berlin terrorist attack
    Fischer-Pressler, Diana
    Schwemmer, Carsten
    Fischbach, Kai
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2019, 100 : 138 - 151
  • [8] Immigration-related Speechmaking in a Party-constrained Parliament: Evidence from the 'Refugee Crisis' of the 18th German Bundestag (2013-2017)
    Geese, Lucas
    [J]. GERMAN POLITICS, 2020, 29 (02) : 201 - 222
  • [9] Toward understanding 17th century English culture: A structural topic model of Francis Bacon's ideas
    Grajzl, Peter
    Murrell, Peter
    [J]. JOURNAL OF COMPARATIVE ECONOMICS, 2019, 47 (01) : 111 - 135
  • [10] Machine Learning for Social Science: An Agnostic Approach
    Grimmer, Justin
    Roberts, Margaret E.
    Stewart, Brandon M.
    [J]. ANNUAL REVIEW OF POLITICAL SCIENCE, VOL 24, 2021, 2021, 24 : 395 - 419