A scoping review on the use of natural language processing in research on political polarization: trends and research prospects

Cited by: 11
Authors
Nemeth, Renata [1]
Affiliations
[1] Eotvos Lorand Univ, Fac Social Sci, Res Ctr Computat Social Sci, Budapest, Hungary
Source
JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE | 2023 / Vol. 6 / No. 1
Keywords
Language polarization; Political polarization; Partisan language; Natural language processing; Text mining; Computational text analysis
DOI
10.1007/s42001-022-00196-2
Chinese Library Classification (CLC) number
O1 [Mathematics]; C [Social Sciences, General];
Discipline classification code
03 ; 0303 ; 0701 ; 070101 ;
Abstract
As part of the "text-as-data" movement, Natural Language Processing (NLP) provides a computational way to examine political polarization. We conducted a methodological scoping review of studies published since 2010 (n = 154) to clarify how NLP research has conceptualized and measured political polarization, and to characterize the degree of integration of the two different research paradigms that meet in this research area. We identified biases toward the US context (59%), Twitter data (43%), and machine learning approaches (33%). Research covers different layers of the political public sphere (politicians, experts, media, or the lay public); however, very few studies involved more than one layer. Results indicate that only a few studies made use of domain knowledge and that a high proportion of the studies were not interdisciplinary. Those studies that made efforts to interpret the results demonstrated that the characteristics of political texts depend not only on the political position of their authors, but also on other, often-overlooked factors. Ignoring these factors may lead to overly optimistic performance measures. In addition, spurious results may be obtained when causal relations are inferred from textual data. Our paper provides arguments for the integration of explanatory and predictive modeling paradigms, and for a more interdisciplinary approach to polarization research.
Pages: 289-313
Number of pages: 25