Topic modelling in corpus-based discourse analysis: Uses and critiques

被引:2
作者
Bednarek, Monika [1 ]
机构
[1] Univ Sydney, Brennan MacCallum A18, Sydney, NSW 2006, Australia
关键词
Corpus-assisted discourse studies; corpus-based discourse analysis; corpus linguistics; Jupyter notebooks; topic modelling;
D O I
10.1177/14614456241293075
中图分类号
G2 [信息与知识传播];
学科分类号
05 ; 0503 ;
摘要
This discussion paper aims to take stock of the main uses and critiques of topic modelling in studies that combine corpus linguistics and discourse analysis. Topic modelling is a cover term for a collection of semiautomated techniques that aim to analyse the content of texts. While the paper does not provide a complete synthesis of the use of topic modelling in corpus linguistics, it aims to give an overview of recent uses and critiques of topic modelling within corpus-based discourse analysis and related fields. Such recent uses include topic modelling for corpus exploration and combining topic modelling with other analytic techniques such as keywords analysis, showing an increasing engagement with topic modelling in corpus linguistic studies of discourse(s). However, the critiques levelled at topic modelling are not inconsiderable, although some researchers do see it as a strong and effective method. One conclusion from the research reviewed in this article is that topic modelling can be enhanced through the use of methods from both corpus linguistics and discourse analysis which can reduce some of its limitations. In this respect, it can be argued that linguists have a lot to offer to non-linguistic fields where topic modelling is widely used and that interdisciplinary collaborations could be a fruitful endeavour. The paper is accompanied by invited expert commentaries written by linguists with relevant experience in applying topic modelling. These commentaries provide further comment on its uses and critiques in relation to corpus-based discourse analysis, confirming, challenging, elaborating or extending the points made here. While this article primarily focuses on corpus-based discourse analysis, many of the points should also be of interest to scholars in other fields. Taken together, it is clear that continued exploration of and debate on topic modelling is a worthwhile endeavour, and that new technological developments necessitate such critical engagement.
引用
收藏
页码:659 / 671
页数:13
相关论文
共 33 条
[1]   Weight Stigma: Towards a Language-Informed Analytical Framework [J].
Bednarek, Monika ;
Bray, Carly ;
Vanichkina, Darya P. ;
Brookes, Gavin ;
Bonfiglioli, Catriona ;
Coltman-Patel, Tara ;
Lee, Kelvin ;
Baker, Paul .
APPLIED LINGUISTICS, 2023, 45 (03) :424-448
[2]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[3]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[4]   The utility of topic modelling for discourse studies: A critical evaluation [J].
Brookes, Gavin ;
McEnery, Tony .
DISCOURSE STUDIES, 2019, 21 (01) :3-21
[5]   Operation Heron: latent topic changes in an abusive letter series [J].
Busso, Lucia ;
Petyko, Marton ;
Atkins, Sarah ;
Grant, Tim .
CORPORA, 2022, 17 (02) :225-258
[6]  
Chan H., 2024, TOPSBM AUSTR TEXT AN
[7]   A network approach to topic models [J].
Gerlach, Martin ;
Peixoto, Tiago P. ;
Altmann, Eduardo G. .
SCIENCE ADVANCES, 2018, 4 (07)
[8]   The interpretation of topic models for scholarly analysis: An evaluation and critique of current practice [J].
Gillings, Mathew ;
Hardie, Andrew .
DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (02) :530-543
[9]   Using large text news archives for the analysis of climate change discourse: some methodological observations [J].
Grundmann, Reiner .
JOURNAL OF RISK RESEARCH, 2022, 25 (03) :395-406
[10]   Analysing the opinions of UK veterinarians on practice-based research using corpus linguistic and mathematical methods [J].
Huntley, Selene J. ;
Mahlberg, Michaela ;
Wiegand, Viola ;
van Gennip, Yves ;
Yang, Hui ;
Dean, Rachel S. ;
Brennan, Marnie L. .
PREVENTIVE VETERINARY MEDICINE, 2018, 150 :60-69