Performing an Inductive Thematic Analysis of Semi-Structured Interviews With a Large Language Model: An Exploration and Provocation on the Limits of the Approach

被引:52
作者
De Paoli, Stefano [1 ,2 ]
机构
[1] Abertay Univ, Digital Soc, Dundee, Scotland
[2] Abertay Univ, Sociol Div, Bell St, Dundee DD1 1HG, Scotland
关键词
large language models; thematic analysis; qualitative research; human- AI collaboration;
D O I
10.1177/08944393231220483
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Large Language Models (LLMs) have emerged as powerful generative Artificial Intelligence solutions. This paper presents results and reflections of an experiment done with the LLM GPT 3.5-Turbo to perform an inductive Thematic Analysis (TA). Previous research has worked on conducting deductive analysis. Thematic Analysis is a qualitative method for analysis commonly used in social sciences and it is based on interpretations by the human analyst(s) and the identification of explicit and latent meanings in qualitative data. The paper presents the motivations for attempting this analysis; it reflects on how the six phases to a TA proposed by Braun and Clarke can partially be reproduced with the LLM and it reflects on what are the model's outputs. The paper uses two datasets of open access semi-structured interviews, previously analysed by other researchers. The first dataset contains interviews with videogame players, and the second is a dataset of interviews with lecturers teaching data science in a University. This paper used the analyses previously conducted on these datasets to compare with the results produced by the LLM. The results show that the model can infer most of the main themes from previous research. This shows that using LLMs to perform an inductive TA is viable and offers a good degree of validity. The discussion offers some recommendations for working with LLMs in qualitative analysis.
引用
收藏
页码:997 / 1019
页数:23
相关论文
共 32 条
[1]   Three Gaps in Computational Text Analysis Methods for Social Sciences: A Research Agenda [J].
Baden, Christian ;
Pipal, Christian ;
Schoonvelde, Martijn ;
van der Velden, Mariken A. C. G. .
COMMUNICATION METHODS AND MEASURES, 2022, 16 (01) :1-18
[2]   ChatGPT: five priorities for research [J].
Bockting, Claudi ;
van Dis, Eva A. M. ;
Bollen, Johan ;
van Rooij, Robert ;
Zuidema, Willem L. .
NATURE, 2023, 614 (7947) :224-226
[3]  
Braun V., 2006, Qualitative Research in Psychology, V3, P77, DOI [DOI 10.1191/1478088706QP063OA, DOI 10.1080/14780887.2020.1769238]
[4]  
Curty R., 2022, TEACHING UNDERGRADUA, DOI DOI 10.25349/D9402J
[5]  
Curty R., 2021, TEACHING UNDERGRADUA
[6]   ChatGPT for (Finance) research: The Bananarama Conjecture [J].
Dowling, Michael ;
Lucey, Brian .
FINANCE RESEARCH LETTERS, 2023, 53
[7]  
Flick U., 2018, SAGE HDB QUALITATIVE, DOI [10.4135/9781526416070, 10.4135/9781526416070.n4, DOI 10.4135/9781526416070]
[8]  
Floridi L, 2023, SSRN Electronic Journal, DOI [10.2139/ssrn.4358789, 10.1007/s13347-023-00621-y, 10.1007/s13347-023-00621-y, 10.2139/ssrn.4358789, DOI 10.2139/SSRN.4358789, DOI 10.1007/S13347-023-00621-Y]
[9]  
Gao J., 2023, COLLABCODER GPT POWE, DOI DOI 10.48550/ARXIV.2304.07366
[10]  
Gauthier Robert P., 2022, Proceedings of the ACM on Human-Computer Interaction, V6, DOI [10.1145/3492844, 10.1145/3492844]