Using ChatGPT for human-computer interaction research: a primer

Cited by: 49
Authors
Tabone, Wilbert [1]
de Winter, Joost [1]
Affiliations
[1] Delft Univ Technol, Fac Mech Maritime & Mat Engn, Dept Cognit Robot, NL-2628 CD Delft, Netherlands
Keywords
prompt engineering; human-subject research; application programming interface (API); reproducibility; EXPERIENCE; RIGOR; REAL
DOI
10.1098/rsos.231053
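Chinese Library Classification (CLC)
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [Natural Sciences, General]
Discipline classification codes
07; 0710; 09
Abstract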
ChatGPT could serve as a tool for text analysis within the field of Human-Computer Interaction, though its validity requires investigation. This study applied ChatGPT to: (1) textbox questionnaire responses on nine augmented-reality interfaces, (2) interview data from participants who experienced these interfaces in a virtual simulator, and (3) transcribed think-aloud data of participants who viewed a real painting and its replica. Using a hierarchical approach, ChatGPT produced scores or summaries of text batches, which were then aggregated. Results showed that (1) ChatGPT generated sentiment scores of the interfaces that correlated extremely strongly (r > 0.99) with human rating scale outcomes and with a rule-based sentiment analysis method (criterion validity). Additionally, (2) by inputting automatically transcribed interviews to ChatGPT, it provided meaningful meta-summaries of the qualities of the interfaces (face validity). One meta-summary analysed in depth was found to have substantial but imperfect overlap with a content analysis conducted by an independent researcher (criterion validity). Finally, (3) ChatGPT's summary of the think-aloud data highlighted subtle differences between the real painting and the replica (face validity), a distinction corresponding with a keyword analysis (criterion validity). In conclusion, our research indicates that, with appropriate precautions, ChatGPT can be used as a valid tool for analysing text data.
Pages: 21
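The hierarchical approach mentioned in the abstract (score batches of text, then aggregate, then compare with human ratings) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the batch size, the simple mean used for aggregation, and the helper names `batched`, `hierarchical_score`, `pearson_r`, and `toy_score_batch` are all assumptions. In particular, `toy_score_batch` is a hypothetical keyword counter standing in for a language-model call, so the sketch runs without network access.

```python
from statistics import mean
from typing import Callable, List, Sequence


def batched(items: Sequence[str], size: int) -> List[List[str]]:
    """Split items into consecutive batches of at most `size` elements."""
    return [list(items[i:i + size]) for i in range(0, len(items), size)]


def hierarchical_score(
    texts: Sequence[str],
    score_batch: Callable[[List[str]], float],
    batch_size: int = 3,
) -> float:
    """Score each batch separately, then aggregate the batch scores.

    Aggregating with an unweighted mean is an assumption made for this
    sketch; the source does not specify the aggregation rule here.
    """
    batch_scores = [score_batch(batch) for batch in batched(texts, batch_size)]
    return mean(batch_scores)


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equally long numeric sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def toy_score_batch(batch: List[str]) -> float:
    """Hypothetical stand-in for a model call.

    Counts positive minus negative keywords per response in the batch.
    A real pipeline would send the batch to a language model instead.
    """
    positive = {"clear", "helpful", "good"}
    negative = {"confusing", "distracting", "bad"}
    words = " ".join(batch).lower().split()
    hits = sum(w in positive for w in words) - sum(w in negative for w in words)
    return hits / len(batch)
```

To use the sketch, one would call `hierarchical_score(responses, toy_score_batch)` once per interface and pass the resulting list, together with the matching human rating means, to `pearson_r`. Swapping `toy_score_batch` for a function that queries a model leaves the batching and aggregation logic unchanged.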