Evaluating large language models in analysing classroom dialogue

被引:1
作者
Long, Yun [1 ]
Luo, Haifeng [1 ]
Zhang, Yu [1 ]
机构
[1] Tsinghua Univ, Inst Educ, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
TALK; STUDENTS;
D O I
10.1038/s41539-024-00273-3
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
This study explores the use of Large Language Models (LLMs), specifically GPT-4, in analysing classroom dialogue-a key task for teaching diagnosis and quality improvement. Traditional qualitative methods are both knowledge- and labour-intensive. This research investigates the potential of LLMs to streamline and enhance this process. Using datasets from middle school mathematics and Chinese classes, classroom dialogues were manually coded by experts and then analysed with a customised GPT-4 model. The study compares manual annotations with GPT-4 outputs to evaluate efficacy. Metrics include time efficiency, inter-coder agreement, and reliability between human coders and GPT-4. Results show significant time savings and high coding consistency between the model and human coders, with minor discrepancies. These findings highlight the strong potential of LLMs in teaching evaluation and facilitation.
引用
收藏
页数:10
相关论文
共 52 条
  • [1] 2023, Arxiv, DOI [arXiv:2303.08774, 10.48550/arXiv.2303.08774]
  • [2] Alexander R. J., 2008, Towards dialogic teaching: Rethinking classroom talk, V4th
  • [3] Alic S, 2022, Arxiv, DOI arXiv:2208.04715
  • [4] Brown TB, 2020, Arxiv, DOI [arXiv:2005.14165, 10.48550/arXiv.2005.14165, DOI 10.48550/ARXIV.2005.14165]
  • [5] Improving the effectiveness of collaborative group work in primary schools: effects on science attainment
    Baines, Ed
    Blatchford, Peter
    Chowne, Anne
    [J]. BRITISH EDUCATIONAL RESEARCH JOURNAL, 2007, 33 (05) : 663 - 680
  • [6] Baker M.J., 2016, DIALOGUE ARGUMENTATI
  • [7] Reward, Punishment, and Cooperation: A Meta-Analysis
    Balliet, Daniel
    Mulder, Laetitia B.
    Van Lange, Paul A. M.
    [J]. PSYCHOLOGICAL BULLETIN, 2011, 137 (04) : 594 - 615
  • [8] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
    Bender, Emily M.
    Gebru, Timnit
    McMillan-Major, Angelina
    Shmitchell, Shmargaret
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 610 - 623
  • [9] Boaler J, 2008, TEACH COLL REC, V110, P608
  • [10] Bommasani R., 2021, arXiv, DOI DOI 10.48550/ARXIV.2108.07258