Automatic Text Classification With Large Language Models: A Review of <monospace>openai</monospace> for Zero- and Few-Shot Classification

被引:0
|
作者
Anglin, Kylie L. [1 ]
Ventura, Claudia [1 ]
机构
[1] Univ Connecticut, Storrs, CT 06269 USA
关键词
large language models; LLMs; artificial intelligence; <monospace>openai</monospace>; educational measurement;
D O I
10.3102/10769986241279927
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
While natural language documents, such as intervention transcripts and participant writing samples, can provide highly nuanced insights into educational and psychological constructs, researchers often find these materials difficult and expensive to analyze. Recent developments in machine learning, however, have allowed social scientists to harness the power of artificial intelligence for complex data categorization tasks. One approach, supervised learning, supports high-performance categorization yet still requires a large, hand-labeled training corpus, which can be costly. An alternative approach-zero- and few-shot classification with pretrained large language models-offers a cheaper, compelling alternative. This article considers the application of zero-shot and few-shot classification in educational research. We provide an overview of large language models, a step-by-step tutorial on using the Python openai package for zero-shot and few-shot classification, and a discussion of relevant research considerations for social scientists.<br />
引用
收藏
页数:23
相关论文
共 30 条
  • [1] Large Language Models for Binary Health-Related Question Answering: A Zero- and Few-Shot Evaluation
    Fernandez-Pichel, Marcos
    Losada, David E.
    Pichel, Juan C.
    COMPUTATIONAL SCIENCE, ICCS 2024, PT IV, 2024, 14835 : 325 - 339
  • [2] Large Language Models for Few-Shot Automatic Term Extraction
    Banerjee, Shubhanker
    Chakravarthi, Bharathi Raja
    McCrae, John Philip
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 137 - 150
  • [3] Disaster Tweet Classification Using Fine-Tuned Deep Learning Models Versus Zero and Few-Shot Large Language Models
    Dinani, Soudabeh Taghian
    Caragea, Doina
    Gyawali, Nikesh
    DATA MANAGEMENT TECHNOLOGIES AND APPLICATIONS, DATA 2023, 2024, 2105 : 73 - 94
  • [4] Large Language Models-aided Literature Reviews: A Study on Few-Shot Relevance Classification
    Giobergia, Flavio
    Koudounas, Alkis
    Baralis, Elena
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES, AICT 2024, 2024,
  • [5] Zero- and few-shot prompting of generative large language models provides weak assessment of risk of bias in clinical trials
    Suster, Simon
    Baldwin, Timothy
    Verspoor, Karin
    RESEARCH SYNTHESIS METHODS, 2024, 15 (06) : 988 - 1000
  • [6] Zero-Shot Classification of Art With Large Language Models
    Tojima, Tatsuya
    Yoshida, Mitsuo
    IEEE ACCESS, 2025, 13 : 17426 - 17439
  • [7] Mutual Learning Prototype Network for Few-Shot Text Classification
    Liu, Jun
    Qin, Xiaorui
    Tao, Jian
    Dong, Hongfei
    Li, Xiaoxu
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (03): : 30 - 35
  • [8] Harnessing large language models' zero-shot and few-shot learning capabilities for regulatory research
    Meshkin, Hamed
    Zirkle, Joel
    Arabidarrehdor, Ghazal
    Chaturbedi, Anik
    Chakravartula, Shilpa
    Mann, John
    Thrasher, Bradlee
    Li, Zhihua
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (05)
  • [9] Large Language Models for Text Classification: From Zero-Shot Learning to Instruction-Tuning
    Chae, Youngjin
    Davidson, Thomas
    SOCIOLOGICAL METHODS & RESEARCH, 2025,
  • [10] Structuring medication signeturs as a language regression task: comparison of zero- and few-shot GPT with fine-tuned models
    Garcia-Agundez, Augusto
    Kay, Julia L.
    Li, Jing
    Gianfrancesco, Milena
    Rai, Baljeet
    Hu, Angela
    Schmajuk, Gabriela
    Yazdany, Jinoos
    JAMIA OPEN, 2024, 7 (02)