Chinese Diabetes Question Classification Using Large Language Models and Transfer Learning

被引:0
|
作者
Ge, Chengze [1 ,2 ]
Ling, Hongshun [1 ]
Quan, Fuliang [1 ]
Zeng, Jianping [2 ]
机构
[1] Huimei Technol, Hangzhou, Peoples R China
[2] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
来源
HEALTH INFORMATION PROCESSING: EVALUATION TRACK PAPERS, CHIP 2023 | 2024年 / 2080卷
关键词
Diabetes questions classification; LLM; LoRA Fine-Tuning; Transfer Learning;
D O I
10.1007/978-981-97-1717-0_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Type 2 diabetes has evolved into a significant global public health challenge. Diabetes question-answering services are playing an increasingly important role in providing daily health services for patients and high-risk populations. As one of the evaluation track for CHIP 2023, participants are required to classify diabetes-related questions. We have introduced an approach that utilizes generative open-source large language models to accomplish this task. Initially, we designed a prompt construction method that transforms question-label pairs into a conversational text. Subsequently, we fine-tuned the large language model using LoRA method. Furthermore, to enhance the capability in the medical domain, we employed another open-source dataset for initial fine-tuning of the model, followed by transfer learning to fine-tune the Chinese diabetes questions dataset. Experimental results demonstrate the superiority of our approach, ultimately achieving a score of 92.10 on the test data.
引用
收藏
页码:205 / 213
页数:9
相关论文
共 50 条
  • [21] A Performance Comparison of CNN Models for Bean Phenology Classification Using Transfer Learning Techniques
    Ibarra-Perez, Teodoro
    Jaramillo-Martinez, Ramon
    Correa-Aguado, Hans C.
    Ndjatchi, Christophe
    del Rosario Martinez-Blanco, Ma.
    Guerrero-Osuna, Hector A.
    Mirelez-Delgado, Flabio D.
    Casas-Flores, Jose I.
    Reveles-Martinez, Rafael
    Hernandez-Gonzalez, Umanel A.
    AGRIENGINEERING, 2024, 6 (01): : 841 - 857
  • [22] A comparative study of deep transfer learning models for malware classification using image datasets
    Ranjan, Ranjeet Kumar
    Singh, Amit
    INTERNATIONAL JOURNAL OF INFORMATION AND COMPUTER SECURITY, 2023, 21 (3-4) : 293 - 319
  • [23] Will Transfer Learning Enhance ImageNet Classification Accuracy Using ImageNet-Pretrained Models?
    Ebrahim, Maad
    Al-Ayyoub, Mahmoud
    Alsmirat, Mohammad A.
    2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2019, : 211 - 216
  • [24] Music Genre Classification Using Transfer Learning
    Liang, Beici
    Gu, Minwei
    THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020), 2020, : 392 - 393
  • [25] Banana Disease Classification using Transfer Learning
    Derafi, Muhammad Izzat Faiz Mohd
    Razak, Siti Fatimah Abdul
    Sayeed, Md Shohel
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [26] Improving the Generalization of Deep Learning Classification Models in Medical Imaging Using Transfer Learning and Generative Adversarial Networks
    Venu, Sagar Kora
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2021, 2022, 13251 : 218 - 235
  • [27] ECG Classification using Deep Transfer Learning
    Gajendran, Mohan Kumar
    Khan, Muhammad Zubair
    Khattak, Muazzam A. Khan
    2021 4TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT 2021), 2021, : 1 - 5
  • [28] Classification of Medical Thermograms using Transfer Learning
    Ornek, Ahmet Haydar
    Ceylan, Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [29] Deep Learning Models for Poorly Differentiated Colorectal Adenocarcinoma Classification in Whole Slide Images Using Transfer Learning
    Tsuneki, Masayuki
    Kanavati, Fahdi
    DIAGNOSTICS, 2021, 11 (11)
  • [30] Skin lesion classification using transfer learning
    Nivedhitha, G.
    Kalpana, P.
    Sidthik, A. Sheik
    Rani, V. Anusha
    Singh, Ajith B.
    Rajagopal, R.
    Soft Computing, 2024, 28 (20) : 12337 - 12343