Investigating the Classification of Living Kidney Donation Experiences on Reddit and Understanding the Sensitivity of ChatGPT to Prompt Engineering: Content Analysis

被引:0
作者
Nielsen, Joshua [1 ]
Chen, Xiaoyu [2 ]
Davis, Lashara [3 ]
Waterman, Amy [3 ]
Gentili, Monica [1 ]
机构
[1] Univ Louisville, JB Speed Sch Engn, Dept Ind Engn, 220 Eastern Pkwy, Louisville, KY 40292 USA
[2] SUNY Buffalo, Sch Engn & Appl Sci, Dept Ind & Syst Engn, Buffalo, NY USA
[3] Houston Methodist Hosp, Dept Surg, Patient Engagement Divers & Educ Div, Houston, TX USA
来源
JMIR AI | 2025年 / 4卷
基金
美国国家科学基金会;
关键词
prompt engineering; generative artificial intelligence; kidney donation; transplant; living donor; SOCIAL MEDIA; ORGAN DONATION; TRANSPLANTATION; OUTCOMES; DONORS;
D O I
10.2196/57319
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Living kidneydonation (LKD), whereindividuals donateonekidney whilealive,playsacriticalroleinincreasing the number of kidneys available for those experiencing kidney failure. Previous studies show that many generous people are interested in becoming living donors; however, a huge gap exists between the number of patients on the waiting list and the number of living donors yearly. Objective: To bridge this gap, we aimed to investigate how to identify potential living donors from discussions on public social media forums so that educational interventions could later be directed to them. Methods: Using Reddit forums as an example, this study described the classification of Reddit content shared about LKD into threeclasses: (1) present(presentlydealing with LKD personally), (2) past (dealtwith LKD personally in the past), and (3) other (LKD general comments). An evaluation was conducted comparing a fine-tuned distilled version of the Bidirectional Encoder Representations fromTransformers (BERT) model with inference using GPT-3.5 (ChatGPT). To systematically evaluate ChatGPT's sensitivity to distinguishing between the 3 prompt categories, we used a comprehensive prompt engineering strategy encompassing a full factorial analysis in 48 runs. A novel prompt engineering approach, dialogue until classification consensus, was introduced to simulate a deliberation between 2 domain experts until a consensus on classification was achieved. Results: BERT and GPT-3.5 exhibited classification accuracies of approximately 75% and 78%, respectively. Recognizing the inherent ambiguity between classes, a post hoc analysis of incorrect predictions revealed sensible reasoning and acceptable errors in the predictive models. Considering these acceptable mismatched predictions, the accuracy improved to 89.3% for BERT and 90.7% for GPT-3.5. Conclusions:Large language models, such as GPT-3.5, are highly capable of detecting and categorizing LKD-targeted content on social media forums. They are sensitive to instructions, and the introduced dialogue until classification consensus method exhibited superior performance over stand-alone reasoning, highlighting the meritin advancing prompt engineering methodologies. The models can produce appropriate contextual reasoning, even when final conclusions differ from their human counterparts.
引用
收藏
页数:17
相关论文
共 48 条
[1]   Kidney transplantation as primary therapy for end-stage renal disease:: A National Kidney Foundation/Kidney Disease Outcomes Quality Initiative (NKF/KDOQI™) conference [J].
Abecassis, Michael ;
Bartlett, Stephen T. ;
Collins, Allan J. ;
Davis, Connie L. ;
Delmonico, Francis L. ;
Friedewald, John J. ;
Hays, Rebecca ;
Howard, Andrew ;
Jones, Edward ;
Leichtman, Alan B. ;
Merion, Robert M. ;
Metzger, Robert A. ;
Pradel, Francoise ;
Schweitzer, Eugene J. ;
Velez, Ruben L. ;
Gaston, Robert S. .
CLINICAL JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2008, 3 (02) :471-480
[2]   Transformer models for text-based emotion detection: a review of BERT-based approaches [J].
Acheampong, Francisca Adoma ;
Nunoo-Mensah, Henry ;
Chen, Wenyu .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) :5789-5829
[3]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[4]  
Ali Hazrat, 2023, 2023 IEEE International Conference on Artificial Intelligence, Blockchain, and Internet of Things (AIBThings), P1, DOI 10.1109/AIBThings58340.2023.10292475
[5]   Trends in Living Donation by Race and Ethnicity Among Children With End-stage Renal Disease in the United States, 1995-2015 [J].
Amaral, Sandra ;
McCulloch, Charles E. ;
Black, Elizabeth ;
Winnicki, Erica ;
Lee, Brian ;
Roll, Garret R. ;
Grimes, Barbara ;
Ku, Elaine .
TRANSPLANTATION DIRECT, 2020, 6 (07) :E570
[6]  
[Anonymous], [222] National Nuclear Data Center. https://www.nndc.bnl.gov/ensdf/. Accessed: 2022-09-01.
[7]   Using Machine Learning to Compare Provaccine and Antivaccine Discourse Among the Public on Social Media: Algorithm Development Study [J].
Argyris, Young Anna ;
Monu, Kafui ;
Tan, Pang-Ning ;
Aarts, Colton ;
Jiang, Fan ;
Wiseley, Kaleigh Anne .
JMIR PUBLIC HEALTH AND SURVEILLANCE, 2021, 7 (06)
[8]   Classifying Comments on Social Media Related to Living Kidney Donation: Machine Learning Training and Validation Study [J].
Asghari, Mohsen ;
Nielsen, Joshua ;
Gentili, Monica ;
Koizumi, Naoru ;
Elmaghraby, Adel .
JMIR MEDICAL INFORMATICS, 2022, 10 (11)
[9]   An economic assessment of contemporary kidney transplant practice [J].
Axelrod, David A. ;
Schnitzler, Mark A. ;
Xiao, Huiling ;
Irish, William ;
Tuttle-Newhall, Elizabeth ;
Chang, Su-Hsin ;
Kasiske, Bertram L. ;
Alhamad, Tarek ;
Lentine, Krista L. .
AMERICAN JOURNAL OF TRANSPLANTATION, 2018, 18 (05) :1168-1176
[10]   Opportunities and Challenges in Using Social Media in Organ Donation [J].
Ayorinde, John O. O. ;
Saeb-Parsy, Kourosh ;
Hossain, Ayaz .
JAMA SURGERY, 2020, 155 (09) :797-798