Healthy nutrition and weight management for a positive pregnancy experience in the antenatal period: Comparison of responses from artificial intelligence models on nutrition during pregnancy

被引:0
|
作者
Karacan, Emine [1 ]
机构
[1] Iskenderun Tech Univ, Dortyol Vocat Sch Hlth Serv, Hatay, Turkiye
关键词
Gestational weight gain; Maternal nutrition; Pregnancy; Artificial intelligence; Nursing; DIETARY GUIDELINES; ADVICE; GRADE; WOMEN;
D O I
10.1016/j.ijmedinf.2024.105663
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: As artificial intelligence AI-supported applications become integral to web-based informationseeking, assessing their impact on healthy nutrition and weight management during the antenatal period is crucial. Objective: This study was conducted to evaluate both the quality and semantic similarity of responses created by AI models to the most frequently asked questions about healthy nutrition and weight management during the antenatal period, based on existing clinical knowledge. Methods: In this study, a cross-sectional assessment design was used to explore data from 3 AI models (GPT-4, MedicalGPT, Med-PaLM). We directed the most frequently asked questions about nutrition during pregnancy, obtained from the American College of Obstetricians and Gynecologists (ACOG) to each model in a new and single session on October 21, 2023, without any prior conversation. Immediately after, instructions were given to the AI models to generate responses to these questions. The responses created by AI models were evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) scale. Additionally, to assess the semantic similarity between answers to 31 pregnancy nutrition-related frequently asked questions sourced from the ACOG and responses from AI models we evaluated cosine similarity using both WORD2VEC and BioLORD-2023. Results: Med-PaLM outperformed GPT-4 and MedicalGPT in response quality (mean = 3.93), demonstrating superior clinical accuracy over both GPT-4 (p = 0.016) and MedicalGPT (p = 0.001). GPT-4 had higher quality than MedicalGPT (p = 0.027). The semantic similarity between ACOG and Med-PaLM is higher with WORD2VEC (0.92) compared to BioLORD2023 (0.81), showing a difference of +0.11. The similarity scores for ACOG-MedicalGPT and ACOG-GPT-4 are similar across both models, with minimal differences of -0.01. Overall, WORD2VEC has a slightly higher average similarity (0.82) than BioLORD-2023 (0.79), with a difference of +0.03. Conclusions: Despite the superior performance of Med-PaLM, there is a need for further evidence-based research and improvement in the integration of AI in healthcare due to varying AI model performances.
引用
收藏
页数:7
相关论文
共 2 条
  • [1] Be Healthy in Pregnancy (BHIP): A Randomized Controlled Trial of Nutrition and Exercise Intervention from Early Pregnancy to Achieve Recommended Gestational Weight Gain
    Atkinson, Stephanie A.
    Maran, Atherai
    Dempsey, Kendra
    Perreault, Maude
    Vanniyasingam, Thuva
    Phillips, Stuart M.
    Hutton, Eileen K.
    Mottola, Michelle F.
    Wahoush, Olive
    Xie, Feng
    Thabane, Lehana
    NUTRIENTS, 2022, 14 (04)
  • [2] Eating for 1, Healthy and Active for 2; feasibility of delivering novel, compact training for midwives to build knowledge and confidence in giving nutrition, physical activity and weight management advice during pregnancy
    Basu, Andrea
    Kennedy, Lynne
    Tocque, Karen
    Jones, Sharn
    BMC PREGNANCY AND CHILDBIRTH, 2014, 14