Performance of Generative Artificial Intelligence in Dental Licensing Examinations

被引:17
作者
Chau, Reinhard Chun Wang [1 ]
Thu, Khaing Myat [1 ]
Yu, Ollie Yiru [1 ]
Hsung, Richard Tai-Chiu [1 ,2 ]
Lo, Edward Chin Man [1 ]
Lam, Walter Yu Hang [1 ,3 ]
机构
[1] Univ Hong Kong, Fac Dent, Hong Kong, Peoples R China
[2] Hong Kong Chu Hai Coll, Dept Comp Sci, Hong Kong, Peoples R China
[3] Univ Hong Kong, Musketeers Fdn Inst Data Sci, Hong Kong, Peoples R China
关键词
Artificial intelligence; Communication; Dental education; Digital technology; Examination questions;
D O I
10.1016/j.identj.2023.12.007
中图分类号
R78 [口腔科学];
学科分类号
1003 ;
摘要
Objectives: Generative artificial intelligence (GenAI), including large language models (LLMs), has vast potential applications in health care and education. However, it is unclear how proficient LLMs are in interpreting written input and providing accurate answers in dentistry. This study aims to investigate the accuracy of GenAI in answering questions from dental licensing examinations. Methods: A total of 1461 multiple-choice questions from question books for the US and the UK dental licensing examinations were input into 2 versions of ChatGPT 3.5 and 4.0. The passing rates of the US and UK dental examinations were 75.0% and 50.0%, respectively. The performance of the 2 versions of GenAI in individual examinations and dental subjects was analysed and compared. Results: ChatGPT 3.5 correctly answered 68.3% (n = 509) and 43.3% (n = 296) of questions from the US and UK dental licensing examinations, respectively. The scores for ChatGPT 4.0 were 80.7% (n = 601) and 62.7% (n = 429), respectively. ChatGPT 4.0 passed both written dental licensing examinations, whilst ChatGPT 3.5 failed. ChatGPT 4.0 answered 327 more questions correctly and 102 incorrectly compared to ChatGPT 3.5 when comparing the 2 versions. Conclusions: The newer version of GenAI has shown good proficiency in answering multiplechoice questions from dental licensing examinations. Whilst the more recent version of GenAI generally performed better, this observation may not hold true in all scenarios, and further improvements are necessary. The use of GenAI in dentistry will have significant implications for dentist-patient communication and the training of dental professionals. (c) 2023 The Authors. Published by Elsevier Inc. on behalf of FDI World Dental Federation. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)
引用
收藏
页码:616 / 621
页数:6
相关论文
共 36 条
  • [11] Role of Chat GPT in Public Health
    Biswas, Som S.
    [J]. ANNALS OF BIOMEDICAL ENGINEERING, 2023, 51 (05) : 868 - 869
  • [12] Chau RCW., 2023, J California Dent Assoc, V51
  • [13] A Systematic Review of the Use of mHealth in Oral Health Education among Older Adults
    Chau, Reinhard Chun Wang
    Thu, Khaing Myat
    Chaurasia, Akhilanand
    Hsung, Richard Tai Chiu
    Lam, Walter Yu-Hang
    [J]. DENTISTRY JOURNAL, 2023, 11 (08)
  • [14] Accuracy of Artificial Intelligence-Based Photographic Detection of Gingivitis
    Chau, Reinhard Chun Wang
    Li, Guan-Hua
    Tew, In Meei
    Thu, Khaing Myat
    McGrath, Colman
    Lo, Wai-Lun
    Ling, Wing-Kuen
    Hsung, Richard Tai-Chiu
    Lam, Walter Yu Hang
    [J]. INTERNATIONAL DENTAL JOURNAL, 2023, 73 (05) : 724 - 730
  • [15] Accuracy of arti fi cial intelligence-designed single-molar dental prostheses: A feasibility study
    Chau, Reinhard Chun Wang
    Hsung, Richard Tai-Chiu
    McGrath, Colman
    Pow, Edmond Ho Nang
    Lam, Walter Yu Hang
    [J]. JOURNAL OF PROSTHETIC DENTISTRY, 2024, 131 (06) : 1111 - 1117
  • [16] Dashti M, 2023, J Prosthet Dent, VS0022-3913, P00371
  • [17] Dowd FJ, 2007, Mosby's review for the NBDE part two
  • [18] "So what if ChatGPT wrote it?" Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy
    Dwivedi, Yogesh K.
    Kshetri, Nir
    Hughes, Laurie
    Slade, Emma Louise
    Jeyaraj, Anand
    Kar, Arpan Kumar
    Baabdullah, Abdullah M.
    Koohang, Alex
    Raghavan, Vishnupriya
    Ahuja, Manju
    Albanna, Hanaa
    Albashrawi, Mousa Ahmad
    Al-Busaidi, Adil S.
    Balakrishnan, Janarthanan
    Barlette, Yves
    Basu, Sriparna
    Bose, Indranil
    Brooks, Laurence
    Buhalis, Dimitrios
    Carter, Lemuria
    Chowdhury, Soumyadeb
    Crick, Tom
    Cunningham, Scott W.
    Davies, Gareth H.
    Davison, Robert M.
    De, Rahul
    Dennehy, Denis
    Duan, Yanqing
    Dubey, Rameshwar
    Dwivedi, Rohita
    Edwards, John S.
    Flavian, Carlos
    Gauld, Robin
    Grover, Varun
    Hu, Mei-Chih
    Janssen, Marijn
    Jones, Paul
    Junglas, Iris
    Khorana, Sangeeta
    Kraus, Sascha
    Larsen, Kai R.
    Latreille, Paul
    Laumer, Sven
    Malik, F. Tegwen
    Mardani, Abbas
    Mariani, Marcello
    Mithas, Sunil
    Mogaji, Emmanuel
    Nord, Jeretta Horn
    O'Connor, Siobhan
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2023, 71
  • [19] Fan K, 2014, MCQs for dentistry, V3rd
  • [20] Original Paper Performance of ChatGPT on the Peruvian National Licensing Medical Examination: Cross-Sectional Study
    Flores-Cohaila, Javier A.
    Garcia-Vicente, Abigail
    Vizcarra-Jimenez, Sonia F.
    De la Cruz-Galan, Janith
    Gutierrez-Arratia, Jesus
    Torres, Blanca Geraldine Quiroga
    Taype-Rondan, Alvaro
    [J]. JMIR MEDICAL EDUCATION, 2023, 9