Development of lumbar spine MRI referrals vetting models using machine learning and deep learning algorithms: Comparison models vs healthcare professionals

被引:7
作者
Alanazi, A. H. [1 ,2 ]
Cradock, A. [1 ]
Rainford, L. [1 ]
机构
[1] Univ Coll Dublin, Sch Med, Radiog & Diagnost Imaging, Dublin, Ireland
[2] Soc Artificial Intelligence Healthcare, Riyadh, Saudi Arabia
关键词
Natural language processing; Machine learning; Deep learning; Referrals' appropriateness; Magnetic resonance imaging; CLINICAL DECISION-SUPPORT; LOW-BACK-PAIN; RADIOLOGY; APPROPRIATENESS;
D O I
10.1016/j.radi.2022.05.005
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Introduction: Referrals vetting is a necessary daily task to ensure the appropriateness of radiology referrals. Vetting requires extensive clinical knowledge and may challenge those responsible. This study aims to develop AI models to automate the vetting process and to compare their performance with healthcare professionals. Methods: 1020 lumbar spine MRI referrals were collected retrospectively from two Irish hospitals. Three expert MRI radiographers classified the referrals into indicated or not indicated for scanning based on iRefer guidelines. The reference label for each referral was assigned based on the majority voting. The corpus was divided into two datasets, one for the models' development with 920 referrals, and one included 100 referrals used as a held-out for the final comparison of the AI models versus national and international MRI radiographers. Three traditional models were developed: SVM, LR, RF, and two deep neural models, including CNN and Bi-LSTM. For the traditional models, four vectorisation techniques applied: BoW, bigrams, trigrams, and TF-IDF. A textual data augmentation technique was applied to investigate the influence of data augmentation on the models' performances. Results: RF with BoW achieved the highest AUC reaching 0.99. CNN model outperformed Bi-LSTM with AUC 1/4 0.98. With the augmented dataset, the performance significantly improved with an increase in F1 scores ranging from 1% to 7%. All models outperformed the national and international radiographers when compared on the hold-out dataset. Conclusion: The models assigned the referrals' appropriateness with higher accuracies than the national and international radiographers. Applying data augmentation significantly improved the models' performances. Implications for practice: The outcomes suggest that the use of AI for checking referrals' eligibility could serve as a supporting tool to improve the referrals' management in radiology departments. (c) 2022 The Author(s). Published by Elsevier Ltd on behalf of The College of Radiographers.
引用
收藏
页码:674 / 683
页数:10
相关论文
共 30 条
[1]   A Dictionary-based Oversampling Approach to Clinical Document Classification on Small and Imbalanced Dataset [J].
Abdollahi, Mahdi ;
Gao, Xiaoying ;
Mei, Yi ;
Ghosh, Shameek ;
Li, Jinyan .
2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, :357-364
[2]   A Text Data Augmentation Approach for Improving the Performance of CNN [J].
Abulaish, Muhammad ;
Sah, Amit Kumar .
2019 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS (COMSNETS), 2019, :660-665
[3]  
Adipradana R., 2021, Bulletin of Electrical Engineering and Informatics, V10, P2130, DOI [10.11591/eei.v10i4.2956, DOI 10.11591/EEI.V10I4.2956]
[4]  
Arras Leila., 2016, P 1 WORKSH REPR LEAR, P1, DOI 10.18653/v1/W16-1601
[5]   Evaluating Two Measures of Lumbar Spine MRI Overuse: Administrative Data Versus Chart Review [J].
Avoundjian, Tigran ;
Gidwani, Risha ;
Yao, Dorcas ;
Lo, Jeanie ;
Sinnott, Patricia ;
Thakur, Neil ;
Barnett, Paul G. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2016, 13 (09) :1057-1066
[6]   Explaining variation in GP referral rates for x-rays for back pain [J].
Baker, Rachel ;
Lecouturier, Jan ;
Bond, Senga .
IMPLEMENTATION SCIENCE, 2006, 1 (1)
[7]   Effectiveness of Clinical Decision Support in Controlling Inappropriate Imaging [J].
Blackmore, C. Craig ;
Mecklenburg, Robert S. ;
Kaplan, Gary S. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2011, 8 (01) :19-25
[8]   Natural Language Processing of Radiology Reports in Patients With Hepatocellular Carcinoma to Predict Radiology Resource Utilization [J].
Brown, A. D. ;
Kachura, J. R. .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2019, 16 (06) :840-844
[9]   Deep Learning to Classify Radiology Free-Text Reports [J].
Chen, Matthew C. ;
Ball, Robyn L. ;
Yang, Lingyao ;
Moradzadeh, Nathaniel ;
Chapman, Brian E. ;
Larson, David B. ;
Langlotz, Curtis P. ;
Amrhein, Timothy J. ;
Lungren, Matthew P. .
RADIOLOGY, 2018, 286 (03) :845-852
[10]   Neural classification of Norwegian radiology reports: using NLP to detect findings in CT-scans of children [J].
Dahl, Fredrik A. ;
Rama, Taraka ;
Hurlen, Petter ;
Brekke, Pal H. ;
Husby, Haldor ;
Gundersen, Tore ;
Nytro, Oystein ;
Ovrelid, Lilja .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01) :84