Evaluation of artificial intelligence models for osteoarthritis of the knee using deep learning algorithms for orthopedic radiographs

被引:9
作者
Tiwari, Anjali [1 ]
Poduval, Murali [2 ]
Bagaria, Vaibhav [1 ,3 ]
机构
[1] Sir HN Reliance Fdn Hosp & Res Ctr, Dept OfOrthoped, Raja Rammohan Roy Rd, Mumbai 400004, Maharashtra, India
[2] Tata Consultancy Serv, Lifesci Engn, Mumbai 400096, Maharashtra, India
[3] Columbia Asia Hosp, Dept Orthoped, Mumbai 400004, Maharashtra, India
来源
WORLD JOURNAL OF ORTHOPEDICS | 2022年 / 13卷 / 06期
关键词
Osteoarthritis; Artificial intelligence; Knee; Computer vision; Machine leaning; Deep learning;
D O I
10.5312/wjo.v13.i6.603
中图分类号
R826.8 [整形外科学]; R782.2 [口腔颌面部整形外科学]; R726.2 [小儿整形外科学]; R62 [整形外科学(修复外科学)];
学科分类号
摘要
BACKGROUND Deep learning, a form of artificial intelligence, has shown promising results for interpreting radiographs. In order to develop this niche machine learning (ML) program of interpreting orthopedic radiographs with accuracy, a project named deep learning algorithm for orthopedic radiographs was conceived. In the first phase, the diagnosis of knee osteoarthritis (KOA) as per the standard Kellgren-Lawrence (KL) scale in medical images was conducted using the deep learning algorithm for orthopedic radiographs. AIM To compare efficacy and accuracy of eight different transfer learning deep learning models for detecting the grade of KOA from a radiograph and identify the most appropriate ML-based model for the detecting grade of KOA. METHODS The study was performed on 2068 radiograph exams conducted at the Department of Orthopedic Surgery, Sir HN Reliance Hospital and Research Centre (Mumbai, India) during 2019-2021. Three orthopedic surgeons reviewed these independently, graded them for the severity of KOA as per the KL scale and settled disagreement through a consensus session. Eight models, namely ResNet50, VGG-16, InceptionV3, MobilnetV2, EfficientnetB7, DenseNet201, Xception and NasNetMobile, were used to evaluate the efficacy of ML in accurately classifying radiographs for KOA as per the KL scale. Out of the 2068 images, 70% were used initially to train the model, 10% were used subsequently to test the model, and 20% were used finally to determine the accuracy of and validate each model. The idea behind transfer learning for KOA grade image classification is that if the existing models are already trained on a large and general dataset, these models will effectively serve as generic models to fulfill the study's objectives. Finally, in order to benchmark the efficacy, the results of the models were also compared to a first-year orthopedic trainee who independently classified these models according to the KL scale. RESULTS Our network yielded an overall high accuracy for detecting KOA, ranging from 54% to 93%. The most successful of these was the DenseNet model, with accuracy up to 93%; interestingly, it even outperformed the human first-year trainee who had an accuracy of 74%. CONCLUSION The study paves the way for extrapolating the learning using ML to develop an automated KOA classification tool and enable healthcare professionals with better decision-making.
引用
收藏
页码:603 / 614
页数:12
相关论文
共 19 条
[1]   Predicting Early Symptomatic Osteoarthritis in the Human Knee Using Machine Learning Classification of Magnetic Resonance Images From the Osteoarthritis Initiative [J].
Ashinsky, Beth G. ;
Bouhrara, Mustapha ;
Coletta, Christopher E. ;
Lehallier, Benoit ;
Urish, Kenneth L. ;
Lin, Ping-Chang ;
Goldberg, Ilya G. ;
Spencer, Richard G. .
JOURNAL OF ORTHOPAEDIC RESEARCH, 2017, 35 (10) :2243-2250
[2]   Treatment modalities for hip and knee osteoarthritis: A systematic review of safety [J].
Aweid, Osama ;
Haider, Zakir ;
Saed, Abdel ;
Kalairajah, Yegappan .
JOURNAL OF ORTHOPAEDIC SURGERY, 2018, 26 (03)
[3]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]   Bone Tumor Diagnosis Using a Na⟨ve Bayesian Model of Demographic and Radiographic Features [J].
Do, Bao H. ;
Langlotz, Curtis ;
Beaulieu, Christopher F. .
JOURNAL OF DIGITAL IMAGING, 2017, 30 (05) :640-647
[5]   Artificial intelligence in medicine [J].
Hamet, Pavel ;
Tremblay, Johanne .
METABOLISM-CLINICAL AND EXPERIMENTAL, 2017, 69 :S36-S40
[6]  
Hsu H., 2022, StatPearls
[7]   A survey of the recent architectures of deep convolutional neural networks [J].
Khan, Asifullah ;
Sohail, Anabia ;
Zahoora, Umme ;
Qureshi, Aqsa Saeed .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (08) :5455-5516
[8]   Artificial intelligence in fracture detection: transfer learning from deep convolutional neural networks [J].
Kim, D. H. ;
MacKinnon, T. .
CLINICAL RADIOLOGY, 2018, 73 (05) :439-445
[9]   Microtextured CoCrMo alloy for use in metal-on-polyethylene prosthetic joint bearings: Multi-directional wear and corrosion measurements [J].
Langhorn, J. ;
Borjali, A. ;
Hippensteel, E. ;
Nelson, W. ;
Raeymaekers, B. .
TRIBOLOGY INTERNATIONAL, 2018, 124 :178-183
[10]   Deep neural network improves fracture detection by clinicians [J].
Lindsey, Robert ;
Daluiski, Aaron ;
Chopra, Sumit ;
Lachapelle, Alexander ;
Mozer, Michael ;
Sicular, Serge ;
Hanel, Douglas ;
Gardner, Michael ;
Gupta, Anurag ;
Hotchkiss, Robert ;
Potter, Hollis .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (45) :11591-11596