A Machine Learning workflow for Diagnosis of Knee Osteoarthritis with a focus on post-hoc explainability

被引:4
作者
Kokkotis, Christos [1 ,2 ]
Moustakidis, Serafeim [3 ]
Papageorgiou, Elpiniki [1 ,4 ]
Giakas, Giannis [2 ]
Tsaopoulos, Dimitrios [1 ]
机构
[1] Ctr Res & Technol Hellas, Inst Bioecon & Agritechnol, Volos, Greece
[2] Univ Thessaly, Dept Phys Educ & Sport Sci, Trikala, Greece
[3] AIDEAS OU, Narva Mnt 5, Tallinn, Harju Maakond, Estonia
[4] Univ Thessaly, Fac Technol, Larisa, Greece
来源
2020 11TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA 2020) | 2020年
基金
欧盟地平线“2020”;
关键词
knee osteoarthritis; diagnosis; feature selection; machine learning; interpretation; KL-grade;
D O I
10.1109/iisa50023.2020.9284354
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Knee Osteoarthritis (KOA) is a multifactorial disease-causing joint pain, deformity and dysfunction. The aim of this paper is to provide a data mining approach that could identify important risk factors which contribute to the diagnosis of KOA and their impact on model output, with a focus on post-hoc explainability. Data were obtained from the osteoarthritis initiative (OAI) database enrolling people, with nonsymptomatic KOA and symptomatic KOA or being at high risk of developing KOA. The current study considered multidisciplinary data from heterogeneous sources such as questionnaire data, physical activity indexes, self-reported data about joint symptoms, disability and function as well as general health and physical exams' data from individuals with or without KOA from the baseline visit. For the data mining part, a robust feature selection methodology was employed consisting of filter, wrapper and embedded techniques whereas feature ranking was decided on the basis of a majority vote scheme. The validation of the extracted factors was performed in subgroups employing seven well-known classifiers. A 77.88 % classification accuracy was achieved by Logistic Regression on the group of the first forty selected (40) risk factors. We investigated the behavior of the best model, with respect to classification errors and the impact of used features, to confirm their clinical relevance. The interpretation of the model output was performed by SHAP. The results are the basis for the development of easy-to-use diagnostic tools for clinicians for the early detection of KOA.
引用
收藏
页码:247 / 253
页数:7
相关论文
共 22 条
  • [1] Al Daoud E., 2019, International Journal of Computer and Information Engineering, V13, P6, DOI DOI 10.5281/ZENODO.3607805
  • [2] [Anonymous], 2016, PLOS ONE, DOI DOI 10.1371/JOURNAL.PONE.0148724
  • [3] Biesiada J, 2007, ADV INTEL SOFT COMPU, V45, P242
  • [4] Exploring deep learning capabilities in knee osteoarthritis case study for classification
    Christodoulou, Eirini
    Moustakidis, Serafeim
    Papandrianos, Nikolaos
    Tsaopoulos, Dimitrios
    Papageorgiou, Elpiniki
    [J]. 2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2019, : 271 - 276
  • [5] Du YD, 2017, IEEE INT C BIOINFORM, P671, DOI 10.1109/BIBM.2017.8217734
  • [6] Gornale S. S., 2017, INT J IMAGE GRAPHICS, V9
  • [7] Ikram ST, 2017, J KING SAUD UNIV-COM, V29, P462, DOI 10.1016/j.jksuci.2015.12.004
  • [8] PREDICTORS AFFECTING BALANCE PERFORMANCES IN PATIENTS WITH KNEE OSTEOARTHRITIS USING DECISION TREE ANALYSIS
    Kobayashi, T.
    Kannari, T.
    Horiuchi, H.
    Matsui, N.
    Ito, T.
    Nojin, K.
    Kakuse, K.
    Okawa, M.
    Yamanaka, M.
    [J]. OSTEOARTHRITIS AND CARTILAGE, 2019, 27 : S243 - S243
  • [9] Kokkotis C, 2020, Osteoarthr Cartil Open, V2, P100069, DOI 10.1016/j.ocarto.2020.100069
  • [10] Detecting knee osteoarthritis and its discriminating parameters using random forests
    Kotti, Margarita
    Duffell, Lynsey D.
    Faisal, Aldo A.
    McGregor, Alison H.
    [J]. MEDICAL ENGINEERING & PHYSICS, 2017, 43 : 19 - 29