Comparing Effectiveness of Machine Learning Methods for Diagnosis of Deep Vein Thrombosis

被引:1
作者
Sorano, Ruslan [1 ]
Magnusson, Lars V. [1 ]
Abbas, Khurshid [1 ]
机构
[1] Ostfold Univ Coll, Halden, Norway
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART V | 2022年 / 13381卷
关键词
Deep vein thrombosis; DVT; VTE; Machine learning; VENOUS THROMBOEMBOLISM; D-DIMER; MARKERS;
D O I
10.1007/978-3-031-10548-7_21
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper presents the results of a comparative study of machine learning techniques when predicting deep vein thrombosis. We used the Ri-Schedule dataset with Electronic Health Records of suspected thrombotic patients for training and validation. A total of 1653 samples and 59 predictors were included in this study. We have compared 20 standard machine learning algorithms and identified the best-performing ones: Random Forest, XGBoost, Gradient-Boosting and HistGradientBoosting classifiers. After hyper-parameter optimization, the best overall accuracy of 0.91 was shown by Gradient-Boosting classifier using only 15 of the original variables. We have also tuned the algorithms for maximum sensitivity. The best specificity was offered by Random Forests. At maximum sensitivity of 1.0 and specificity of 0.41, the Random Forest model was able to identify 23% additional negative cases over the screening practice in use today. These results suggest that machine learning could offer practical value in real-life implementations if combined with traditional methods for ruling out deep vein thrombosis.
引用
收藏
页码:279 / 293
页数:15
相关论文
共 36 条
[1]  
[Anonymous], 2017, Classification and regression trees, DOI [DOI 10.1201/9781315139470-8, 10.1201/9781315139470-8]
[2]  
Bishop C. M., 1995, Neural Networks for Pattern Recognition
[3]  
Bordes A, 2009, J MACH LEARN RES, V10, P1737
[4]   D-dimer, other markers of haemostasis activation and soluble adhesion molecules in patients with different clinical probabilities of deep vein thrombosis [J].
Bozic, M ;
Blinc, A ;
Stegnar, M .
THROMBOSIS RESEARCH, 2002, 108 (2-3) :107-114
[5]  
Breiman L, 1998, ANN STAT, V26, P841
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]  
Coleman Dawn M, 2012, Expert Opin Med Diagn, V6, P253, DOI 10.1517/17530059.2012.692674
[10]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297