Novel multiclass classification machine learning approach for the early-stage classification of systemic autoimmune rheumatic diseases

被引:0
作者
Wang, Yun [1 ]
Wei, Wei [1 ]
Ouyang, Renren [1 ]
Chen, Rujia [1 ]
Wang, Ting [1 ]
Yuan, Xu [1 ]
Wang, Feng [1 ]
Hou, Hongyan [1 ]
Wu, Shiji [1 ]
机构
[1] Huazhong Univ Sci & Technol, Dept Lab Med, Tongji Hosp, Tongji Med Coll, Wuhan, Hubei, Peoples R China
来源
LUPUS SCIENCE & MEDICINE | 2024年 / 11卷 / 01期
关键词
Autoimmune Diseases; Autoimmunity; Lupus Erythematosus; Systemic; INTERNATIONAL COLLABORATING CLINICS; LUPUS-ERYTHEMATOSUS; AMERICAN-COLLEGE; SJOGRENS-SYNDROME; REVISED CRITERIA; ARTHRITIS; CONSENSUS; AUTOANTIBODIES; DIAGNOSIS; LEAGUE;
D O I
10.1136/lupus-2023-001125
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Objective Systemic autoimmune rheumatic diseases (SARDs) encompass a diverse group of complex conditions with overlapping clinical features, making accurate diagnosis challenging. This study aims to develop a multiclass machine learning (ML) model for early-stage SARDs classification using accessible laboratory indicators.Methods A total of 925 SARDs patients were included, categorised into SLE, Sjogren's syndrome (SS) and inflammatory myositis (IM). Clinical characteristics and laboratory markers were collected and nine key indicators, including anti-dsDNA, anti-SS-A60, anti-Sm/nRNP, antichromatin, anti-dsDNA (indirect immunofluorescence assay), haemoglobin (Hb), platelet, neutrophil percentage and cytoplasmic patterns (AC-19, AC-20), were selected for model building. Various ML algorithms were used to construct a tripartite classification ML model.Results Patients were divided into two cohorts, cohort 1 was used to construct a tripartite classification model. Among models assessed, the random forest (RF) model demonstrated superior performance in distinguishing SLE, IM and SS (with area under curve=0.953, 0.903 and 0.836; accuracy= 0.892, 0.869 and 0.857; sensitivity= 0.890, 0.868 and 0.795; specificity= 0.910, 0.836 and 0.748; positive predictive value=0.922, 0.727 and 0.663; and negative predictive value= 0.854, 0.915 and 0.879). The RF model excelled in classifying SLE (precision=0.930, recall=0.985, F1 score=0.957). For IM and SS, RF model outcomes were (precision=0.793, 0.950; recall=0.920, 0.679; F1 score=0.852, 0.792). Cohort 2 served as an external validation set, achieving an overall accuracy of 87.3%. Individual classification performances for SLE, SS and IM were excellent, with precision, recall and F1 scores specified. SHAP analysis highlighted significant contributions from antibody profiles.Conclusion This pioneering multiclass ML model, using basic laboratory indicators, enhances clinical feasibility and demonstrates promising potential for SARDs classification. The collaboration of clinical expertise and ML offers a nuanced approach to SARDs classification, with potential for enhanced patient care.
引用
收藏
页数:11
相关论文
共 50 条
[31]   A novel multi-modal machine learning based approach for automatic classification of EEG recordings in dementia [J].
Ieracitano, Cosimo ;
Mammone, Nadia ;
Hussain, Amir ;
Morabito, Francesco C. .
NEURAL NETWORKS, 2020, 123 :176-190
[32]   Neuropsychological test selection for cognitive impairment classification: A machine learning approach [J].
Weakley, Alyssa ;
Williams, Jennifer A. ;
Schmitter-Edgecombe, Maureen ;
Cook, Diane J. .
JOURNAL OF CLINICAL AND EXPERIMENTAL NEUROPSYCHOLOGY, 2015, 37 (09) :899-916
[33]   HepNet: Deep Neural Network for Classification of Early-Stage Hepatic Steatosis Using Microwave Signals [J].
Hasan, Sazid ;
Brankovic, Aida ;
Awal, Md Abdul ;
Rezaeieh, Sasan Ahdi ;
Keating, Shelley E. ;
Abbosh, Amin M. ;
Zamani, Ali .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, 29 (01) :142-151
[34]   Microgrid Fault Detection and Classification: Machine Learning Based Approach, Comparison, and Reviews [J].
Rahman Fahim, Shahriar ;
K. Sarker, Subrata ;
Muyeen, S. M. ;
Sheikh, Md. Rafiqul Islam ;
Das, Sajal K. .
ENERGIES, 2020, 13 (13)
[35]   Classification Framework for Healthy Hairs and Alopecia Areata: A Machine Learning (ML) Approach [J].
Shakeel, Choudhary Sobhan ;
Khan, Saad Jawaid ;
Chaudhry, Beenish ;
Aijaz, Syeda Fatima ;
Hassan, Umer .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
[36]   Classification of Diabetic Foot Ulcers from Images Using Machine Learning Approach [J].
Almufadi, Nouf ;
Alhasson, Haifa F. .
DIAGNOSTICS, 2024, 14 (16)
[37]   Detection of EGFR mutations in early-stage lung adenocarcinoma by machine learning-based radiomics [J].
Omura, Kenshiro ;
Murakami, Yu ;
Hashimoto, Kohei ;
Takahashi, Hikaru ;
Suzuki, Ryoko ;
Yoshioka, Yasuo ;
Oguchi, Masahiko ;
Ichinose, Junji ;
Matsuura, Yosuke ;
Nakao, Masayuki ;
Okumura, Sakae ;
Mun, Mingyon .
TRANSLATIONAL CANCER RESEARCH, 2023, 12 (04) :837-+
[38]   A novel approach for skin lesion symmetry classification with a deep learning model [J].
Talavera-Martinez, Lidia ;
Bibiloni, Pedro ;
Giacaman, Aniza ;
Taberner, Rosa ;
Del Pozo Hernando, Luis Javier ;
Gonzalez-Hidalgo, Manuel .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
[39]   International Classification of Functioning, Disability and Health Core Set construction in systemic sclerosis and other rheumatic diseases: a EUSTAR initiative [J].
Saketkoo, Lesley Ann ;
Escorpizo, Reuben ;
Keen, Kevin J. ;
Fligelstone, Kim ;
Distler, Oliver .
RHEUMATOLOGY, 2012, 51 (12) :2170-2176
[40]   A Study on Machine Learning-Based Feature Classification for the Early Diagnosis of Blade Rubbing [J].
Park, Dong-hee ;
Choi, Byeong-keun .
SENSORS, 2024, 24 (18)