An introduction to machine learning and analysis of its use in rheumatic diseases

被引:48
作者
Kingsmore, Kathryn M. [1 ]
Puglisi, Christopher E. [1 ]
Grammer, Amrie C. [1 ]
Lipsky, Peter E. [1 ]
机构
[1] AMPEL BioSolut & RILITE Res Inst, Charlottesville, VA 22902 USA
关键词
ELECTRONIC MEDICAL-RECORDS; PRIMARY SJOGRENS-SYNDROME; GENOME-WIDE ASSOCIATION; LOGISTIC-REGRESSION; CARDIOVASCULAR RISK; FEATURE-SELECTION; ENSEMBLE METHODS; LARGE-SCALE; ARTHRITIS; CLASSIFICATION;
D O I
10.1038/s41584-021-00708-w
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Machine learning (ML) is a computerized analytical technique that is being increasingly employed in biomedicine. ML often provides an advantage over explicitly programmed strategies in the analysis of multidimensional information by recognizing relationships in the data that were not previously appreciated. As such, the use of ML in rheumatology is increasing, and numerous studies have employed ML to classify patients with rheumatic autoimmune inflammatory diseases (RAIDs) from medical records and imaging, biometric or gene expression data. However, these studies are limited by sample size, the accuracy of sample labelling, and absence of datasets for external validation. In addition, there is potential for ML models to overfit or underfit the data and, thereby, these models might produce results that cannot be replicated in an unrelated dataset. In this Review, we introduce the basic principles of ML and discuss its current strengths and weaknesses in the classification of patients with RAIDs. Moreover, we highlight the successful analysis of the same type of input data (for example, medical records) with different algorithms, illustrating the potential plasticity of this analytical approach. Altogether, a better understanding of ML and the future application of advanced analytical techniques based on this approach, coupled with the increasing availability of biomedical data, may facilitate the development of meaningful precision medicine for patients with RAIDs. In this Review, the authors provide an introduction to machine learning and discuss the use of this approach in rheumatic autoimmune inflammatory diseases, including the classification of patients based on medical records or molecular characteristics, identification of novel biomarkers or drug repurposing candidates and prediction of disease progression or onset.
引用
收藏
页码:710 / 730
页数:21
相关论文
共 213 条
[1]   Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI) [J].
Adadi, Amina ;
Berrada, Mohammed .
IEEE ACCESS, 2018, 6 :52138-52160
[2]   Lupus or not? SLE Risk Probability Index (SLERPI): a simple, clinician-friendly machine learning-based model to assist the diagnosis of systemic lupus erythematosus [J].
Adamichou, Christina ;
Genitsaridi, Irini ;
Nikolopoulos, Dionysis ;
Nikoloudaki, Myrto ;
Repa, Argyro ;
Bortoluzzi, Alessandra ;
Fanouriakis, Antonis ;
Sidiropoulos, Prodromos ;
Boumpas, Dimitrios T. ;
Bertsias, George K. .
ANNALS OF THE RHEUMATIC DISEASES, 2021, 80 (06) :758-766
[3]  
AHA DW, 1991, MACH LEARN, V6, P37, DOI 10.1007/BF00153759
[4]  
Alasadi S. A., 2017, J ENG APPL SCI, V12, DOI DOI 10.3923/JEASCI.2017.4102.4107
[5]   Machine learning and feature selection for drug response prediction in precision oncology applications [J].
Ali M. ;
Aittokallio T. .
Biophysical Reviews, 2019, 11 (1) :31-39
[6]   Deep Learning Applications for Predicting Pharmacological Properties of Drugs and Drug Repurposing Using Transcriptomic Data [J].
Aliper, Alexander ;
Plis, Sergey ;
Artemov, Artem ;
Ulloa, Alvaro ;
Mamoshina, Polina ;
Zhavoronkov, Alex .
MOLECULAR PHARMACEUTICS, 2016, 13 (07) :2524-2530
[7]  
Aljuaid T, 2016, PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON DATA SCIENCE & ENGINEERING (ICDSE), P146
[8]   Novel risk genes for systemic lupus erythematosus predicted by random forest classification [J].
Almlof, Jonas Carlsson ;
Alexsson, Andrei ;
Imgenberg-Kreuz, Juliana ;
Sylwan, Lina ;
Backlin, Christofer ;
Leonard, Dag ;
Nordmark, Gunnel ;
Tandre, Karolina ;
Eloranta, Maija-Leena ;
Padyukov, Leonid ;
Bengtsson, Christine ;
Jonsen, Andreas ;
Dahlqvist, Solbritt Rantapaa ;
Sjowall, Christopher ;
Bengtsson, Anders A. ;
Gunnarsson, Iva ;
Svenungsson, Elisabet ;
Ronnblom, Lars ;
Sandling, Johanna K. ;
Syvanen, Ann-Christine .
SCIENTIFIC REPORTS, 2017, 7
[9]  
Alpaydin E., 2009, Introduction to machine learning
[10]   POINTS OF SIGNIFICANCE Ensemble methods: bagging and random forests [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2017, 14 (10) :933-934