Predicting the Progression from Asymptomatic to Symptomatic Multiple Myeloma and Stage Classification Using Gene Expression Data

被引:0
|
作者
Karathanasis, Nestoras [1 ]
Spyrou, George M. [1 ]
机构
[1] Cyprus Inst Neurol & Genet, Bioinformat Dept, 6 Iroon Ave, CY-2371 Nicosia, Cyprus
关键词
multiple myeloma; cancer; gammopathies; progression; machine learning; MONOCLONAL GAMMOPATHY; UNDETERMINED SIGNIFICANCE; LONG-TERM; RISK; ABNORMALITIES; PREVALENCE; PROGNOSIS; CRITERIA; MODELS;
D O I
10.3390/cancers17020332
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: The accurate staging of multiple myeloma (MM) is essential for optimizing treatment strategies, while predicting the progression of asymptomatic patients, also referred to as monoclonal gammopathy of undetermined significance (MGUS), to symptomatic MM remains a significant challenge due to limited data. This study aimed to develop machine learning models to enhance MM staging accuracy and stratify asymptomatic patients by their risk of progression. Methods: We utilized gene expression microarray datasets to develop machine learning models, combined with various data transformations. For multiple myeloma staging, models were trained on a single dataset and validated across five independent datasets, with performance evaluated using multiclass area under the curve (AUC) metrics. To predict progression in asymptomatic patients, we employed two approaches: (1) training models on a dataset comprising asymptomatic patients who either progressed or remained stable without progressing to multiple myeloma, and (2) training models on multiple datasets combining asymptomatic and multiple myeloma samples and then testing their ability to distinguish between asymptomatic and asymptomatic that progressed. We performed feature selection and enrichment analyses to identify key signaling pathways underlying disease stages and progression. Results: Multiple myeloma staging models demonstrated high efficacy, with ElasticNet achieving consistent multiclass AUC values of 0.9 across datasets and transformations, demonstrating robust generalizability. For asymptomatic progression, both modeling approaches yielded similar results, with AUC values exceeding 0.8 across datasets and algorithms (ElasticNet, Boosting, and Support Vector Machines), underscoring their potential in identifying progression risk. Enrichment analyses revealed key pathways, including PI3K-Akt, MAPK, Wnt, and mTOR, as central to MM pathogenesis. Conclusions: To the best of our knowledge, this is the first study to utilize gene expression datasets for classifying patients across different stages of multiple myeloma and to integrate multiple myeloma with asymptomatic cases to predict disease progression, offering a novel methodology with potential clinical applications in patient monitoring and early intervention.
引用
收藏
页数:19
相关论文
共 50 条
  • [32] A survey of methods for classification of gene expression data using evolutionary algorithms
    Wahde, M
    Szallasi, Z
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2006, 6 (01) : 101 - 110
  • [33] Cancer Classification of Gene Expression Data using Machine Learning Models
    De Guia, Joseph M.
    Devaraj, Madhavi
    Vea, Larry A.
    2018 IEEE 10TH INTERNATIONAL CONFERENCE ON HUMANOID, NANOTECHNOLOGY, INFORMATION TECHNOLOGY, COMMUNICATION AND CONTROL, ENVIRONMENT AND MANAGEMENT (HNICEM), 2018,
  • [34] Gene expression data classification using locally linear discriminant embedding
    Li, Bo
    Zheng, Chun-Hou
    Huang, De-Shuang
    Zhang, Lei
    Han, Kyungsook
    COMPUTERS IN BIOLOGY AND MEDICINE, 2010, 40 (10) : 802 - 810
  • [35] Bayesian variable selection for disease classification using gene expression data
    Yang Ai-Jun
    Song Xin-Yuan
    BIOINFORMATICS, 2010, 26 (02) : 215 - 222
  • [36] Classification of Leukemia Gene Expression Data Using Particle Swarm Optimization
    Liu, Yajie
    Shi, Xinling
    An, Zhenzhou
    2012 SIXTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING (ICGEC), 2012, : 241 - 244
  • [37] Using Prognosis-Related Gene Expression Signature and Connectivity Map for Personalized Drug Repositioning in Multiple Myeloma
    Zhu, Fang-xiao
    He, Yu-chan
    Zhang, Jun-yan
    Wang, Hang-Fei
    Zhong, Chen
    Wang, Xiao-tao
    MEDICAL SCIENCE MONITOR, 2019, 25 : 3247 - 3255
  • [38] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [39] Classification of breast cancer using microarray gene expression data: A survey
    Abd-Elnaby, Muhammed
    Alfonse, Marco
    Roushdy, Mohamed
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [40] Artificial neural network model for effective cancer classification using microarray gene expression data
    Ashok Kumar Dwivedi
    Neural Computing and Applications, 2018, 29 : 1545 - 1554