Predicting the Progression from Asymptomatic to Symptomatic Multiple Myeloma and Stage Classification Using Gene Expression Data

被引:0
|
作者
Karathanasis, Nestoras [1 ]
Spyrou, George M. [1 ]
机构
[1] Cyprus Inst Neurol & Genet, Bioinformat Dept, 6 Iroon Ave, CY-2371 Nicosia, Cyprus
关键词
multiple myeloma; cancer; gammopathies; progression; machine learning; MONOCLONAL GAMMOPATHY; UNDETERMINED SIGNIFICANCE; LONG-TERM; RISK; ABNORMALITIES; PREVALENCE; PROGNOSIS; CRITERIA; MODELS;
D O I
10.3390/cancers17020332
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background: The accurate staging of multiple myeloma (MM) is essential for optimizing treatment strategies, while predicting the progression of asymptomatic patients, also referred to as monoclonal gammopathy of undetermined significance (MGUS), to symptomatic MM remains a significant challenge due to limited data. This study aimed to develop machine learning models to enhance MM staging accuracy and stratify asymptomatic patients by their risk of progression. Methods: We utilized gene expression microarray datasets to develop machine learning models, combined with various data transformations. For multiple myeloma staging, models were trained on a single dataset and validated across five independent datasets, with performance evaluated using multiclass area under the curve (AUC) metrics. To predict progression in asymptomatic patients, we employed two approaches: (1) training models on a dataset comprising asymptomatic patients who either progressed or remained stable without progressing to multiple myeloma, and (2) training models on multiple datasets combining asymptomatic and multiple myeloma samples and then testing their ability to distinguish between asymptomatic and asymptomatic that progressed. We performed feature selection and enrichment analyses to identify key signaling pathways underlying disease stages and progression. Results: Multiple myeloma staging models demonstrated high efficacy, with ElasticNet achieving consistent multiclass AUC values of 0.9 across datasets and transformations, demonstrating robust generalizability. For asymptomatic progression, both modeling approaches yielded similar results, with AUC values exceeding 0.8 across datasets and algorithms (ElasticNet, Boosting, and Support Vector Machines), underscoring their potential in identifying progression risk. Enrichment analyses revealed key pathways, including PI3K-Akt, MAPK, Wnt, and mTOR, as central to MM pathogenesis. Conclusions: To the best of our knowledge, this is the first study to utilize gene expression datasets for classifying patients across different stages of multiple myeloma and to integrate multiple myeloma with asymptomatic cases to predict disease progression, offering a novel methodology with potential clinical applications in patient monitoring and early intervention.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] MetastaSite: Predicting metastasis to different sites using deep learning with gene expression data
    Albaradei, Somayah
    Albaradei, Abdurhman
    Alsaedi, Asim
    Uludag, Mahmut
    Thafar, Maha A.
    Gojobori, Takashi
    Essack, Magbubah
    Gao, Xin
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [22] Statistical characterization and classification of colon microarray gene expression data using multiple machine learning paradigms
    Maniruzzaman, Md
    Rahman, Md Jahanur
    Ahammed, Benojir
    Abedin, Md Menhazul
    Suri, Harman S.
    Biswas, Mainak
    El-Baz, Ayman
    Bangeas, Petros
    Tsoulfas, Georgios
    Suri, Jasjit S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 176 : 173 - 193
  • [23] Sparse Representation for Classification of Tumors Using Gene Expression Data
    Hang, Xiyi
    Wu, Fang-Xiang
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2009,
  • [24] Clinical value of molecular subtyping multiple myeloma using gene expression profiling
    Weinhold, N.
    Heuck, C. J.
    Rosenthal, A.
    Thanendrarajan, S.
    Stein, C. K.
    Van Rhee, F.
    Zangari, M.
    Hoering, A.
    Tian, E.
    Davies, F. E.
    Barlogie, B.
    Morgan, G. J.
    LEUKEMIA, 2016, 30 (02) : 423 - 430
  • [25] Discriminant Projection Shared Dictionary Learning for Classification of Tumors Using Gene Expression Data
    Peng, Shaoliang
    Yang, Yaning
    Liu, Wei
    Li, Fei
    Liao, Xiangke
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (04) : 1464 - 1473
  • [26] Classification of Microarray Gene Expression Data Using an Infiltration Tactics Optimization (ITO) Algorithm
    Zahoor, Javed
    Zafar, Kashif
    GENES, 2020, 11 (07) : 1 - 28
  • [27] GENE EXPRESSION DATA CLASSIFICATION AND PATTERN ANALYSIS USING DATA DRIVEN APPROACH
    Ramisa, Aiman Jabeen
    Hossain, Ananna
    Islam, S. K. Md Injamul
    Swadesh, Ponuel Mollah
    Islam, Md Toushif
    Rahman, Md Anisur
    Parvez, Mohammad Zavid
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2021, : 82 - 90
  • [28] Study on the Relationship Between the Expression of B Cell Mature Antigen and the Classification, Stage, and Prognostic Factors of Multiple Myeloma
    Ma, Tiantian
    Shi, Jing
    Xiao, Yuxia
    Bian, Tianyue
    Wang, Jincheng
    Hui, Lingyun
    Wang, Mengchang
    Liu, Huasheng
    FRONTIERS IN IMMUNOLOGY, 2021, 12
  • [29] Breast and Colon Cancer Classification from Gene Expression Profiles Using Data Mining Techniques
    AbdElNabi, Mohamed Loey Ramadan
    Jasim, Mohammed Wajeeh
    EL-Bakry, Hazem M.
    Taha, Mohamed Hamed N.
    Khalifa, Nour Eldeen M.
    SYMMETRY-BASEL, 2020, 12 (03):
  • [30] Predicting Standard Penetration Test N-value from Cone Penetration Test Data Using Gene Expression Programming
    Alam, Mehtab
    Chen, Jianfeng
    Umar, Muhammad
    Ullah, Faheem
    Shahkar, Muhammad
    GEOTECHNICAL AND GEOLOGICAL ENGINEERING, 2024, 42 (07) : 5587 - 5613