Identifying Genes to Predict Cancer Radiotherapy-Related Fatigue with Machine-Learning Methods

被引:1
|
作者
Du, Wei [1 ]
Dickinson, Kristin [2 ]
Johnson, Calvin A. [1 ]
Saligan, Leorey N. [2 ]
机构
[1] NIH, Off Intramural Res, Ctr Informat Technol, Bldg 10, Bethesda, MD 20892 USA
[2] NIH, Symptom Management Branch, Bldg 10, Bethesda, MD 20892 USA
关键词
D O I
10.1145/3233547.3233636
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Fatigue is one of the most commonly reported and the most distressing side effects reported in radiotherapy (RT) [1]. Since RT-related fatigue does not occur in every patient, in the context of individualizing care, the ability to predict fatigue risk has the potential to help guide treatment choices for patients and providers. Many factors influence the fatigue experienced by patients. However, it is becoming increasingly clear that RT-related fatigue is strongly related to a series of underlying genetically controlled biological events. We hypothesize that expression of genes related to oxidative stress produced from the toxic therapeutic regimen can be predictive of RT-related fatigue. A cohort of 15 non-metastatic prostate cancer patients receiving external beam RT (EBRT) were enrolled in a NIH IRB-approved study (NCT00852111). Fatigue scores, as measured by the Functional Assessment of Cancer Therapy- Fatigue (FACT-F) and blood were obtained at baseline, prior to EBRT (T1), as well as one-month post EBRT (T2). A lower FACT-F score indicates greater fatigue symptoms. Participants were categorized into fatigue groups, based on a >= 3 decrease in FACT-F score from T1 to T2 (worsening fatigue). The expression profile of 84 genes related to oxidative stress was measured from collected blood samples using the RT2 Profiler Human Oxidative Stress Plus PCR Array (Qiagen, Inc., Valencia, CA). Of the 15 men enrolled in this study, 7 were high fatigue (HF) and 8 were low fatigue (LF) at T2 [2]. We present a two-phase scheme which first selects a limited subset of genes deemed most predictive by a regularized linear regression method known as elastic net, followed by a widely used classifier, the regularized random forest (RRF), to discriminate patients having HF from LF. The elastic net and RRF were trained with T1 and T2 data as well as with T1-only data. We compared the results of these two workflows to impute the significant genes and to assess predictive ability of the T1-only data. In addition to PCR data, clinical information and demographic data were collected at T1 and T2. The feature vectors for all models consist of demographic and clinical characteristics, such as age, depressive symptom scores, blood hemoglobin level, body-mass index; and normalized gene-expression values from PCR. There are a total of 154 features from T1 & T2 (76 features from T1 alone). In evaluating the suitability of the proposed scheme, we first tested the efficacy of the tandem of elastic net (for feature selection) followed by RRF (for prediction) compared with elastic net alone and RRF alone (each doing both feature selection and prediction). In the case of elastic net + RRF, features were selected from an elastic net trained on all data; the performance numbers were obtained from leave-one-out cross-validation of the subsequent RRF trained on only the important features. Results show that the tandem approach out-performs either single-model approach. Interestingly, using the elastic net for feature selection boosts the AUC from 0.75 to 0.80. The same trend was observed when T1-only data is used to train the models. The AUC of the T1 tandem is boosted (compared with RRF alone) from 0.62 to 0.66. Removing the T2 information from training makes the problem more challenging, introducing false positives into the prediction. In summary, we recommend a two-phase scheme consisting of an elastic net, which first selects a limited subset of most predictive genes followed by RRF, which then discriminates patients having HF from LF. The scheme improves performance in cross-validation compared with RRF alone or elastic net alone. Several genes are consistently selected, including as PRDX5, FHL2 and GPX4. This may provide clues regarding the cause of RT-related fatigue.
引用
收藏
页码:527 / 527
页数:1
相关论文
共 50 条
  • [21] Monitoring DNA damage response to predict radiotherapy-related normal tissue toxicity
    Lobachevsky, Pavel
    Leong, Trevor
    Daly, Patricia
    Campbell, Ian G.
    Thompson, Ella R.
    Li, Na
    Smith, Jai
    Best, Nickala
    Martin, Roger F.
    Martin, Olga A.
    INTERNATIONAL JOURNAL OF MOLECULAR MEDICINE, 2015, 36 : S103 - S103
  • [22] Radiotherapy-related fatigue: How to assess and how to treat the symptom. A commentary
    Jereczek-Fossa, BA
    Marsiglia, HR
    Orecchia, R
    TUMORI, 2001, 87 (03) : 147 - 151
  • [23] Identifying essential genes in bacterial metabolic networks with machine learning methods
    Plaimas, Kitiporn
    Eils, Roland
    Koenig, Rainer
    BMC SYSTEMS BIOLOGY, 2010, 4
  • [24] Radiotherapy-Related Clinical and Functional Sequelae in Lung Cancer Survivors
    Heredia-Ciuro, Alejandro
    Blasco-Valls, Paula
    Calvache-Mateo, Andres
    Valenza-Pena, Geraldine
    Lopez-Lopez, Laura
    Ortiz-Rubio, Araceli
    Valenza, Marie Carmen
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [25] Current trends in machine-learning methods applied to spectroscopic cancer diagnosis
    Sattlecker, Martina
    Stone, Nicholas
    Bessant, Conrad
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2014, 59 : 17 - 25
  • [26] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui, Doha
    Haidi, Touria
    Elmariami, Faissal
    Derri, Mounir
    Mellouli, El Mehdi
    GLOBAL ENERGY INTERCONNECTION-CHINA, 2023, 6 (06): : 726 - 737
  • [27] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui D.
    Haidi T.
    Elmariami F.
    Derri M.
    Mellouli E.M.
    Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [28] Application of four machine-learning methods to predict short-horizon wind energy
    Doha Bouabdallaoui
    Touria Haidi
    Faissal Elmariami
    Mounir Derri
    El Mehdi Mellouli
    Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [29] Machine-learning prediction of a novel diagnostic model using mitochondria-related genes for patients with bladder cancer
    Li, Jian
    Wang, Zhiyong
    Wang, Tianen
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [30] A comparative analysis of unsupervised machine-learning methods in PSG-related phenotyping
    Ghorvei, Mohammadreza
    Karhu, Tuomas
    Hietakoste, Salla
    Ferreira-Santos, Daniela
    Hrubos-Strom, Harald
    Islind, Anna Sigridur
    Biedebach, Luka
    Nikkonen, Sami
    Leppaenen, Timo
    Rusanen, Matias
    JOURNAL OF SLEEP RESEARCH, 2024,