Feature screening for survival trait with application to TCGA high-dimensional genomic data

被引:1
|
作者
Wang, Jie-Huei [1 ]
Li, Cai-Rong [1 ]
Hou, Po-Lin [1 ]
机构
[1] Feng Chia Univ, Dept Stat, Taichung, Taiwan
来源
PEERJ | 2022年 / 10卷
关键词
Survival feature screening; High-dimensional genomic data; Network; Survival prediction; TCGA; Esophageal cancer; Pancreatic cancer; Head and neck squamous cell carcinoma; Lung adenocarcinoma; Breast invasive carcinoma; VARIABLE SELECTION; GENES; SIGNATURE; IDENTIFICATION; VALIDATION; MODELS; HEAD;
D O I
10.7717/peerj.13098
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: In high-dimensional survival genomic data, identifying cancer-related genes is a challenging and important subject in the field of bioinformatics. In recent years, many feature screening approaches for survival outcomes with high-dimensional survival genomic data have been developed; however, few studies have systematically compared these methods. The primary purpose of this article is to conduct a series of simulation studies for systematic comparison; the second purpose of this article is to use these feature screening methods to further establish a more accurate prediction model for patient survival based on the survival genomic datasets of The Cancer Genome Atlas (TCGA). Results: Simulation studies prove that network-adjusted feature screening measurement performs well and outperforms existing popular univariate independent feature screening methods. In the application of real data, we show that the proposed network-adjusted feature screening approach leads to more accurate survival prediction than alternative methods that do not account for gene-gene dependency information. We also use TCGA clinical survival genetic data to identify biomarkers associated with clinical survival outcomes in patients with various cancers including esophageal, pancreatic, head and neck squamous cell, lung, an d breast invasive carcinomas. Conclusions: These applications reveal advantages of the new proposed network-adjusted feature selection method over alternative methods that do not consider gene-gene dependency information. We also identify cancer-related genes that are almost detected in the literature. As a result, the network-based screening method is reliable and credible.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Overlapping group screening for detection of gene-environment interactions with application to TCGA high-dimensional survival genomic data
    Wang, Jie-Huei
    Wang, Kang-Hsin
    Chen, Yi-Hau
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [2] Overlapping group screening for detection of gene-environment interactions with application to TCGA high-dimensional survival genomic data
    Jie-Huei Wang
    Kang-Hsin Wang
    Yi-Hau Chen
    BMC Bioinformatics, 23
  • [3] Robust feature screening for high-dimensional survival data
    Hao, Meiling
    Lin, Yuanyuan
    Liu, Xianhui
    Tang, Wenlu
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (06) : 979 - 994
  • [4] Overlapping group screening for binary cancer classification with TCGA high-dimensional genomic data
    Wang, Jie-Huei
    Chen, Yi-Hau
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2023,
  • [5] Network-adjusted Kendall's Tau Measure for Feature Screening with Application to High-dimensional Survival Genomic Data
    Wang, Jie-Huei
    Chen, Yi-Hau
    BIOINFORMATICS, 2021, 37 (15) : 2150 - 2156
  • [6] Model-free feature screening for high-dimensional survival data
    Lin, Yuanyuan
    Liu, Xianhui
    Hao, Meiling
    SCIENCE CHINA-MATHEMATICS, 2018, 61 (09) : 1617 - 1636
  • [7] Feature screening with large-scale and high-dimensional survival data
    Yi, Grace Y.
    He, Wenqing
    Carroll, Raymond. J.
    BIOMETRICS, 2022, 78 (03) : 894 - 907
  • [8] Model-free feature screening for high-dimensional survival data
    Yuanyuan Lin
    Xianhui Liu
    Meiling Hao
    ScienceChina(Mathematics), 2018, 61 (09) : 79 - 98
  • [9] Model-free feature screening for high-dimensional survival data
    Yuanyuan Lin
    Xianhui Liu
    Meiling Hao
    Science China Mathematics, 2018, 61 : 1617 - 1636
  • [10] Feature Screening for High-Dimensional Survival Data via Censored Quantile Correlation
    XU Kai
    HUANG Xudong
    Journal of Systems Science & Complexity, 2021, 34 (03) : 1207 - 1224