A data-driven interactome of synergistic genes improves network-based cancer outcome prediction

被引:9
作者
Allahyar, Amin [1 ,2 ]
Ubels, Joske [1 ,3 ,4 ]
de Ridder, Jeroen [1 ]
机构
[1] Univ Utrecht, Univ Med Ctr Utrecht, Dept Genet, Ctr Mol Med, Utrecht, Netherlands
[2] Delft Univ Technol, Delft Bioinformat Lab, Fac Elect Engn Math & Comp Sci, Delft, Netherlands
[3] Skyline DX, Rotterdam, Netherlands
[4] Erasmus MC Canc Inst, Dept Hematol, Rotterdam, Netherlands
关键词
PROTEIN-INTERACTION NETWORKS; BREAST-CANCER; SCALE MAP; EXPRESSION; ARCHITECTURE; SELECTION; DISEASE; KINASE; METASTASIS; VALIDATION;
D O I
10.1371/journal.pcbi.1006657
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Robustly predicting outcome for cancer patients from gene expression is an important challenge on the road to better personalized treatment. Network-based outcome predictors (NOPs), which considers the cellular wiring diagram in the classification, hold much promise to improve performance, stability and interpretability of identified marker genes. Problematically, reports on the efficacy of NOPs are conflicting and for instance suggest that utilizing random networks performs on par to networks that describe biologically relevant interactions. In this paper we turn the prediction problem around: instead of using a given biological network in the NOP, we aim to identify the network of genes that truly improves outcome prediction. To this end, we propose SyNet, a gene network constructed ab initio from synergistic gene pairs derived from survival-labelled gene expression data. To obtain SyNet, we evaluate synergy for all 69 million pairwise combinations of genes resulting in a network that is specific to the dataset and phenotype under study and can be used to in a NOP model. We evaluated SyNet and 11 other networks on a compendium dataset of >4000 survival-labelled breast cancer samples. For this purpose, we used cross-study validation which more closely emulates real world application of these outcome predictors. We find that SyNet is the only network that truly improves performance, stability and interpretability in several existing NOPs. We show that SyNet overlaps significantly with existing gene networks, and can be confidently predicted (similar to 85% AUC) from graph-topological descriptions of these networks, in particular the breast tissue-specific network. Due to its data-driven nature, SyNet is not biased to well-studied genes and thus facilitates post-hoc interpretation. We find that SyNet is highly enriched for known breast cancer genes and genes related to e.g. histological grade and tamoxifen resistance, suggestive of a role in determining breast cancer outcome.
引用
收藏
页数:21
相关论文
共 90 条
  • [1] An Integrated Approach to Uncover Drivers of Cancer
    Akavia, Uri David
    Litvin, Oren
    Kim, Jessica
    Sanchez-Garcia, Felix
    Kotliar, Dylan
    Causton, Helen C.
    Pochanard, Panisa
    Mozes, Eyal
    Garraway, Levi A.
    Pe'er, Dana
    [J]. CELL, 2010, 143 (06) : 1005 - 1017
  • [2] De novo pathway-based biomarker identification
    Alcaraz, Nicolas
    List, Markus
    Batra, Richa
    Vandin, Fabio
    Ditzel, Henrik J.
    Baumbach, Jan
    [J]. NUCLEIC ACIDS RESEARCH, 2017, 45 (16)
  • [3] FERAL: network-based classifier with application to breast cancer outcome prediction
    Allahyar, Amin
    de Ridder, Jeroen
    [J]. BIOINFORMATICS, 2015, 31 (12) : 311 - 319
  • [4] Intratumoral heterogeneity as a source of discordance in breast cancer biomarker classification
    Allott, Emma H.
    Geradts, Joseph
    Sun, Xuezheng
    Cohen, Stephanie M.
    Zirpoli, Gary R.
    Khoury, Thaer
    Bshara, Wiam
    Chen, Mengjie
    Sherman, Mark E.
    Palmer, Julie R.
    Ambrosone, Christine B.
    Olshan, Andrew F.
    Troester, Melissa A.
    [J]. BREAST CANCER RESEARCH, 2016, 18
  • [5] Selection bias in gene extraction on the basis of microarray gene-expression data
    Ambroise, C
    McLachlan, GJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
  • [6] [Anonymous], 2004, Introduction to machine learning
  • [7] [Anonymous], 2006, J ROYAL STAT SOC B
  • [8] Cross-study validation for the assessment of prediction algorithms
    Bernau, Christoph
    Riester, Markus
    Boulesteix, Anne-Laure
    Parmigiani, Giovanni
    Huttenhower, Curtis
    Waldron, Levi
    Trippa, Lorenzo
    [J]. BIOINFORMATICS, 2014, 30 (12) : 105 - 112
  • [9] An Expanded View of Complex Traits: From Polygenic to Omnigenic
    Boyle, Evan A.
    Li, Yang I.
    Pritchard, Jonathan K.
    [J]. CELL, 2017, 169 (07) : 1177 - 1186
  • [10] Technical Variability Is Greater than Biological Variability in a Microarray Experiment but Both Are Outweighed by Changes Induced by Stimulation
    Bryant, Penelope A.
    Smyth, Gordon K.
    Robins-Browne, Roy
    Curtis, Nigel
    [J]. PLOS ONE, 2011, 6 (05):