Network based stratification of major cancers by integrating somatic mutation and gene expression data

被引:18
作者
He, Zongzhen [1 ]
Zhang, Junying [1 ]
Yuan, Xiguo [1 ]
Liu, Zhaowen [1 ]
Liu, Baobao [1 ]
Tuo, Shouheng [1 ]
Liu, Yajun [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
关键词
NONNEGATIVE MATRIX FACTORIZATION; MUTANT LUNG ADENOCARCINOMAS; CLASSIFICATION; PREDICTION; INHIBITOR; DISCOVERY; CETUXIMAB; DISEASE;
D O I
10.1371/journal.pone.0177662
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The stratification of cancer into subtypes that are significantly associated with clinical outcomes is beneficial for targeted prognosis and treatment. In this study, we integrated somatic mutation and gene expression data to identify clusters of patients. In contrast to previous studies, we constructed cancer-type-specific significant co-expression networks (SCNs) rather than using a fixed gene network across all cancers, such as the network-based stratification (NBS) method, which ignores cancer heterogeneity. For each type of cancer, the gene expression data were used to construct the SCN network, while the gene somatic mutation data were mapped onto the network, propagated, and used for further clustering. For the clustering, we adopted an improved network-regularized non-negative matrix factorization (netNMF) (netNMF_HC) for a more precise classification. We applied our method to various datasets, including ovarian cancer (OV), lung adenocarcinoma (LUAD) and uterine corpus endometrial carcinoma (UCEC) cohorts derived from the TCGA (The Cancer Genome Atlas) project. Based on the results, we evaluated the performance of our method to identify survival-relevant subtypes and further compared it to the NBS method, which adopts priori networks and netNMF algorithm. The proposed algorithm outperformed the NBS method in identifying informative cancer subtypes that were significantly associated with clinical outcomes in most cancer types we studied. In particular, our method identified survival-associated UCEC subtypes that were not identified by the NBS method. Our analysis indicated valid subtyping of patient could be applied by mutation data with cancer-type-specific SCNs and netNMF_HC for individual cancers because of specific cancer co-expression patterns and more precise clustering.
引用
收藏
页数:12
相关论文
共 30 条
[1]   Tumor stratification by a novel graph-regularized bi-clique finding algorithm [J].
Adl, Amin Ahmadi ;
Qian, Xiaoning .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 57 :3-11
[2]   Integrated genomic analyses of ovarian carcinoma [J].
Bell, D. ;
Berchuck, A. ;
Birrer, M. ;
Chien, J. ;
Cramer, D. W. ;
Dao, F. ;
Dhir, R. ;
DiSaia, P. ;
Gabra, H. ;
Glenn, P. ;
Godwin, A. K. ;
Gross, J. ;
Hartmann, L. ;
Huang, M. ;
Huntsman, D. G. ;
Iacocca, M. ;
Imielinski, M. ;
Kalloger, S. ;
Karlan, B. Y. ;
Levine, D. A. ;
Mills, G. B. ;
Morrison, C. ;
Mutch, D. ;
Olvera, N. ;
Orsulic, S. ;
Park, K. ;
Petrelli, N. ;
Rabeno, B. ;
Rader, J. S. ;
Sikic, B. I. ;
Smith-McCune, K. ;
Sood, A. K. ;
Bowtell, D. ;
Penny, R. ;
Testa, J. R. ;
Chang, K. ;
Dinh, H. H. ;
Drummond, J. A. ;
Fowler, G. ;
Gunaratne, P. ;
Hawes, A. C. ;
Kovar, C. L. ;
Lewis, L. R. ;
Morgan, M. B. ;
Newsham, I. F. ;
Santibanez, J. ;
Reid, J. G. ;
Trevino, L. R. ;
Wu, Y. -Q. ;
Wang, M. .
NATURE, 2011, 474 (7353) :609-615
[3]   Non-negative Matrix Factorization on Manifold [J].
Cai, Deng ;
He, Xiaofei ;
Wu, Xiaoyun ;
Han, Jiawei .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :63-+
[4]   Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks [J].
Carlson, MRJ ;
Zhang, B ;
Fang, ZX ;
Mischel, PS ;
Horvath, S ;
Nelson, SF .
BMC GENOMICS, 2006, 7 (1)
[5]   EGFR-Mutant Lung Adenocarcinomas Treated First-Line with the Novel EGFR Inhibitor, XL647, Can Subsequently Retain Moderate Sensitivity to Erlotinib [J].
Chmielecki, Juliann ;
Pietanza, M. Catherine ;
Aftab, Dana ;
Shen, Ronglai ;
Zhao, Zhiguo ;
Chen, Xi ;
Hutchinson, Katherine ;
Viale, Agnes ;
Kris, Mark G. ;
Stout, Thomas ;
Miller, Vincent ;
Rizvi, Naiyer ;
Pao, William .
JOURNAL OF THORACIC ONCOLOGY, 2012, 7 (02) :434-442
[6]   A pattern recognition approach to infer time-lagged genetic interactions [J].
Chuang, Cheng-Long ;
Jen, Chih-Hung ;
Chen, Chung-Ming ;
Shieh, Grace S. .
BIOINFORMATICS, 2008, 24 (09) :1183-1190
[7]   Multiplatform Analysis of 12 Cancer Types Reveals Molecular Classification within and across Tissues of Origin [J].
Hoadley, Katherine A. ;
Yau, Christina ;
Wolf, Denise M. ;
Cherniack, Andrew D. ;
Tamborero, David ;
Ng, Sam ;
Leiserson, Max D. M. ;
Niu, Beifang ;
McLellan, Michael D. ;
Uzunangelov, Vladislav ;
Zhang, Jiashan ;
Kandoth, Cyriac ;
Akbani, Rehan ;
Shen, Hui ;
Omberg, Larsson ;
Chu, Andy ;
Margolin, Adam A. ;
van't Veer, Laura J. ;
Lopez-Bigas, Nuria ;
Laird, Peter W. ;
Raphael, Benjamin J. ;
Ding, Li ;
Robertson, A. Gordon ;
Byers, Lauren A. ;
Mills, Gordon B. ;
Weinstein, John N. ;
Van Waes, Carter ;
Chen, Zhong ;
Collisson, Eric A. ;
Benz, Christopher C. ;
Perou, Charles M. ;
Stuart, Joshua M. ;
Abbott, Rachel ;
Abbott, Scott ;
Aksoy, B. Arman ;
Aldape, Kenneth ;
Ally, Adrian ;
Amin, Samirkumar ;
Anastassiou, Dimitris ;
Auman, J. Todd ;
Baggerly, Keith A. ;
Balasundaram, Miruna ;
Balu, Saianand ;
Baylin, Stephen B. ;
Benz, Stephen C. ;
Berman, Benjamin P. ;
Bernard, Brady ;
Bhatt, Ami S. ;
Birol, Inanc ;
Black, Aaron D. .
CELL, 2014, 158 (04) :929-944
[8]  
Hofree M., 2013, NATURE METHODS, V10
[9]   Protein networks in disease [J].
Ideker, Trey ;
Sharan, Roded .
GENOME RESEARCH, 2008, 18 (04) :644-652
[10]   Identification of microRNAs with regulatory potential using a matched microRNA-mRNA time-course data [J].
Jayaswal, Vivek ;
Lutherborrow, Mark ;
Ma, David D. F. ;
Yang, Yee Hwa .
NUCLEIC ACIDS RESEARCH, 2009, 37 (08)