Identifying Stages of Kidney Renal Cell Carcinoma by Combining Gene Expression and DNA Methylation Data

被引:30
作者
Deng, Su-Ping [1 ]
Cao, Shaolong [2 ]
Huang, De-Shuang [1 ]
Wang, Yu-Ping [3 ,4 ]
机构
[1] Tongji Univ, Coll Elect & Informat Engn, Inst Machine Learning & Syst Biol, Caoan Rd 4800, Shanghai 201804, Peoples R China
[2] Tulane Univ, Sch Sci & Engn, 6823 St Charles Ave, New Orleans, LA 70118 USA
[3] Tongji Univ, Coll Elect & Informat Engn, Caoan Rd 4800, Shanghai 201804, Peoples R China
[4] Tulane Univ, Sch Sci & Engn, New Orleans, LA 70118 USA
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
Cancer stage prediction; classification; kidney cancer; network fusion; data integration; TUMOR CLASSIFICATION; COMPONENT ANALYSIS; NETWORKS; REGRESSION; SELECTION; FUSION; PATHS;
D O I
10.1109/TCBB.2016.2607717
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this study, in order to take advantage of complementary information from different types of data for better disease status diagnosis, we combined gene expression with DNA methylation data and generated a fused network, based on which the stages of Kidney Renal Cell Carcinoma (KIRC) can be better identified. It is well recognized that a network is important for investigating the connectivity of disease groups. We exploited the potential of the network's features to identify the KIRC stage. We first constructed a patient network from each type of data. We then built a fused network based on network fusion method. Based on the link weights of patients, we used a generalized linear model to predict the group of KIRC subjects. Finally, the group prediction method was applied to test the power of network-based features. The performance (e.g., the accuracy of identifying cancer stages) when using the fused network from two types of data is shown to be superior to that when using two patient networks from only one data type. The work provides a good example for using network based features from multiple data types for a more comprehensive diagnosis.
引用
收藏
页码:1147 / 1153
页数:7
相关论文
共 30 条
[1]  
American Cancer Society, 2016, WHAT IS KIDN CANC
[2]  
[Anonymous], 2014, PROBABILISTIC REASON
[3]  
[Anonymous], 2007, MOL SYST BIOL
[4]  
[Anonymous], 2002, Advances in Neural Information Processing Systems
[5]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[6]   Regularization and semi-supervised learning on large graphs [J].
Belkin, M ;
Matveeva, I ;
Niyogi, P .
LEARNING THEORY, PROCEEDINGS, 2004, 3120 :624-638
[7]   Network-based classification of breast cancer metastasis [J].
Chuang, Han-Yu ;
Lee, Eunjung ;
Liu, Yu-Tsueng ;
Lee, Doheon ;
Ideker, Trey .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[8]   Sparse partial least squares regression for simultaneous dimension reduction and variable selection [J].
Chun, Hyonho ;
Keles, Suenduez .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2010, 72 :3-25
[9]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[10]   A K-NEAREST NEIGHBOR CLASSIFICATION RULE-BASED ON DEMPSTER-SHAFER THEORY [J].
DENOEUX, T .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1995, 25 (05) :804-813