Integration of multi-omics data for survival prediction of lung adenocarcinoma

被引:3
作者
Guo, Dingjie [1 ]
Wang, Yixian [1 ]
Chen, Jing [2 ]
Liu, Xin [1 ]
机构
[1] Jilin Univ, Sch Publ Hlth, Epidemiol & Stat, Changchun 130021, Jilin, Peoples R China
[2] Northeast Normal Univ, Acad Adv Interdisciplinary Studies, Changchun 130024, Peoples R China
关键词
Gene expression; Somatic mutations; Network embedding; Survival prediction; Lung adenocarcinoma; COEXPRESSION NETWORK; CANCER;
D O I
10.1016/j.cmpb.2024.108192
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background and Objective: The morbidity of lung adenocarcinoma (LUAD) has been increasing year by year and the prognosis is poor. This has prompted researchers to study the survival of LUAD patients to ensure that patients can be cured in time or survive after appropriate treatment. There is still no fully valid model that can be applied to clinical practice. Methods: We introduced struc2vec-based multi-omics data integration (SBMOI), which could integrate gene expression, somatic mutations and clinical data to construct mutation gene vectors representing LUAD patient features. Based on the patient features, the random survival forest (RSF) model was used to predict the long- and short-term survival of LUAD patients. To further demonstrate the superiority of SBMOI, we simultaneously replaced scale-free gene co-expression network (FCN) with a protein-protein interaction (PPI) network and a significant co-expression network (SCN) to compare accuracy in predicting LUAD patient survival under the same conditions. Results: Our results suggested that compared with SCN and PPI network, the FCN based SBMOI combined with RSF model had better performance in long- and short-term survival prediction tasks for LUAD patients. The AUC of 1-year, 5-year, and 10-year survival in the validation dataset were 0.791, 0.825, and 0.917, respectively. Conclusions: This study provided a powerful network-based method to multi-omics data integration. SBMOI combined with RSF successfully predicted long- and short-term survival of LUAD patients, especially with high accuracy on long-term survival. Besides, SBMOI algorithm has the potential to combine with other machine learning models to complete clustering or stratificational tasks, and being applied to other diseases.
引用
收藏
页数:8
相关论文
共 36 条
[1]   Screening key prognostic factors and constructing survival prognostic risk prediction model based on ceRNA network in early lung adenocarcinoma [J].
Bai, Juncheng ;
Zhu, Xiaochun ;
Zhang, Jintao ;
Bulin, Baila .
TRANSLATIONAL CANCER RESEARCH, 2021, 10 (11) :4652-4663
[2]   Similarities and differences in genome-wide expression data of six organisms [J].
Bergmann, S ;
Ihmels, J ;
Barkai, N .
PLOS BIOLOGY, 2004, 2 (01) :85-93
[3]   Gene connectivity, function, and sequence conservation: predictions from modular yeast co-expression networks [J].
Carlson, MRJ ;
Zhang, B ;
Fang, ZX ;
Mischel, PS ;
Horvath, S ;
Nelson, SF .
BMC GENOMICS, 2006, 7 (1)
[4]   Histopathological Images and Multi-Omics Integration Predict Molecular Characteristics and Survival in Lung Adenocarcinoma [J].
Chen, Linyan ;
Zeng, Hao ;
Xiang, Yu ;
Huang, Yeqian ;
Luo, Yuling ;
Ma, Xuelei .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9
[5]   Prognostic risk factor of major salivary gland carcinomas and survival prediction model based on random survival forests [J].
Chen, Yufan ;
Li, Guoli ;
Jiang, Wenmei ;
Nie, Rong Cheng ;
Deng, Honghao ;
Chen, Yingle ;
Li, Hao ;
Chen, Yanfeng .
CANCER MEDICINE, 2023, 12 (09) :10899-10907
[6]   Compact Integration of Multi-Network Topology for Functional Analysis of Genes [J].
Cho, Hyunghoon ;
Berger, Bonnie ;
Peng, Jian .
CELL SYSTEMS, 2016, 3 (06) :540-+
[7]   Patterns of somatic mutation in human cancer genomes [J].
Greenman, Christopher ;
Stephens, Philip ;
Smith, Raffaella ;
Dalgliesh, Gillian L. ;
Hunter, Christopher ;
Bignell, Graham ;
Davies, Helen ;
Teague, Jon ;
Butler, Adam ;
Edkins, Sarah ;
O'Meara, Sarah ;
Vastrik, Imre ;
Schmidt, Esther E. ;
Avis, Tim ;
Barthorpe, Syd ;
Bhamra, Gurpreet ;
Buck, Gemma ;
Choudhury, Bhudipa ;
Clements, Jody ;
Cole, Jennifer ;
Dicks, Ed ;
Forbes, Simon ;
Gray, Kris ;
Halliday, Kelly ;
Harrison, Rachel ;
Hills, Katy ;
Hinton, Jon ;
Jenkinson, Andy ;
Jones, David ;
Menzies, Andy ;
Mironenko, Tatiana ;
Perry, Janet ;
Raine, Keiran ;
Richardson, Dave ;
Shepherd, Rebecca ;
Small, Alexandra ;
Tofts, Calli ;
Varian, Jennifer ;
Webb, Tony ;
West, Sofie ;
Widaa, Sara ;
Yates, Andy ;
Cahill, Daniel P. ;
Louis, David N. ;
Goldstraw, Peter ;
Nicholson, Andrew G. ;
Brasseur, Francis ;
Looijenga, Leendert ;
Weber, Barbara L. ;
Chiew, Yoke-Eng .
NATURE, 2007, 446 (7132) :153-158
[8]   Network based stratification of major cancers by integrating somatic mutation and gene expression data [J].
He, Zongzhen ;
Zhang, Junying ;
Yuan, Xiguo ;
Liu, Zhaowen ;
Liu, Baobao ;
Tuo, Shouheng ;
Liu, Yajun .
PLOS ONE, 2017, 12 (05)
[9]  
Hofree M, 2013, NAT METHODS, V10, P1108, DOI [10.1038/NMETH.2651, 10.1038/nmeth.2651]
[10]   RANDOM SURVIVAL FORESTS [J].
Ishwaran, Hemant ;
Kogalur, Udaya B. ;
Blackstone, Eugene H. ;
Lauer, Michael S. .
ANNALS OF APPLIED STATISTICS, 2008, 2 (03) :841-860