Elucidating Cancer Subtypes by Using the Relationship between DNA Methylation and Gene Expression

被引:0
作者
Jilani, Muneeba [1 ,3 ]
Degras, David [2 ]
Haspel, Nurit [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Boston, MA 02125 USA
[2] Univ Massachusetts, Dept Math, Boston, MA 02125 USA
[3] 100 Morrissey Blvd, Boston, MA 02125 USA
关键词
data integration; cancer subtypes; multi-omics; SURVIVAL ANALYSIS; PATHWAY ANALYSIS; TESTS;
D O I
10.3390/genes15050631
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Advancements in the field of next generation sequencing (NGS) have generated vast amounts of data for the same set of subjects. The challenge that arises is how to combine and reconcile results from different omics studies, such as epigenome and transcriptome, to improve the classification of disease subtypes. In this study, we introduce sCClust (sparse canonical correlation analysis with clustering), a technique to combine high-dimensional omics data using sparse canonical correlation analysis (sCCA), such that the correlation between datasets is maximized. This stage is followed by clustering the integrated data in a lower-dimensional space. We apply sCClust to gene expression and DNA methylation data for three cancer genomics datasets from the Cancer Genome Atlas (TCGA) to distinguish between underlying subtypes. We evaluate the identified subtypes using Kaplan-Meier plots and hazard ratio analysis on the three types of cancer-GBM (glioblastoma multiform), lung cancer and colon cancer. Comparison with subtypes identified by both single- and multi-omics studies implies improved clinical association. We also perform pathway over-representation analysis in order to identify up-regulated and down-regulated genes as tentative drug targets. The main goal of the paper is twofold: the integration of epigenomic and transcriptomic datasets followed by elucidating subtypes in the latent space. The significance of this study lies in the enhanced categorization of cancer data, which is crucial to precision medicine.
引用
收藏
页数:15
相关论文
共 78 条
[1]  
[Anonymous], 2000, R Language Definition
[2]  
[Anonymous], 2010, Int. J. Artif. Intell. Appl. (IJAIA), DOI [DOI 10.5121/IJAIA.2010.1404, 10.5121/ijaia.2010.1404]
[3]  
Arslanturk S, 2020, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020, P551
[4]   HOX and PBX gene dysregulation as a therapeutic target in glioblastoma multiforme [J].
Arunachalam, Einthavy ;
Rogers, William ;
Simpson, Guy R. ;
Moller-Levet, Carla ;
Bolton, Gemma ;
Ismael, Mohammed ;
Smith, Christopher ;
Keegen, Karl ;
Bagwan, Izhar ;
Brend, Tim ;
Short, Susan C. ;
Hong, Bangxing ;
Otani, Yoshihiro ;
Kaur, Balveen ;
Annels, Nicola ;
Morgan, Richard ;
Pandha, Hardev .
BMC CANCER, 2022, 22 (01)
[5]   DNA methylation patterns and epigenetic memory [J].
Bird, A .
GENES & DEVELOPMENT, 2002, 16 (01) :6-21
[6]   Hazard ratios in cancer clinical trials-a primer [J].
Blagoev, Krastan B. ;
Wilkerson, Julia ;
Fojo, Tito .
NATURE REVIEWS CLINICAL ONCOLOGY, 2012, 9 (03) :178-183
[7]   Colorectal papillomavirus infection in patients with colorectal cancer [J].
Bodaghi, S ;
Yamanegi, K ;
Xiao, SY ;
Da Costa, M ;
Palefsky, JM ;
Zheng, ZM .
CLINICAL CANCER RESEARCH, 2005, 11 (08) :2862-2867
[8]   Classifying lower grade glioma cases according to whole genome gene expression [J].
Chen, Baoshi ;
Liang, Tingyu ;
Yang, Pei ;
Wang, Haoyuan ;
Liu, Yanwei ;
Yang, Fan ;
You, Gan .
ONCOTARGET, 2016, 7 (45) :74031-74042
[9]   Increased HOXA5 expression provides a selective advantage for gain of whole chromosome 7 in IDH wild-type glioblastoma [J].
Cimino, Patrick J. ;
Kim, Youngmi ;
Wu, Hua-Jun ;
Alexander, Jes ;
Wirsching, Hans-Georg ;
Szulzewsky, Frank ;
Pitter, Ken ;
Ozawa, Tatsuya ;
Wang, Jiguang ;
Vazquez, Julio ;
Arora, Sonali ;
Rabadan, Raul ;
Levine, Ross ;
Michor, Franziska ;
Holland, Eric C. .
GENES & DEVELOPMENT, 2018, 32 (7-8) :512-523
[10]   Survival analysis part I: Basic concepts and first analyses [J].
Clark, TG ;
Bradburn, MJ ;
Love, SB ;
Altman, DG .
BRITISH JOURNAL OF CANCER, 2003, 89 (02) :232-238