iSOM-GSN: an integrative approach for transforming multi-omic data into gene similarity networks via self-organizing maps

被引:20
作者
Fatima, Nazia [1 ]
Rueda, Luis [1 ]
机构
[1] Univ Windsor, Sch Comp Sci, Windsor, ON N9B 3P4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
FOXA1; EXPRESSION; BREAST; CANCER; GATA-3;
D O I
10.1093/bioinformatics/btaa500
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: One of the main challenges in applying graph convolutional neural networks (CNNs) on gene-interaction data is the lack of understanding of the vector space to which they belong, and also the inherent difficulties involved in representing those interactions on a significantly lower dimension, viz Euclidean spaces. The challenge becomes more prevalent when dealing with various types of heterogeneous data. We introduce a systematic, generalized method, called iSOM-GSN, used to transform 'multi-omic' data with higher dimensions onto a 2D grid. Afterwards, we apply a CNN to predict disease states of various types. Based on the idea of Kohonen's self-organizing map, we generate a 2D grid for each sample for a given set of genes that represent a gene similarity network. Results: We have tested the model to predict breast and prostate cancer using gene expression, DNA methylation and copy number alteration. Prediction accuracies in the 94-98% range were obtained for tumor stages of breast cancer and calculated Gleason scores of prostate cancer with just 14 input genes for both cases. The scheme not only outputs nearly perfect classification accuracy, but also provides an enhanced scheme for representation learning, visualization, dimensionality reduction and interpretation of multi-omic data.
引用
收藏
页码:4248 / 4254
页数:7
相关论文
共 36 条
[1]  
[Anonymous], 2018, P 2018 ACM INT C BIO, DOI DOI 10.1145/3233547.3233588
[2]  
Avivar-Valderas Alvaro, 2018, Oncotarget, V9, P21444, DOI 10.18632/oncotarget.25118
[3]   Implementation of Spectral Clustering on Microarray Data of Carcinoma Using Self Organizing Map (SOM) [J].
Bustamam, A. ;
Rivai, M. A. ;
Siswantining, T. .
PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023
[4]   Multi-label feature selection via feature manifold learning and sparsity regularization [J].
Cai, Zhiling ;
Zhu, William .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (08) :1321-1334
[5]  
Cardenas M, 2015, GENOMICS COMPUT BIOL, V1, pe19
[6]   A novel FOXA1/ESR1 interacting pathway: A study of Oncomine™ breast cancer microarrays [J].
Chaudhary, Sanjib ;
Krishna, B. Madhu ;
Mishra, Sandip K. .
ONCOLOGY LETTERS, 2017, 14 (02) :1247-1264
[7]   Embedding of Genes Using Cancer Gene Expression Data: Biological Relevance and Potential Application on Biomarker Discovery [J].
Choy, Chi Tung ;
Wong, Chi Hang ;
Chan, Stephen Lam .
FRONTIERS IN GENETICS, 2019, 9
[8]   GATA-3 and FOXA1 expression is useful to differentiate breast carcinoma from other carcinomas [J].
Davis, Drew G. ;
Siddiqui, Momin T. ;
Oprea-Ilies, Gabriela ;
Stevens, Keith ;
Osunkoya, Adeboye O. ;
Cohen, Cynthia ;
Li, Xiaoxian .
HUMAN PATHOLOGY, 2016, 47 (01) :26-31
[9]   An Integrative Approach for Identifying Network Biomarkers of Breast Cancer Subtypes Using Genomic, Interactomic, and Transcriptomic Data [J].
Firoozbakht, Forough ;
Rezaeian, Iman ;
D'Agnillo, Michele ;
Porter, Lisa ;
Rueda, Luis ;
Ngom, Alioune .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2017, 24 (08) :756-766
[10]   scVAE: variational auto-encoders for single-cell gene expression data [J].
Gronbech, Christopher Heje ;
Vording, Maximillian Fornitz ;
Timshel, Pascal N. ;
Sonderby, Casper Kaae ;
Pers, Tune H. ;
Winther, Ole .
BIOINFORMATICS, 2020, 36 (16) :4415-4422