omicsGAT: Graph Attention Network for Cancer Subtype Analyses

被引:9
作者
Baul, Sudipto [1 ,2 ]
Ahmed, Khandakar Tanvir [1 ,2 ]
Filipek, Joseph [1 ,2 ]
Zhang, Wei [1 ,2 ]
机构
[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
[2] Univ Cent Florida, Genom & Bioinformat Cluster, Orlando, FL 32816 USA
关键词
graph attention network; single-cell RNA-seq; patient stratification; cancer outcome prediction; RNA-SEQ; BREAST;
D O I
10.3390/ijms231810220
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The use of high-throughput omics technologies is becoming increasingly popular in all facets of biomedical science. The mRNA sequencing (RNA-seq) method reports quantitative measures of more than tens of thousands of biological features. It provides a more comprehensive molecular perspective of studied cancer mechanisms compared to traditional approaches. Graph-based learning models have been proposed to learn important hidden representations from gene expression data and network structure to improve cancer outcome prediction, patient stratification, and cell clustering. However, these graph-based methods cannot rank the importance of the different neighbors for a particular sample in the downstream cancer subtype analyses. In this study, we introduce omicsGAT, a graph attention network (GAT) model to integrate graph-based learning with an attention mechanism for RNA-seq data analysis. The multi-head attention mechanism in omicsGAT can more effectively secure information of a particular sample by assigning different attention coefficients to its neighbors. Comprehensive experiments on The Cancer Genome Atlas (TCGA) breast cancer and bladder cancer bulk RNA-seq data and two single-cell RNA-seq datasets validate that (1) the proposed model can effectively integrate neighborhood information of a sample and learn an embedding vector to improve disease phenotype prediction, cancer patient stratification, and cell clustering of the sample and (2) the attention matrix generated from the multi-head attention coefficients provides more useful information compared to the sample correlation-based adjacency matrix. From the results, we can conclude that some neighbors play a more important role than others in cancer subtype analyses of a particular sample based on the attention coefficient.
引用
收藏
页数:16
相关论文
共 47 条
[21]   DeepInf: Social Influence Prediction with Deep Learning [J].
Qiu, Jiezhong ;
Tang, Jian ;
Ma, Hao ;
Dong, Yuxiao ;
Wang, Kuansan ;
Tang, Jie .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :2110-2119
[22]  
Quah F.X., 2021, BIORXIV, DOI [10.1101/2021.05.20.445027, DOI 10.1101/2021.05.20.445027]
[23]   The Graph Neural Network Model [J].
Scarselli, Franco ;
Gori, Marco ;
Tsoi, Ah Chung ;
Hagenbuchner, Markus ;
Monfardini, Gabriele .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (01) :61-80
[24]   EPGAT: Gene Essentiality Prediction With Graph Attention Networks [J].
Schapke, Joao ;
Tavares, Anderson ;
Recamonde-Mendoza, Mariana .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) :1615-1626
[25]  
singlecell.broadinstitute, STUDY SARS COV 2 ANT
[26]   Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications [J].
Sorlie, T ;
Perou, CM ;
Tibshirani, R ;
Aas, T ;
Geisler, S ;
Johnsen, H ;
Hastie, T ;
Eisen, MB ;
van de Rijn, M ;
Jeffrey, SS ;
Thorsen, T ;
Quist, H ;
Matese, JC ;
Brown, PO ;
Botstein, D ;
Lonning, PE ;
Borresen-Dale, AL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) :10869-10874
[27]   Breast cancer classification according to immunohistochemical markers: clinicopathologic features and short-term survival analysis in a population-based study from the South of Switzerland [J].
Spitale, A. ;
Mazzola, P. ;
Soldini, D. ;
Mazzucchelli, L. ;
Bordoni, A. .
ANNALS OF ONCOLOGY, 2009, 20 (04) :628-635
[28]   Molecular classifications of breast carcinoma with similar terminology and different definitions: are they the same? [J].
Tang, Ping ;
Wang, Jianmin ;
Bourne, Patria .
HUMAN PATHOLOGY, 2008, 39 (04) :506-513
[29]  
Tao YF, 2020, PACIFIC SYMPOSIUM ON BIOCOMPUTING 2020, P79
[30]   A self-attention model for inferring cooperativity between regulatory features [J].
Ullah, Fahad ;
Ben-Hur, Asa .
NUCLEIC ACIDS RESEARCH, 2021, 49 (13)