Deep Learning Based Tumor Type Classification Using Gene Expression Data

Cited by: 90
Authors
Lyu, Boyu [1]
Haque, Anamul [1]
Affiliation
[1] Virginia Tech, Blacksburg, VA 24061 USA
Source
ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS | 2018
Keywords
Deep Learning; Tumor Type Classification; Pan-Cancer Atlas; Convolutional Neural Network; B-CELL LYMPHOMA; CANCER;
DOI
10.1145/3233547.3233588
Chinese Library Classification
Q [Biological Sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Differential analysis is the most significant part of RNA-Seq analysis. Conventional differential analysis usually compares tumor samples with normal samples from the same tumor type. Such a method fails to differentiate between tumor types because it lacks knowledge of the other tumor types. The Pan-Cancer Atlas provides abundant information on 33 prevalent tumor types, which can be used as prior knowledge to generate tumor-specific biomarkers. In this paper, we embedded the high-dimensional RNA-Seq data into 2-D images and used a convolutional neural network to classify the 33 tumor types. The final classification accuracy was 95.59%. Furthermore, based on the idea of Guided Grad-CAM, we generated a significance heat-map over all genes for each class. Functional analysis of the genes with high intensities in the heat-maps showed that these top genes are related to tumor-specific pathways, and some of them are already used as biomarkers, which supports the effectiveness of our method. As far as we know, we are the first to apply a convolutional neural network to the Pan-Cancer Atlas for tumor type classification, and also the first to use each gene's contribution to the classification as a measure of gene importance for identifying candidate biomarkers. Our experimental results show that the method performs well and could also be applied to other genomics data.
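The two ideas in the abstract, embedding an expression vector into a 2-D image and scoring positions by their contribution to a class, can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the function names, the log2 transform, the min-max scaling, the row-major gene order, and the zero padding. The second function shows only the plain Grad-CAM weighting step (Selvaraju et al.), not the full Guided Grad-CAM variant the abstract refers to.

```python
import math
import numpy as np


def embed_expression_as_image(expr):
    """Reshape a 1-D expression vector into a square 2-D image.

    Hypothetical preprocessing (assumed, not from the paper):
    log2(x + 1), min-max scaling to [0, 1], zero padding to a square.
    """
    expr = np.asarray(expr, dtype=np.float32)
    if expr.ndim != 1 or expr.size == 0:
        raise ValueError("expr must be a non-empty 1-D vector")
    values = np.log2(expr + 1.0)
    lo, hi = values.min(), values.max()
    # Guard against a constant vector to avoid dividing by zero.
    scaled = (values - lo) / (hi - lo) if hi > lo else np.zeros_like(values)
    side = math.isqrt(expr.size - 1) + 1  # smallest side with side**2 >= n
    image = np.zeros(side * side, dtype=np.float32)
    image[: expr.size] = scaled  # genes fill the image in row-major order
    return image.reshape(side, side)


def grad_cam(feature_maps, gradients):
    """Plain Grad-CAM map from conv activations and class-score gradients.

    feature_maps, gradients: arrays of shape (K, H, W), where gradients
    holds d(class score)/d(feature_maps). Returns an (H, W) heat-map.
    """
    a = np.asarray(feature_maps, dtype=np.float32)
    g = np.asarray(gradients, dtype=np.float32)
    weights = g.mean(axis=(1, 2))           # one weight per channel
    cam = np.tensordot(weights, a, axes=1)  # weighted sum over channels
    return np.maximum(cam, 0.0)             # ReLU keeps positive evidence


# Usage with toy data: 10 "genes" become a 4x4 image with 6 padded pixels.
image = embed_expression_as_image(np.arange(10))
heat = grad_cam(np.ones((2, 4, 4)), np.ones((2, 4, 4)))
```

In a real pipeline the activations and gradients would come from a trained network, and mapping heat-map positions back to genes depends on the embedding order chosen above.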
Pages: 89-96
Number of pages: 8