Genomic pan-cancer classification using image-based deep learning

Cited by: 2
Authors
Ye, Taoyu [1]
Li, Sen [1]
Zhang, Yang [1]
Affiliations
[1] Harbin Inst Technol Shenzhen, Shenzhen 518055, Guangdong, Peoples R China
Source
COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL | 2021 / Volume 19
Keywords
Pan-cancer classification; Genetic mutation map; Image-based deep learning; Guided Grad-CAM visualization; Tumor-type-specific genes; Pathway analysis; RENIN-ANGIOTENSIN SYSTEM; FOCAL-ADHESION KINASE; PROSTATE-CANCER; ANDROGEN RECEPTOR; GENE FUSIONS; CELL-GROWTH; ACTIVATION; PATHWAY; TARGETS; BIOLOGY;
DOI
10.1016/j.csbj.2021.01.010
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Accurate cancer type classification based on genetic mutation can significantly facilitate cancer-related diagnosis. However, existing methods usually use feature selection combined with simple classifiers to quantify key mutated genes, resulting in poor classification performance. To circumvent this problem, a novel image-based deep learning strategy is employed to distinguish different types of cancer. Unlike conventional methods, we first convert gene mutation data containing single nucleotide polymorphisms, insertions and deletions into a genetic mutation map, and then apply deep learning networks to classify different cancer types based on the mutation map. We outline these methods and present results obtained in training VGG-16, Inception-v3, ResNet-50 and Inception-ResNet-v2 neural networks to classify 36 types of cancer from 9047 patient samples. Our approach achieves higher overall accuracy (over 95%) than other widely adopted classification methods. Furthermore, we demonstrate the application of a Guided Grad-CAM visualization to generate heatmaps and identify the top-ranked tumor-type-specific genes and pathways. Experimental results on prostate and breast cancer demonstrate that our method can be applied to various types of cancer. Powered by deep learning, this approach can potentially provide a new solution for pan-cancer classification and cancer driver gene discovery. The source code and datasets supporting the study are available at https://github.com/yetaoyu/Genomic-pancancer-classification. (C) 2021 The Author(s). Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.
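The conversion step described in the abstract (per-sample mutation data turned into a two-dimensional map that an image classifier can consume) might look roughly like the sketch below. This is an illustration only, not the authors' implementation: the function name `mutation_map`, the row-major gene layout, the near-square grid size, and the use of raw mutation counts as pixel values are all assumptions, since the abstract does not specify how the map is constructed.

```python
import math

def mutation_map(mutated_genes, gene_order):
    """Place per-gene mutation counts on a near-square 2D grid.

    mutated_genes: dict mapping gene symbol -> mutation count for one sample.
    gene_order: fixed list of gene symbols shared by all samples
                (hypothetical; the real ordering is not given in the abstract).
    """
    # Smallest square grid that can hold every gene in gene_order.
    side = math.ceil(math.sqrt(len(gene_order)))
    grid = [[0] * side for _ in range(side)]
    for index, gene in enumerate(gene_order):
        row, col = divmod(index, side)  # row-major placement (an assumption)
        grid[row][col] = mutated_genes.get(gene, 0)
    return grid

# Toy example with made-up counts, not data from the study.
genes = ["TP53", "KRAS", "PIK3CA", "BRAF", "AR"]
sample = {"TP53": 2, "AR": 1}
grid = mutation_map(sample, genes)
for row in grid:
    print(row)
```

Keeping `gene_order` fixed across samples means each gene always maps to the same pixel, which is what would let a heatmap over the image be read back as a ranking of genes.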
Pages: 835-846
Page count: 12