DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction

被引:93
作者
Chen, Zhe [1 ]
Zhang, Li [2 ]
Sun, Jianqiang [3 ]
Meng, Rui [4 ]
Yin, Shuaidong [4 ]
Zhao, Qi [4 ,5 ]
机构
[1] Liaoning Univ, Sch Math & Stat, Shenyang, Peoples R China
[2] Liaoning Univ, Sch Life Sci, Shenyang, Peoples R China
[3] Linyi Univ, Sch Informat Sci & Engn, Linyi, Peoples R China
[4] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan, Peoples R China
[5] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Peoples R China
基金
中国国家自然科学基金;
关键词
capsule network; carcinogenicity; computational toxicology; deep learning; graph attention network; NONCONGENERIC CHEMICALS; CANCER; DATABASE;
D O I
10.1111/jcmm.17889
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
The carcinogenicity of drugs can have a serious impact on human health, so carcinogenicity testing of new compounds is very necessary before being put on the market. Currently, many methods have been used to predict the carcinogenicity of compounds. However, most methods have limited predictive power and there is still much room for improvement. In this study, we construct a deep learning model based on capsule network and attention mechanism named DCAMCP to discriminate between carcinogenic and non-carcinogenic compounds. We train the DCAMCP on a dataset containing 1564 different compounds through their molecular fingerprints and molecular graph features. The trained model is validated by fivefold cross-validation and external validation. DCAMCP achieves an average accuracy (ACC) of 0.718 & PLUSMN; 0.009, sensitivity (SE) of 0.721 & PLUSMN; 0.006, specificity (SP) of 0.715 & PLUSMN; 0.014 and area under the receiver-operating characteristic curve (AUC) of 0.793 & PLUSMN; 0.012. Meanwhile, comparable results can be achieved on an external validation dataset containing 100 compounds, with an ACC of 0.750, SE of 0.778, SP of 0.727 and AUC of 0.811, which demonstrate the reliability of DCAMCP. The results indicate that our model has made progress in cancer risk assessment and could be used as an efficient tool in drug design.
引用
收藏
页码:3117 / 3126
页数:10
相关论文
共 44 条
[1]   Quantitative structure-activity relationships of mutagenic and carcinogenic aromatic amines [J].
Benigni, R ;
Giuliani, A ;
Franke, R ;
Gruska, A .
CHEMICAL REVIEWS, 2000, 100 (10) :3697-3714
[2]  
Benigni R, 2008, ANN I SUPER SANITA, V44, P48
[3]   Predicting the carcinogenicity of chemicals with alternative approaches: recent advances [J].
Benigni, Romualdo .
EXPERT OPINION ON DRUG METABOLISM & TOXICOLOGY, 2014, 10 (09) :1199-1208
[4]   Nongenotoxic Carcinogenicity of Chemicals: Mechanisms of Action and Early Recognition through a New Set of Structural Alerts [J].
Benigni, Romualdo ;
Bossa, Cecilia ;
Tcheremenskaia, Olga .
CHEMICAL REVIEWS, 2013, 113 (05) :2940-2957
[5]   Mechanisms of Chemical Carcinogenicity and Mutagenicity: A Review with Implications for Predictive Toxicology [J].
Benigni, Romualdo ;
Bossa, Cecilia .
CHEMICAL REVIEWS, 2011, 111 (04) :2507-2536
[6]  
Boice JD, 1996, CANCER CAUSE CONTROL, V7, pS3
[7]   MicroRNAs and complex diseases: from experimental results to computational models [J].
Chen, Xing ;
Xie, Di ;
Zhao, Qi ;
You, Zhu-Hong .
BRIEFINGS IN BIOINFORMATICS, 2019, 20 (02) :515-539
[8]   Comparison of MC4PC and MDL-QSAR rodent carcinogenicity predictions and the enhancement of predictive performance by combining QSAR models [J].
Contrera, Joseph F. ;
Kruhlak, Naomi L. ;
Matthews, Edwin J. ;
Benz, R. Daniel .
REGULATORY TOXICOLOGY AND PHARMACOLOGY, 2007, 49 (03) :172-182
[9]   THE CAUSES OF CANCER - QUANTITATIVE ESTIMATES OF AVOIDABLE RISKS OF CANCER IN THE UNITED-STATES TODAY [J].
DOLL, R ;
PETO, R .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1981, 66 (06) :1191-+
[10]   Quantitative and qualitative models for carcinogenicity prediction for non-congeneric chemicals using CP ANN method for regulatory uses [J].
Fjodorova, Natalja ;
Vracko, Marjan ;
Tusar, Marjan ;
Jezierska, Aneta ;
Novic, Marjana ;
Kuehne, Ralph ;
Schueuermann, Gerrit .
MOLECULAR DIVERSITY, 2010, 14 (03) :581-594