Multiclassification Prediction of Enzymatic Reactions for Oxidoreductases and Hydrolases Using Reaction Fingerprints and Machine Learning Methods

被引:15
作者
Cai, Yingchun [1 ]
Yang, Hongbin [1 ]
Li, Weihua [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ]
Tang, Yun [1 ]
机构
[1] East China Univ Sci & Technol, Shanghai Key Lab New Drug Design, Sch Pharm, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
IN-SILICO PREDICTION; EC NUMBERS; CLASSIFICATION; METABOLISM; KNOWLEDGE; INFORMATION; ASSIGNMENT; REGRESSION; QSAR; SAR;
D O I
10.1021/acs.jcim.7b00656
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Drug metabolism is a complex procedure in the human body, including a series of enzymatically catalyzed reactions. However, it is costly and time consuming to investigate drug metabolism experimentally; computational methods are hence developed to predict drug metabolism and have shown great advantages. As the first step, classification of metabolic reactions and enzymes is highly desirable for drug metabolism prediction. In this study, we developed multi classification models for prediction of reaction types catalyzed by oxidoreductases and hydrolases, in which three reaction fingerprints were used to describe the reactions and seven machine learnings algorithms were employed for model building. Data retrieved from KEGG containing 1055 hydrolysis and 2510 redox reactions were used to build the models, respectively. The external validation data consisted of 213 hydrolysis and 512 redox reactions extracted from the Rhea database. The best models were built by neural network or logistic regression with a 2048-bit transformation reaction fingerprint. The predictive accuracies of the main class, subclass, and superclass classification models on external validation sets were all above 90%. This study will be very helpful for enzymatic reaction annotation and further study on metabolism prediction.
引用
收藏
页码:1169 / 1181
页数:13
相关论文
共 44 条
[1]   Rhea-a manually curated resource of biochemical reactions [J].
Alcantara, Rafael ;
Axelsen, Kristian B. ;
Morgat, Anne ;
Belda, Eugeni ;
Coudert, Elisabeth ;
Bridge, Alan ;
Cao, Hong ;
de Matos, Paula ;
Ennis, Marcus ;
Turner, Steve ;
Owen, Gareth ;
Bougueleret, Lydie ;
Xenarios, Ioannis ;
Steinbeck, Christoph .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D754-D760
[2]   Definitions of enzyme function for the structural genomics era [J].
Babbitt, PC .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2003, 7 (02) :230-237
[3]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[4]   Computational models to predict endocrine-disrupting chemical binding with androgen or oestrogen receptors [J].
Chen, Yingjie ;
Cheng, Feixiong ;
Sun, Lu ;
Li, Weihua ;
Liu, Guixia ;
Tang, Yun .
ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY, 2014, 110 :280-287
[5]   Classification of Cytochrome P450 Inhibitors and Noninhibitors Using Combined Classifiers [J].
Cheng, Feixiong ;
Yu, Yue ;
Shen, Jie ;
Yang, Lei ;
Li, Weihua ;
Liu, Guixia ;
Lee, Philip W. ;
Tang, Yun .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (05) :996-1011
[6]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[7]   Robust optimization of SVM hyperparameters in the classification of bioactive compounds [J].
Czarnecki, Wojciech M. ;
Podlewska, Sabina ;
Bojarski, Andrzej J. .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[8]   Characterising Complex Enzyme Reaction Data [J].
Donertas, Handan Melike ;
Cuesta, Sergio Martinez ;
Rahman, Syed Asad ;
Thornton, Janet M. .
PLOS ONE, 2016, 11 (02)
[9]   Genome annotation errors in pathway databases due to semantic ambiguity in partial EC numbers [J].
Green, ML ;
Karp, PD .
NUCLEIC ACIDS RESEARCH, 2005, 33 (13) :4035-4039
[10]   Assignment of EC Numbers to Enzymatic Reactions with Reaction Difference Fingerprints [J].
Hu, Qian-Nan ;
Zhu, Hui ;
Li, Xiaobing ;
Zhang, Manman ;
Deng, Zhe ;
Yang, Xiaoyan ;
Deng, Zixin .
PLOS ONE, 2012, 7 (12)