Multiclassification Prediction of Enzymatic Reactions for Oxidoreductases and Hydrolases Using Reaction Fingerprints and Machine Learning Methods

被引:15
作者
Cai, Yingchun [1 ]
Yang, Hongbin [1 ]
Li, Weihua [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ]
Tang, Yun [1 ]
机构
[1] East China Univ Sci & Technol, Shanghai Key Lab New Drug Design, Sch Pharm, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
IN-SILICO PREDICTION; EC NUMBERS; CLASSIFICATION; METABOLISM; KNOWLEDGE; INFORMATION; ASSIGNMENT; REGRESSION; QSAR; SAR;
D O I
10.1021/acs.jcim.7b00656
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Drug metabolism is a complex procedure in the human body, including a series of enzymatically catalyzed reactions. However, it is costly and time consuming to investigate drug metabolism experimentally; computational methods are hence developed to predict drug metabolism and have shown great advantages. As the first step, classification of metabolic reactions and enzymes is highly desirable for drug metabolism prediction. In this study, we developed multi classification models for prediction of reaction types catalyzed by oxidoreductases and hydrolases, in which three reaction fingerprints were used to describe the reactions and seven machine learnings algorithms were employed for model building. Data retrieved from KEGG containing 1055 hydrolysis and 2510 redox reactions were used to build the models, respectively. The external validation data consisted of 213 hydrolysis and 512 redox reactions extracted from the Rhea database. The best models were built by neural network or logistic regression with a 2048-bit transformation reaction fingerprint. The predictive accuracies of the main class, subclass, and superclass classification models on external validation sets were all above 90%. This study will be very helpful for enzymatic reaction annotation and further study on metabolism prediction.
引用
收藏
页码:1169 / 1181
页数:13
相关论文
共 50 条
[31]   In Silico Prediction of Chemicals Binding to Aromatase with Machine Learning Methods [J].
Du, Hanwen ;
Cai, Yingchun ;
Yang, Hongbin ;
Zhang, Hongxiao ;
Xue, Yuhan ;
Liu, Guixia ;
Tang, Yun ;
Li, Weihua .
CHEMICAL RESEARCH IN TOXICOLOGY, 2017, 30 (05) :1209-1218
[32]   Machine learning based methods for software fault prediction: A survey [J].
Pandey, Sushant Kumar ;
Mishra, Ravi Bhushan ;
Tripathi, Anil Kumar .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 172
[33]   Machine Learning Ensemble Methods for Optical Network Traffic Prediction [J].
Szostak, Daniel .
14TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN SECURITY FOR INFORMATION SYSTEMS AND 12TH INTERNATIONAL CONFERENCE ON EUROPEAN TRANSNATIONAL EDUCATIONAL (CISIS 2021 AND ICEUTE 2021), 2022, 1400 :105-115
[34]   A review on gender identification using machine learning based on fingerprints [J].
Yadav, Jitendra Singh ;
Gupta, Amit Kumar ;
Saxena, Arjit .
JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (05) :1121-1129
[35]   Machine Learning Methods for Quality Prediction in Production [J].
Sankhye, Sidharth ;
Hu, Guiping .
LOGISTICS-BASEL, 2020, 4 (04)
[36]   A survey on machine learning methods for churn prediction [J].
Geiler, Louis ;
Affeldt, Severine ;
Nadif, Mohamed .
INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (03) :217-242
[37]   Risk prediction with machine learning and regression methods [J].
Steyerberg, Ewout W. ;
van der Ploeg, Tjeerd ;
Van Calster, Ben .
BIOMETRICAL JOURNAL, 2014, 56 (04) :601-606
[38]   In Silico Prediction of CYP2C8 Inhibition with Machine-Learning Methods [J].
Zhang, Xiaoxiao ;
Zhao, Piaopiao ;
Wang, Zhiyuan ;
Xu, Xuan ;
Liu, Guixia ;
Tang, Yun ;
Li, Weihua .
CHEMICAL RESEARCH IN TOXICOLOGY, 2021, 34 (08) :1850-1859
[39]   Machine learning methods and predictive ability metrics for genome-wide prediction of complex traits [J].
Gonzalez-Recio, Oscar ;
Rosa, Guilherme J. M. ;
Gianola, Daniel .
LIVESTOCK SCIENCE, 2014, 166 :217-231
[40]   In silico prediction of chemical neurotoxicity using machine learning [J].
Jiang, Changsheng ;
Zhao, Piaopiao ;
Li, Weihua ;
Tang, Yun ;
Liu, Guixia .
TOXICOLOGY RESEARCH, 2020, 9 (03) :164-172