Multiclassification Prediction of Enzymatic Reactions for Oxidoreductases and Hydrolases Using Reaction Fingerprints and Machine Learning Methods

被引:15
|
作者
Cai, Yingchun [1 ]
Yang, Hongbin [1 ]
Li, Weihua [1 ]
Liu, Guixia [1 ]
Lee, Philip W. [1 ]
Tang, Yun [1 ]
机构
[1] East China Univ Sci & Technol, Shanghai Key Lab New Drug Design, Sch Pharm, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
IN-SILICO PREDICTION; EC NUMBERS; CLASSIFICATION; METABOLISM; KNOWLEDGE; INFORMATION; ASSIGNMENT; REGRESSION; QSAR; SAR;
D O I
10.1021/acs.jcim.7b00656
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Drug metabolism is a complex procedure in the human body, including a series of enzymatically catalyzed reactions. However, it is costly and time consuming to investigate drug metabolism experimentally; computational methods are hence developed to predict drug metabolism and have shown great advantages. As the first step, classification of metabolic reactions and enzymes is highly desirable for drug metabolism prediction. In this study, we developed multi classification models for prediction of reaction types catalyzed by oxidoreductases and hydrolases, in which three reaction fingerprints were used to describe the reactions and seven machine learnings algorithms were employed for model building. Data retrieved from KEGG containing 1055 hydrolysis and 2510 redox reactions were used to build the models, respectively. The external validation data consisted of 213 hydrolysis and 512 redox reactions extracted from the Rhea database. The best models were built by neural network or logistic regression with a 2048-bit transformation reaction fingerprint. The predictive accuracies of the main class, subclass, and superclass classification models on external validation sets were all above 90%. This study will be very helpful for enzymatic reaction annotation and further study on metabolism prediction.
引用
收藏
页码:1169 / 1181
页数:13
相关论文
共 50 条
  • [1] Similarity Perception of Reactions Catalyzed by Oxidoreductases and Hydrolases Using Different Classification Methods
    Hu, Xiaoying
    Yan, Aixia
    Tan, Tianwei
    Sacher, Oliver
    Gasteiger, Johann
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (06) : 1089 - 1100
  • [2] In Silico Prediction of Chemical Acute Oral Toxicity Using MultiClassification Methods
    Li, Xiao
    Chen, Lei
    Cheng, Feixiong
    Wu, Zengrui
    Bian, Hanping
    Xu, Congying
    Li, Weihua
    Liu, Guixia
    Shen, Xu
    Tang, Yun
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (04) : 1061 - 1069
  • [3] A Parallel Multiclassification Algorithm for Big Data Using an Extreme Learning Machine
    Duan, Mingxing
    Li, Kenli
    Liao, Xiangke
    Li, Keqin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2337 - 2351
  • [4] Prediction of pKa Using Machine Learning Methods with Rooted Topological Torsion Fingerprints: Application to Aliphatic Amines
    Lu, Yipin
    Anand, Shankara
    Shirley, William
    Gedeck, Peter
    Kelley, Brian P.
    Skolnik, Suzanne
    Rodde, Stephane
    Mai Nguyen
    Lindvall, Mika
    Jia, Weiping
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (11) : 4706 - 4719
  • [5] Assignment of EC Numbers to Enzymatic Reactions with Reaction Difference Fingerprints
    Hu, Qian-Nan
    Zhu, Hui
    Li, Xiaobing
    Zhang, Manman
    Deng, Zhe
    Yang, Xiaoyan
    Deng, Zixin
    PLOS ONE, 2012, 7 (12):
  • [6] Risk estimation and risk prediction using machine-learning methods
    Kruppa, Jochen
    Ziegler, Andreas
    Koenig, Inke R.
    HUMAN GENETICS, 2012, 131 (10) : 1639 - 1654
  • [7] Prediction of Multicomponent Reaction Yields Using Machine Learning
    Zhu, Xing-Yong
    Ran, Chuan-Kun
    Wen, Ming
    Guo, Gui-Ling
    Liu, Yuan
    Liao, Li-Li
    Li, Yi-Zhou
    Li, Meng-Long
    Yu, Da-Gang
    CHINESE JOURNAL OF CHEMISTRY, 2021, 39 (12) : 3231 - 3237
  • [8] Machine Learning Methods for Septic Shock Prediction
    Darwiche, Aiman
    Mukherjee, Sumitra
    AIVR 2018: 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND VIRTUAL REALITY, 2018, : 104 - 110
  • [9] In Silico Prediction of Physicochemical Properties of Environmental Chemicals Using Molecular Fingerprints and Machine Learning
    Zang, Qingda
    Mansouri, Kamel
    Williams, Antony J.
    Judson, Richard S.
    Allen, David G.
    Casey, Warren M.
    Kleinstreuer, Nicole C.
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2017, 57 (01) : 36 - 49
  • [10] Prediction of hERG potassium channel blockage using ensemble learning methods and molecular fingerprints
    Liu, Miao
    Zhang, Li
    Li, Shimeng
    Yang, Tianzhou
    Liu, Lili
    Zhao, Jian
    Liu, Hongsheng
    TOXICOLOGY LETTERS, 2020, 332 : 88 - 96