An Inverse QSAR Method Based on Linear Regression and Integer Programming

被引:1
作者
Zhu, Jianshen [1 ]
Azam, Naveed Ahmed [1 ]
Haraguchi, Kazuya [1 ]
Zhao, Liang [2 ]
Nagamochi, Hiroshi [1 ]
Akutsu, Tatsuya [3 ]
机构
[1] Kyoto Univ, Dept Appl Math & Phys, Kyoto 6068501, Japan
[2] Kyoto Univ, Grad Sch Adv Integrated Studies Human Survavibil, Kyoto 6068306, Japan
[3] Kyoto Univ, Bioinformat Ctr, Inst Chem Res, Uji, Kyoto 6110011, Japan
来源
FRONTIERS IN BIOSCIENCE-LANDMARK | 2022年 / 27卷 / 06期
基金
日本学术振兴会;
关键词
machine learning; linear regression; integer programming; chemoinformatics; materials informatics; QSAR/QSPR; molecular design; DESIGN; INDEXES;
D O I
10.31083/j.fbl2706188
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background: Drug design is one of the important applications of biological science. Extensive studies have been done on computer-aided drug design based on inverse quantitative structure activity relationship (inverse QSAR), which is to infer chemical compounds from given chemical activities and constraints. However, exact or optimal solutions are not guaranteed in most of the existing methods. Method: Recently a novel framework based on artificial neural networks (ANNs) and mixed integer linear programming (MILP) has been proposed for designing chemical structures. This framework consists of two phases: an ANN is used to construct a prediction function, and then an MILP formulated on the trained ANN and a graph search algorithm are used to infer desired chemical structures. In this paper, we use linear regression instead of ANNs to construct a prediction function. For this, we derive a novel MILP formulation that simulates the computation process of a prediction function by linear regression. Results: For the first phase, we performed computational experiments using 18 chemical properties, and the proposed method achieved good prediction accuracy for a relatively large number of properties, in comparison with ANNs in our previous work. For the second phase, we performed computational experiments on five chemical properties, and the method could infer chemical structures with around up to 50 non-hydrogen atoms. Conclusions: Combination of linear regression and integer programming is a potentially useful approach to computational molecular design.
引用
收藏
页数:14
相关论文
共 37 条
[1]  
Akutsu T., 2020, ARXIV
[2]   A Mixed Integer Linear Programming Formulation to Artificial Neural Networks [J].
Akutsu, Tatsuya ;
Nagamochi, Hiroshi .
PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND SYSTEMS (ICISS 2019), 2019, :215-220
[3]   Inferring a graph from path frequency [J].
Akutsu, Tatsuya ;
Fukagawa, Daiji ;
Jansson, Jesper ;
Sadakane, Kunihiko .
DISCRETE APPLIED MATHEMATICS, 2012, 160 (10-11) :1416-1428
[4]  
[Anonymous], ANNOTATIONS HSDB ON
[5]   A novel method for inference of acyclic chemical compounds with bounded branch-height based on artificial neural networks and integer programming [J].
Azam, Naveed Ahmed ;
Zhu, Jianshen ;
Sun, Yanming ;
Shi, Yu ;
Shurbevski, Aleksandar ;
Zhao, Liang ;
Nagamochi, Hiroshi ;
Akutsu, Tatsuya .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2021, 16 (01)
[6]   A Novel Method for the Inverse QSAR/QSPR based on Artificial Neural Networks and Mixed Integer Linear Programming with Guaranteed Admissibility [J].
Azam, Naveed Ahmed ;
Chiewvanichakorn, Rachaya ;
Zhang, Fan ;
Shurbevski, Aleksandar ;
Nagamochi, Hiroshi ;
Akutsu, Tatsuya .
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, :101-108
[7]  
Bohacek RS, 1996, MED RES REV, V16, P3, DOI 10.1002/(SICI)1098-1128(199601)16:1<3::AID-MED1>3.3.CO
[8]  
2-D
[9]   QSAR Modeling: Where Have You Been? Where Are You Going To? [J].
Cherkasov, Artem ;
Muratov, Eugene N. ;
Fourches, Denis ;
Varnek, Alexandre ;
Baskin, Igor I. ;
Cronin, Mark ;
Dearden, John ;
Gramatica, Paola ;
Martin, Yvonne C. ;
Todeschini, Roberto ;
Consonni, Viviana ;
Kuz'min, Victor E. ;
Cramer, Richard ;
Benigni, Romualdo ;
Yang, Chihae ;
Rathman, James ;
Terfloth, Lothar ;
Gasteiger, Johann ;
Richard, Ann ;
Tropsha, Alexander .
JOURNAL OF MEDICINAL CHEMISTRY, 2014, 57 (12) :4977-5010
[10]  
De Cao N., 2018, ARXIV