Graph Classification of Molecules Using Force Field Atom and Bond Types

被引:5
作者
Jippo, Hideyuki [1 ]
Matsuo, Tatsuru [2 ]
Kikuchi, Ryota [2 ]
Fukuda, Daisuke [2 ]
Matsuura, Azuma [1 ]
Ohfuchi, Mari [1 ]
机构
[1] Fujitsu Labs Ltd, Digital Annealer Unit, 10-1 Morinosato Wakamiya, Atsugi, Kanagawa 2430197, Japan
[2] Fujitsu Labs Ltd, Artificial Intelligence Lab, Nakahara Ku, 4-1-1 Kamikodanaka, Kawasaki, Kanagawa 2118588, Japan
关键词
Machine learning; Force field; Graph kernel; Support vector machine; Deep neural network; NEURAL-NETWORKS; KERNELS; QSAR; VALIDATION; REGRESSION; PROTEINS; MODEL;
D O I
10.1002/minf.201800155
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Classification of the biological activities of chemical substances is important for developing new medicines efficiently. Various machine learning methods are often employed to screen large libraries of compounds and predict the activities of new substances by training the molecular structure-activity relationships. One such method is graph classification, in which a molecular structure can be represented in terms of a labeled graph with nodes that correspond to atoms and edges that correspond to the bonds between these atoms. In a conventional graph definition, atomic symbols and bond orders are employed as node and edge labels, respectively. In this study, we developed new graph definitions using the assignment of atom and bond types in the force fields of molecular dynamics methods as node and edge labels, respectively. We found that these graph definitions improved the accuracies of activity classifications for chemical substances using graph kernels with support vector machines and deep neural networks. The higher accuracies obtained using our proposed definitions can enhance the development of the materials informatics using graph-based machine learning methods.
引用
收藏
页数:8
相关论文
共 44 条
[1]  
[Anonymous], ICML
[2]  
[Anonymous], P 29 INT C MACH LEAR
[3]   Shortest-path kernels on graphs [J].
Borgwardt, KM ;
Kriegel, HP .
Fifth IEEE International Conference on Data Mining, Proceedings, 2005, :74-81
[4]   ATOM PAIRS AS MOLECULAR-FEATURES IN STRUCTURE ACTIVITY STUDIES - DEFINITION AND APPLICATIONS [J].
CARHART, RE ;
SMITH, DH ;
VENKATARAGHAVAN, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1985, 25 (02) :64-73
[5]   PubChem as a Source of Polypharmacology [J].
Chen, Bin ;
Wild, David ;
Guha, Rajarshi .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (09) :2044-2055
[6]   Postgrowth annealing of defects in ZnO studied by positron annihilation, x-ray diffraction, Rutherford backscattering, cathodoluminescence, and Hall measurements [J].
Chen, ZQ ;
Yamamoto, S ;
Maekawa, M ;
Kawasuso, A ;
Yuan, XL ;
Sekiguchi, T .
JOURNAL OF APPLIED PHYSICS, 2003, 94 (08) :4807-4812
[7]   QSAR Modeling: Where Have You Been? Where Are You Going To? [J].
Cherkasov, Artem ;
Muratov, Eugene N. ;
Fourches, Denis ;
Varnek, Alexandre ;
Baskin, Igor I. ;
Cronin, Mark ;
Dearden, John ;
Gramatica, Paola ;
Martin, Yvonne C. ;
Todeschini, Roberto ;
Consonni, Viviana ;
Kuz'min, Victor E. ;
Cramer, Richard ;
Benigni, Romualdo ;
Yang, Chihae ;
Rathman, James ;
Terfloth, Lothar ;
Gasteiger, Johann ;
Richard, Ann ;
Tropsha, Alexander .
JOURNAL OF MEDICINAL CHEMISTRY, 2014, 57 (12) :4977-5010
[8]   A 2ND GENERATION FORCE-FIELD FOR THE SIMULATION OF PROTEINS, NUCLEIC-ACIDS, AND ORGANIC-MOLECULES [J].
CORNELL, WD ;
CIEPLAK, P ;
BAYLY, CI ;
GOULD, IR ;
MERZ, KM ;
FERGUSON, DM ;
SPELLMEYER, DC ;
FOX, T ;
CALDWELL, JW ;
KOLLMAN, PA .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1995, 117 (19) :5179-5197
[9]   CROSS-VALIDATION, BOOTSTRAPPING, AND PARTIAL LEAST-SQUARES COMPARED WITH MULTIPLE-REGRESSION IN CONVENTIONAL QSAR STUDIES [J].
CRAMER, RD ;
BUNCE, JD ;
PATTERSON, DE ;
FRANK, IE .
QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1988, 7 (01) :18-25
[10]   A point-charge force field for molecular mechanics simulations of proteins based on condensed-phase quantum mechanical calculations [J].
Duan, Y ;
Wu, C ;
Chowdhury, S ;
Lee, MC ;
Xiong, GM ;
Zhang, W ;
Yang, R ;
Cieplak, P ;
Luo, R ;
Lee, T ;
Caldwell, J ;
Wang, JM ;
Kollman, P .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2003, 24 (16) :1999-2012