graphkit-learn: A Python']Python library for graph kernels based on linear patterns

被引:5
作者
Jia, Linlin [1 ]
Gauzere, Benoit [1 ]
Honeine, Paul [2 ]
机构
[1] INSA Rouen Normandie, LITIS, Rouen, France
[2] Univ Rouen Normandie, LITIS, Rouen, France
关键词
Graph kernels; Linear patterns; !text type='Python']Python[!/text] implementation; NEURAL-NETWORK; DATABASE;
D O I
10.1016/j.patrec.2021.01.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents graphkit-learn , the first Python library for efficient computation of graph kernels based on linear patterns, able to address various types of graphs. Graph kernels based on linear patterns are thoroughly implemented, each with specific computing methods, as well as two well-known graph kernels based on non-linear patterns for comparative analysis. Since computational complexity is an Achilles' heel of graph kernels, we provide several strategies to address this critical issue, including parallelization, the trie data structure, and the FCSP method that we extend to other kernels and edge comparison. All proposed strategies save orders of magnitudes of computing time and memory usage. Moreover, all the graph kernels can be simply computed with a single Python statement, thus are appealing to researchers and practitioners. For the convenience of use, an advanced model selection procedure is provided for both regression and classification problems. Experiments on synthesized datasets and 11 real-world benchmark datasets show the relevance of the proposed library. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:113 / 121
页数:9
相关论文
共 31 条
[1]  
[Anonymous], NEURAL NETW
[2]  
[Anonymous], 2003, ICML
[3]  
[Anonymous], 2007, EUR S ART NEUR NETW
[4]   Shortest-path kernels on graphs [J].
Borgwardt, KM ;
Kriegel, HP .
Fifth IEEE International Conference on Data Mining, Proceedings, 2005, :74-81
[5]   Protein function prediction via graph kernels [J].
Borgwardt, KM ;
Ong, CS ;
Schönauer, S ;
Vishwanathan, SVN ;
Smola, AJ ;
Kriegel, HP .
BIOINFORMATICS, 2005, 21 :I47-I56
[6]  
Bougleux S, 2012, LECT NOTES COMPUT SC, V7626, P199, DOI 10.1007/978-3-642-34166-3_22
[7]  
Brun L., 2018, GREYC CHEM DATASET
[8]   USE OF A NEURAL-NETWORK TO DETERMINE THE NORMAL BOILING POINTS OF ACYCLIC ETHERS, PEROXIDES, ACETALS AND THEIR SULFUR ANALOGS [J].
CHERQAOUI, D ;
VILLEMIN, D ;
MESBAH, A ;
CENSE, JM ;
KVASNICKA, V .
JOURNAL OF THE CHEMICAL SOCIETY-FARADAY TRANSACTIONS, 1994, 90 (14) :2015-2019
[9]   USE OF A NEURAL-NETWORK TO DETERMINE THE BOILING-POINT OF ALKANES [J].
CHERQAOUI, D ;
VILLEMIN, D .
JOURNAL OF THE CHEMICAL SOCIETY-FARADAY TRANSACTIONS, 1994, 90 (01) :97-102
[10]   STRUCTURE ACTIVITY RELATIONSHIP OF MUTAGENIC AROMATIC AND HETEROAROMATIC NITRO-COMPOUNDS - CORRELATION WITH MOLECULAR-ORBITAL ENERGIES AND HYDROPHOBICITY [J].
DEBNATH, AK ;
DECOMPADRE, RLL ;
DEBNATH, G ;
SHUSTERMAN, AJ ;
HANSCH, C .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (02) :786-797