graphkit-learn: A Python Library for Graph Kernels Based on Linear Patterns
Abstract
This paper presents graphkit-learn, the first Python library for efficient computation of graph kernels based on linear patterns, and one that handles various types of graphs. The library implements graph kernels based on linear patterns comprehensively, each with dedicated computation methods, along with two well-known graph kernels based on non-linear patterns for comparative analysis. Since computational complexity is the Achilles' heel of graph kernels, we provide several strategies to address this critical issue, including parallelization, the trie data structure, and the FCSP method, which we extend to other kernels and to edge comparison. All proposed strategies reduce computing time and memory usage by orders of magnitude. Moreover, every graph kernel can be computed with a single Python statement, which makes the library appealing to researchers and practitioners. For ease of use, an advanced model selection procedure is provided for both regression and classification problems. Experiments on synthesized datasets and 11 real-world benchmark datasets show the relevance of the proposed library.
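To make the idea of a linear-pattern kernel backed by a trie concrete, here is a minimal pure-Python sketch. It is not graphkit-learn's API or implementation: the names `Trie`, `path_trie`, and `path_kernel`, the adjacency-dictionary graph format, and the choice of a plain count-based path kernel are all assumptions made for illustration.

```python
class Trie:
    """Prefix tree mapping label sequences to occurrence counts (hypothetical helper)."""

    def __init__(self):
        self.children = {}
        self.count = 0

    def add(self, seq):
        node = self
        for label in seq:
            node = node.children.setdefault(label, Trie())
        node.count += 1

    def get(self, seq):
        node = self
        for label in seq:
            node = node.children.get(label)
            if node is None:
                return 0
        return node.count

    def items(self, prefix=()):
        """Yield (label sequence, count) for every stored sequence."""
        if self.count:
            yield prefix, self.count
        for label, child in self.children.items():
            yield from child.items(prefix + (label,))


def path_trie(adj, labels, max_len):
    """Store the label sequences of all simple paths with at most max_len edges.

    Paths are enumerated from every start vertex, so an undirected path
    contributes both of its orientations.
    """
    trie = Trie()

    def extend(path):
        trie.add(tuple(labels[v] for v in path))
        if len(path) - 1 < max_len:
            for nxt in adj[path[-1]]:
                if nxt not in path:  # keep the path simple (no repeated vertex)
                    extend(path + [nxt])

    for v in adj:
        extend([v])
    return trie


def path_kernel(g1, g2, max_len=2):
    """Sum, over shared label sequences, of the product of their counts."""
    t1 = path_trie(g1[0], g1[1], max_len)
    t2 = path_trie(g2[0], g2[1], max_len)
    return sum(c * t2.get(seq) for seq, c in t1.items())


# Two small labeled graphs as (adjacency dict, vertex-label dict) pairs.
g1 = ({0: [1], 1: [0]}, {0: "C", 1: "O"})
g2 = ({0: [1], 1: [0, 2], 2: [1]}, {0: "C", 1: "C", 2: "O"})
k12 = path_kernel(g1, g2)
```

Sequences that share a prefix share trie nodes, which is the kind of memory saving a trie offers over storing each path separately. The recursive enumeration is only practical for small graphs and short path lengths.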
Domains
Statistics [stat] / Machine Learning [stat.ML]
Engineering Sciences [physics] / Signal and Image processing
Mathematics [math] / Statistics [math.ST]
Computer Science [cs] / Signal and Image Processing
Computer Science [cs] / Neural and Evolutionary Computing [cs.NE]
Computer Science [cs] / Machine Learning [cs.LG]
Computer Science [cs] / Computers and Society [cs.CY]
Computer Science [cs] / Computer Vision and Pattern Recognition [cs.CV]
Computer Science [cs] / Artificial Intelligence [cs.AI]
Main file
PRL_graphkit_learn__A_Python_Library_for_Graph_Kernels_Based_on_Linear_Patterns.pdf (499.84 KB)
Origin: Files produced by the author(s)