Pattern discovery from graph-structured data - A data mining perspective

被引:0
作者
Motoda, Hiroshi [1 ]
机构
[1] Air Force Off Sci Res, Asian Off Aerosp Res & Dev, Tokyo, Japan
来源
NEW TRENDS IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS | 2007年 / 4570卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining from graph-structured data has its root in concept formation. Recent advancement of data mining techniques has broadened its applicability. Graph mining faces with subgraph isomorphism which is known to be NP-complete. Two contrasting approaches of our work on extracting frequent subgraphs are revisited, one using complete search (AGM) and the other using heuristic search (GBI). Both use canonical labelling to deal with subgraph isomorphism. AGM represents a graph by its adjacency matrix and employs an Apriori-like bottom up search algorithm using anti-monotonicity of frequency. It can handle both connected and dis-connected graphs, and has been extended to handle a tree data and a sequential data by incorporating a different bias to each in joining operators. It has also been extended to incorporate taxonomy in labels to extract generalized subgraphs. GBI employs a notion of chunking, which recursively chunks two adjoining nodes, thus generating fairly large subgraphs at an early stage of search. The recent improved version extends it to employ pseudo-chunking which is called chunkingless chunking, enabling to extract overlapping subgraphs. It can impose two kinds of constraints to accelerate search, one to include one or more of the designated subgraphs and the other to exclude all of the designated subgraphs. It has been extended to extract paths and trees from a graph data by placing a restriction on pseudo-chunking operations. GBI can further be used as a feature constructor in decision tree building. The paper explains how both GBI and AGM with their extended versions can be applied to solve various data mining problems which are difficult to solve by other methods.
引用
收藏
页码:12 / 22
页数:11
相关论文
共 50 条
[21]   A Distributed Placement Service for Graph-Structured and Tree-Structured Data [J].
Buehrer, Gregory ;
Parthasarathy, Srinivasan ;
Tatikonda, Shirish .
ACM SIGPLAN NOTICES, 2010, 45 (05) :355-356
[22]   Graph-Informed Neural Networks for Regressions on Graph-Structured Data [J].
Berrone, Stefano ;
Della Santa, Francesco ;
Mastropietro, Antonio ;
Pieraccini, Sandra ;
Vaccarino, Francesco .
MATHEMATICS, 2022, 10 (05)
[23]   Classifier construction by graph-based induction for graph-structured data [J].
Geamsakul, W ;
Matsuda, T ;
Yoshida, T ;
Motoda, H ;
Washio, T .
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2003, 2637 :52-62
[24]   GrapHisto: A Robust Representation of Graph-Structured Data for Graph Convolutional Networks [J].
Benini, Marco ;
Bongini, Pietro ;
Trentin, Edmondo .
NEURAL PROCESSING LETTERS, 2025, 57 (01)
[25]   Exploiting local similarity for indexing paths in graph-structured data [J].
Kaushik, R ;
Shenoy, P ;
Bohannon, P ;
Gudes, E .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :129-140
[26]   Labeling scheme and structural joins for graph-structured XML data [J].
Wang, HZ ;
Wang, W ;
Lin, XM ;
Li, JZ .
WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 :277-289
[27]   Expressive Languages for Path Queries over Graph-Structured Data [J].
Barcelo, Pablo ;
Libkin, Leonid ;
Lin, Anthony W. ;
Wood, Peter T. .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2012, 37 (04)
[28]   A New Reachability Query Method for Graph-structured XML Data [J].
Lu Yan ;
Ma, Funing ;
Chu, Shanzhong .
ADVANCES IN COMPUTING, CONTROL AND INDUSTRIAL ENGINEERING, 2012, 235 :394-+
[29]   Expressive Languages for Path Queries over Graph-Structured Data [J].
Barcelo, Pablo ;
Hurtado, Carlos ;
Libkin, Leonid ;
Wood, Peter .
PODS 2010: PROCEEDINGS OF THE TWENTY-NINTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2010, :3-14
[30]   Visualization and classification of graph-structured data: the case of the Enron dataset [J].
Bouveyron, Charles ;
Chipman, Hugh .
2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, :1506-1517