MIGP: Metapath Integrated Graph Prompt Neural Network

被引:0
作者
Lai, Pei-Yuan [1 ,2 ]
Dai, Qing-Yun [1 ,3 ]
Lu, Yi-Hong [4 ]
Wang, Zeng-Hui [2 ]
Chen, Man-Sheng [4 ]
Wang, Chang-Dong [3 ,4 ]
机构
[1] Guangdong Univ Technol, Sch Informat Engn, Guangzhou, Peoples R China
[2] South China Technol Commercializat Ctr, Guangzhou, Peoples R China
[3] Guangdong Prov Key Lab Intellectual Property & Big, Guangzhou, Peoples R China
[4] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
关键词
Graph prompt; Metapath; Heterogeneous graphs; Graph neural network;
D O I
10.1016/j.neunet.2024.106595
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph neural networks (GNNs) leveraging metapaths have garnered extensive utilization. Nevertheless, the escalating parameters and data corpus within graph pre-training models incur mounting training costs. Consequently, GNN models encounter hurdles including diminished generalization capacity and compromised performance amidst small sample datasets. Drawing inspiration from the efficacy demonstrated by self- supervised learning methodologies in natural language processing, we embark on an exploration. We endeavor to imbue graph data with augmentable, learnable prompt vectors targeting node representation enhancement to foster superior adaptability to downstream tasks. This paper proposes a novel approach, the Metapath Integrated Graph Prompt Neural Network (MIGP), which leverages learnable prompt vectors to enhance node representations within a pretrained model framework. By leveraging learnable prompt vectors, MIGP aims to address the limitations posed by mall sample datasets and improve GNNs' model generalization. In the pretraining stage, we split symmetric metapaths in heterogeneous graphs into short metapaths and explicitly propagate information along the metapaths to update node representations. In the prompt-tuning stage, the parameters of the pretrained model are fixed, a set of independent basis vectors is introduced, and an attention mechanism is employed to generate task-specific learnable prompt vectors for each node. Another notable contribution of our work is the introduction of three patent datasets, which is a pioneering application in related fields. We will make these three patent datasets publicly available to facilitate further research on large-scale patent data analysis. Through comprehensive experiments conducted on three patent datasets and three other public datasets, i.e., ACM, IMDB, and DBLP, we demonstrate the superior performance of the MIGP model in enhancing model applicability and performance across a variety of downstream datasets. The source code and datasets are available in the website.1 1
引用
收藏
页数:11
相关论文
共 41 条
[1]  
Brown TB, 2020, ADV NEUR IN, V33
[2]  
Busbridge D, 2019, Arxiv, DOI [arXiv:1904.05811, 10.48550/arxiv.1904.05811, DOI 10.48550/ARXIV.1904.05811]
[3]  
de Souza C. M., 2023, Journal of Information & Knowledge Management, V22
[4]   metapath2vec: Scalable Representation Learning for Heterogeneous Networks [J].
Dong, Yuxiao ;
Chawla, Nitesh V. ;
Swami, Ananthram .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :135-144
[5]  
Fang Taoran, 2023, NeurIPS
[6]   HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning [J].
Fu, Tao-yang ;
Lee, Wang-Chien ;
Lei, Zhen .
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :1797-1806
[7]   MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding [J].
Fu, Xinyu ;
Zhang, Jiani ;
Men, Ziqiao ;
King, Irwin .
WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, :2331-2341
[8]   HMSG: Heterogeneous graph neural network based on Metapath SubGraph learning [J].
Guan, Mengya ;
Cai, Xinjun ;
Shang, Jiaxing ;
Hao, Fei ;
Liu, Dajiang ;
Jiao, Xianlong ;
Ni, Wancheng .
KNOWLEDGE-BASED SYSTEMS, 2023, 279
[9]  
Hamilton WL, 2017, ADV NEUR IN, V30
[10]   GPT-GNN: Generative Pre-Training of Graph Neural Networks [J].
Hu, Ziniu ;
Dong, Yuxiao ;
Wang, Kuansan ;
Chang, Kai-Wei ;
Sun, Yizhou .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :1857-1867