Graph Out-of-Distribution Generalization With Controllable Data Augmentation

被引:0
|
作者
Lu, Bin [1 ]
Zhao, Ze [1 ]
Gan, Xiaoying [1 ]
Liang, Shiyu [2 ]
Fu, Luoyi [3 ]
Wang, Xinbing [1 ]
Zhou, Chenghu [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[4] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100045, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Out-of-distribution generalization; graph neural network; domain generalization; data augmentation;
D O I
10.1109/TKDE.2024.3393109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe hybrid structure distribution shift of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose OOD-GMixup to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Second, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines.
引用
收藏
页码:6317 / 6329
页数:13
相关论文
共 50 条
  • [1] DIVE: Subgraph Disagreement for Graph Out-of-Distribution Generalization
    Sun, Xin
    Wang, Liang
    Liu, Qiang
    Wu, Shu
    Wang, Zilei
    Wang, Liang
    PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, : 2794 - 2805
  • [2] Learning Invariant Graph Representations for Out-of-Distribution Generalization
    Li, Haoyang
    Zhang, Ziwei
    Wang, Xin
    Zhu, Wenwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [3] Selecting Augmentation Methods for Domain Generalization and Out-of-Distribution Detection Using Unlabeled Data
    Kucuktas, Ulku Tuncer
    Uysal, Fatih
    Hardalac, Firat
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [4] Individual and Structural Graph Information Bottlenecks for Out-of-Distribution Generalization
    Yang, Ling
    Zheng, Jiayi
    Wang, Heyuan
    Liu, Zhongyi
    Huang, Zhilin
    Hong, Shenda
    Zhang, Wentao
    Cui, Bin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (02) : 682 - 693
  • [5] Graph out-of-distribution generalization through contrastive learning paradigm
    Du, Hongyi
    Li, Xuewei
    Shao, Minglai
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [6] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [7] Negative as Positive: Enhancing Out-of-distribution Generalization for Graph Contrastive Learning
    Wang, Zixu
    Xu, Bingbing
    Yuan, Yige
    Shen, Huawei
    Cheng, Xueqi
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2548 - 2552
  • [8] Environment-Aware Dynamic Graph Learning for Out-of-Distribution Generalization
    Yuan, Haonan
    Sun, Qingyun
    Fu, Xingcheng
    Zhang, Ziwei
    Ji, Cheng
    Peng, Hao
    Li, Jianxin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [9] DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation
    Bai, Haoyue
    Sun, Rui
    Hong, Lanqing
    Zhou, Fengwei
    Ye, Nanyang
    Ye, Han-Jia
    Chan, S-H Gary
    Li, Zhenguo
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 6705 - 6713
  • [10] Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
    Ramachandran, Sai Niranjan
    Mukhopadhyay, Rudrabha
    Agarwal, Madhav
    Jawahar, C. V.
    Namboodiri, Vinay
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14767 - 14775