Graph Out-of-Distribution Generalization With Controllable Data Augmentation

被引:0
|
作者
Lu, Bin [1 ]
Zhao, Ze [1 ]
Gan, Xiaoying [1 ]
Liang, Shiyu [2 ]
Fu, Luoyi [3 ]
Wang, Xinbing [1 ]
Zhou, Chenghu [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[4] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100045, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Out-of-distribution generalization; graph neural network; domain generalization; data augmentation;
D O I
10.1109/TKDE.2024.3393109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe hybrid structure distribution shift of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose OOD-GMixup to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Second, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines.
引用
收藏
页码:6317 / 6329
页数:13
相关论文
共 50 条
  • [31] Counterfactual Active Learning for Out-of-Distribution Generalization
    Deng, Xun
    Wang, Wenjie
    Feng, Fuli
    Zhang, Hanwang
    He, Xiangnan
    Liao, Yong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11362 - 11377
  • [32] Out-of-distribution Generalization with Causal Invariant Transformations
    Wang, Ruoyu
    Yi, Mingyang
    Chen, Zhitang
    Zhu, Shengyu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 375 - 385
  • [33] Deep Stable Learning for Out-Of-Distribution Generalization
    Zhang, Xingxuan
    Cui, Peng
    Xu, Renzhe
    Zhou, Linjun
    He, Yue
    Shen, Zheyan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 5368 - 5378
  • [34] Out-of-Distribution Generalization via Risk Extrapolation
    Krueger, David
    Caballero, Ethan
    Jacobsen, Joern-Henrik
    Zhang, Amy
    Binas, Jonathan
    Zhang, Dinghuai
    Le Priol, Remi
    Courville, Aaron
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [35] Out-of-distribution generalization for learning quantum dynamics
    Matthias C. Caro
    Hsin-Yuan Huang
    Nicholas Ezzell
    Joe Gibbs
    Andrew T. Sornborger
    Lukasz Cincio
    Patrick J. Coles
    Zoë Holmes
    Nature Communications, 14
  • [36] Towards a Theoretical Framework of Out-of-Distribution Generalization
    Ye, Haotian
    Xie, Chuanlong
    Cai, Tianle
    Li, Ruichen
    Li, Zhenguo
    Wang, Liwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
  • [37] The Value of Out-of-Distribution Data
    De Silva, Ashwin
    Ramesh, Rahul
    Priebe, Carey E.
    Chaudhari, Pratik
    Vogelstein, Joshua T.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [38] Graph Convolution for Semi-Supervised Classification: Improved Linear Separability and Out-of-Distribution Generalization
    Baranwal, Aseem
    Fountoulakis, Kimon
    Jagannath, Aukosh
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [39] Toward Out-of-Distribution Generalization Through Inductive Biases
    Moruzzi, Caterina
    PHILOSOPHY AND THEORY OF ARTIFICIAL INTELLIGENCE 2021, 2022, 63 : 57 - 66
  • [40] Discovering causally invariant features for out-of-distribution generalization
    Wang, Yujie
    Yu, Kui
    Xiang, Guodu
    Cao, Fuyuan
    Liang, Jiye
    PATTERN RECOGNITION, 2024, 150