Graph Out-of-Distribution Generalization With Controllable Data Augmentation

被引:0
|
作者
Lu, Bin [1 ]
Zhao, Ze [1 ]
Gan, Xiaoying [1 ]
Liang, Shiyu [2 ]
Fu, Luoyi [3 ]
Wang, Xinbing [1 ]
Zhou, Chenghu [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[4] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100045, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Out-of-distribution generalization; graph neural network; domain generalization; data augmentation;
D O I
10.1109/TKDE.2024.3393109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe hybrid structure distribution shift of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose OOD-GMixup to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Second, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines.
引用
收藏
页码:6317 / 6329
页数:13
相关论文
共 50 条
  • [21] IMPROVING ROBUSTNESS TO OUT-OF-DISTRIBUTION DATA BY FREQUENCY-BASED AUGMENTATION
    Mukai, Koki
    Kumano, Soichiro
    Yamasaki, Toshihiko
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3116 - 3120
  • [22] GOOD: A Graph Out-of-Distribution Benchmark
    Gui, Shurui
    Li, Xiner
    Wang, Limei
    Ji, Shuiwang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [23] Out-of-distribution generalization for learning quantum dynamics
    Caro, Matthias C.
    Huang, Hsin-Yuan
    Ezzell, Nicholas
    Gibbs, Joe
    Sornborger, Andrew T.
    Cincio, Lukasz
    Coles, Patrick J.
    Holmes, Zoe
    NATURE COMMUNICATIONS, 2023, 14 (01)
  • [24] On the Adversarial Robustness of Out-of-distribution Generalization Models
    Zou, Xin
    Liu, Weiwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [25] Assaying Out-Of-Distribution Generalization in Transfer Learning
    Wenzel, Florian
    Dittadi, Andrea
    Gehler, Peter
    Simon-Gabriel, Carl-Johann
    Horn, Max
    Zietlow, Dominik
    Kernert, David
    Russell, Chris
    Brox, Thomas
    Schiele, Bernt
    Scholkopf, Bernhard
    Locatello, Francesco
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [26] On the Out-of-distribution Generalization of Probabilistic Image Modelling
    Zhang, Mingtian
    Zhang, Andi
    McDonagh, Steven
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Out-of-distribution Generalization and Its Applications for Multimedia
    Wang, Xin
    Cui, Peng
    Zhu, Wenwu
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5681 - 5682
  • [28] Out-of-Distribution Generalization With Causal Feature Separation
    Wang, Haotian
    Kuang, Kun
    Lan, Long
    Wang, Zige
    Huang, Wanrong
    Wu, Fei
    Yang, Wenjing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (04) : 1758 - 1772
  • [29] A Stable Vision Transformer for Out-of-Distribution Generalization
    Yu, Haoran
    Liu, Baodi
    Wang, Yingjie
    Zhang, Kai
    Tao, Dapeng
    Liu, Weifeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 328 - 339
  • [30] Diverse Weight Averaging for Out-of-Distribution Generalization
    Rame, Alexandre
    Kirchmeyer, Matthieu
    Rahier, Thibaud
    Rakotomamonjy, Alain
    Gallinari, Patrick
    Cord, Matthieu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,