Graph Out-of-Distribution Generalization With Controllable Data Augmentation

被引:0
|
作者
Lu, Bin [1 ]
Zhao, Ze [1 ]
Gan, Xiaoying [1 ]
Liang, Shiyu [2 ]
Fu, Luoyi [3 ]
Wang, Xinbing [1 ]
Zhou, Chenghu [4 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[4] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100045, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Out-of-distribution generalization; graph neural network; domain generalization; data augmentation;
D O I
10.1109/TKDE.2024.3393109
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph Neural Network (GNN) has demonstrated extraordinary performance in classifying graph properties. However, due to the selection bias of training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe hybrid structure distribution shift of both scale and density, despite of one-sided biased data partition. The spurious correlations over hybrid distribution deviation degrade the performance of previous GNN methods and show large instability among different datasets. To alleviate this problem, we propose OOD-GMixup to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Second, we generate virtual samples with perturbation on graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world datasets on graph classification demonstrate the superiority of our proposed method over state-of-the-art baselines.
引用
收藏
页码:6317 / 6329
页数:13
相关论文
共 50 条
  • [31] Few-Shot Causal Representation Learning for Out-of-Distribution Generalization on Heterogeneous Graphs
    Ding, Pengfei
    Wang, Yan
    Liu, Guanfeng
    Wang, Nan
    Zhou, Xiaofang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (04) : 1804 - 1818
  • [32] Out-of-Distribution Representation and Graph Neural Network Fusion Learning for ECG Biometrics
    Ma, Tianbang
    Huang, Yuwen
    Yi, Ran
    Yang, Gongping
    Yin, Yilong
    IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2025, 7 (02): : 225 - 233
  • [33] MixOOD: Improving Out-of-distribution Detection with Enhanced Data Mixup
    Yang, Taocun
    Huang, Yaping
    Xie, Yanlin
    Liu, Junbo
    Wang, Shengchun
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (05)
  • [34] NeuralOOD: Improving out-of-distribution generalization performance with brain-machine fusion learning framework
    Zhao, Shuangchen
    Du, Changde
    Li, Jingze
    Li, Hui
    He, Huiguang
    INFORMATION FUSION, 2025, 119
  • [35] On the Usage of Continual Learning for Out-of-Distribution Generalization in Pre-trained Language Models of Code
    Weyssow, Martin
    Zhou, Xin
    Kim, Kisub
    Lo, David
    Sahraoui, Houari
    PROCEEDINGS OF THE 31ST ACM JOINT MEETING EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, ESEC/FSE 2023, 2023, : 1470 - 1482
  • [36] IW-ViT: Independence-Driven Weighting Vision Transformer for out-of-distribution generalization
    Liu, Weifeng
    Yu, Haoran
    Wang, Yingjie
    Liu, Baodi
    Tao, Dapeng
    Chen, Honglong
    PATTERN RECOGNITION, 2025, 161
  • [37] Unifying invariant and variant features for graph out-of-distribution via probability of necessity and sufficiency
    Chen, Xuexin
    Cai, Ruichu
    Zheng, Kaitao
    Jiang, Zhifan
    Huang, Zhengting
    Hao, Zhifeng
    Li, Zijian
    NEURAL NETWORKS, 2025, 184
  • [38] PEGNN: A physics embedded graph neural network for out-of-distribution temperature field reconstruction
    Li, Qiao
    Li, Xingchen
    Chen, Xiaoqian
    Yao, Wen
    INTERNATIONAL JOURNAL OF THERMAL SCIENCES, 2025, 207
  • [39] Exploring Out-of-Distribution Generalization in Text Classifiers Trained on Tobacco-3482 and RVL-CDIP
    Larson, Stefan
    Singh, Navtej
    Maheshwari, Saarthak
    Stewart, Shanti
    Krishnaswamy, Uma
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 416 - 423
  • [40] Test-Time Image-to-Image Translation Ensembling Improves Out-of-Distribution Generalization in Histopathology
    Scalbert, Marin
    Vakalopoulou, Maria
    Couzinie-Devy, Florent
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT II, 2022, 13432 : 120 - 129