Graph Out-of-Distribution Generalization With Controllable Data Augmentation

Cited by: 0

Authors:

Lu, Bin [1]

Zhao, Ze [1]

Gan, Xiaoying [1]

Liang, Shiyu [2]

Fu, Luoyi [3]

Wang, Xinbing [1]

Zhou, Chenghu [4]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[2] Shanghai Jiao Tong Univ, John Hopcroft Ctr Comp Sci, Shanghai 200240, Peoples R China

[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China

[4] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, Beijing 100045, Peoples R China

Source:

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2024 / Vol. 36 / No. 11

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

Out-of-distribution generalization; graph neural network; domain generalization; data augmentation;

DOI:

10.1109/TKDE.2024.3393109

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Graph Neural Networks (GNNs) have demonstrated extraordinary performance in classifying graph properties. However, due to selection bias between training and testing data (e.g., training on small graphs and testing on large graphs, or training on dense graphs and testing on sparse graphs), distribution deviation is widespread. More importantly, we often observe a hybrid structure distribution shift in both scale and density, even when the data partition is biased along only one dimension. The spurious correlations arising from this hybrid distribution deviation degrade the performance of previous GNN methods and make them highly unstable across datasets. To alleviate this problem, we propose OOD-GMixup to jointly manipulate the training distribution with controllable data augmentation in metric space. Specifically, we first extract the graph rationales to eliminate the spurious correlations due to irrelevant information. Second, we generate virtual samples with perturbations in the graph rationale representation domain to obtain potential OOD training samples. Finally, we propose OOD calibration to measure the distribution deviation of virtual samples by leveraging Extreme Value Theory, and further actively control the training distribution by emphasizing the impact of virtual OOD samples. Extensive studies on several real-world graph classification datasets demonstrate the superiority of our proposed method over state-of-the-art baselines.


Pages: 6317 - 6329

Page count: 13

