Imbalanced node classification with Graph Neural Networks: A unified approach leveraging homophily and label information

被引:1
|
作者
Lv, Dingyang [1 ]
Xu, Zhengjia [2 ]
Zhang, Jinghui [1 ]
Wang, Yuchen [1 ]
Dong, Fang [1 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 211189, Jiangsu, Peoples R China
[2] Southeast Univ, Coll Software Engn, Nanjing 211189, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Unbalanced classification; Low homophily; Graph neural networks; Label utilization; Representation learning;
D O I
10.1016/j.asoc.2023.110985
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The homophily assumption in graph theory posits that nodes with similar characteristics have a higher tendency to form connections. This principle has rendered Graph Neural Networks (GNNs) as vital tools for graph representation learning. However, many real-world graphs may exhibit a phenomenon often termed as neighbor class imbalance, which is characterized by frequent connections between dissimilar nodes, a scenario reflecting low homophily. Classical GNNs tend to overlook this issue, leading to a significant decline in performance. Prior research has attempted to address this challenge by employing high-order neighborhoods and filtering out dissimilar neighbors, yet they have paid little attention to homophily degree estimation and label utilization. In this work, we initially explore the performance of classical GNNs on a synthetic graph with varying homophily degrees, designated as SynG-N. Following this, we introduce a novel method, HLA-GNN, which integrates homophily degree estimation and label utilization to enhance classical GNNs. The degrees of homophily between node pairs are estimated using a limited set of ground-truth labels, which can be integrated into classic GNNs to guide the message aggregation process. Drawing on the label propagation algorithm, we combine the partially observed class labels to enhance the original feature space. Here, the observed class labels are randomly masked as a feature augmentation and training signal. Our experimental results on eight datasets with varying degrees of homophily underscore the effectiveness of our method. HLA-GNN achieves a 12.69%similar to 34.19% improvement on low-homophily graphs, while maintaining competitive results in homophilous settings.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Boosting-GNN: Boosting Algorithm for Graph Networks on Imbalanced Node Classification
    Shi, Shuhao
    Qiao, Kai
    Yang, Shuai
    Wang, Linyuan
    Chen, Jian
    Yan, Bin
    FRONTIERS IN NEUROROBOTICS, 2021, 15
  • [22] Graph alternate learning for robust graph neural networks in node classification
    Zhang, Baoliang
    Guo, Xiaoxin
    Tu, Zhenchuan
    Zhang, Jia
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (11): : 8723 - 8735
  • [23] Graph alternate learning for robust graph neural networks in node classification
    Baoliang Zhang
    Xiaoxin Guo
    Zhenchuan Tu
    Jia Zhang
    Neural Computing and Applications, 2022, 34 : 8723 - 8735
  • [24] Neural Networks Learn Specified Information for Imbalanced Data Classification
    Huang, Zhan Ao
    Sang, Yongsheng
    Sun, Yanan
    Lv, Jiancheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (11) : 6719 - 6730
  • [25] Unified Robust Training for Graph Neural Networks Against Label Noise
    Li, Yayong
    Yin, Jie
    Chen, Ling
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 528 - 540
  • [26] A graph neural network-based node classification model on class-imbalanced graph data
    Huang, Zhenhua
    Tang, Yinhao
    Chen, Yunwen
    KNOWLEDGE-BASED SYSTEMS, 2022, 244
  • [27] MAPPING: debiasing graph neural networks for fair node classification with limited sensitive information leakage
    Song, Ying
    Palanisamy, Balaji
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (06):
  • [28] NHSH: Graph Hybrid Learning with Node Homophily and Spectral Heterophily for Node Classification
    Liu, Kang
    Dai, Wenqing
    Liu, Xunyuan
    Kang, Mengtao
    Ji, Runshi
    SYMMETRY-BASEL, 2025, 17 (01):
  • [29] Multi-label classification with imbalanced classes by fuzzy deep neural networks
    Succetti, Federico
    Rosato, Antonello
    Panella, Massimo
    INTEGRATED COMPUTER-AIDED ENGINEERING, 2025, 32 (01) : 23 - 36
  • [30] Classification optimization node injection attack on graph neural networks
    Ma, Mingda
    Xia, Hui
    Li, Xin
    Zhang, Rui
    Xu, Shuo
    KNOWLEDGE-BASED SYSTEMS, 2024, 301