Null Model-Based Data Augmentation for Graph Classification

被引:2
作者
Wang, Zeyu [1 ,2 ]
Wang, Jinhuan [1 ,2 ]
Shan, Yalu [1 ,2 ]
Yu, Shanqing [1 ,2 ]
Xu, Xiaoke [3 ,4 ]
Xuan, Qi [1 ,2 ]
Chen, Guanrong [5 ]
机构
[1] Zhejiang Univ Technol, Inst Cyberspace Secur, Coll Informat Engn, Hangzhou 310014, Peoples R China
[2] Zhejiang Univ Technol, Bingjiang Cyberspace Secur Inst ZJUT, Hangzhou 310056, Peoples R China
[3] Beijing Normal Univ, Computat Commun Res Ctr, Beijing 100875, Peoples R China
[4] Beijing Normal Univ, Sch Journalism & Commun, Beijing 100875, Peoples R China
[5] City Univ Hong Kong, Dept Elect Engn, Hong Kong 999077, Peoples R China
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2024年 / 11卷 / 02期
关键词
Data models; Biological system modeling; Data augmentation; Task analysis; Brain modeling; Feature extraction; Data mining; Null model; data augmentation; graph classification; topological feature; NEURAL-NETWORKS; TOPOLOGY;
D O I
10.1109/TNSE.2023.3332499
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Graph classification is an important task widely applied to biochemistry, social networks, and other fields. Since it is a data-dependent problem, insufficient training data will deteriorate the performance of graph classification models. To address this issue, various data augmentation methods have been proposed. However, most existing methods tend to destroy the topological features, leading to a negative impact on information propagation and semantics. While the null model generates new data with the same topological features as the original graph and helps capture the latent information based on topology. Hence, combining the null model with data augmentation for graph classification may be useful in helping models learn graph representations. This paper introduces a novel null model-based augmentation technique for graph classification. Specifically, four standard and four approximate null model-based augmentation methods are developed to verify the effectiveness of the technique. Experimental results on benchmark datasets demonstrate significant performance improvements with the proposed technique. Depending on the design mechanisms of the null models, standard augmentation methods outperform the approximate ones. These findings emphasize the critical role of non-trivial features in enhancing augmentation models for different network structures, providing a new perspective on data augmentation for studying graph classification methods.
引用
收藏
页码:1821 / 1833
页数:13
相关论文
共 65 条
[1]   Network biology:: Understanding the cell's functional organization [J].
Barabási, AL ;
Oltvai, ZN .
NATURE REVIEWS GENETICS, 2004, 5 (02) :101-U15
[2]   Hierarchical multi-label prediction of gene function [J].
Barutcuoglu, Z ;
Schapire, RE ;
Troyanskaya, OG .
BIOINFORMATICS, 2006, 22 (07) :830-836
[3]   AN IMPROVED INDEX OF CENTRALITY [J].
BEAUCHAMP, MA .
BEHAVIORAL SCIENCE, 1965, 10 (02) :161-163
[4]   A simple null model predicts the island rule [J].
Biddick, Matt ;
Burns, Kevin C. .
ECOLOGY LETTERS, 2021, 24 (08) :1646-1654
[5]   Some unique properties of eigenvector centrality [J].
Bonacich, Phillip .
SOCIAL NETWORKS, 2007, 29 (04) :555-564
[6]   Classification of topological quantum matter with symmetries [J].
Chiu, Ching-Kai ;
Teo, Jeffrey C. Y. ;
Schnyder, Andreas P. ;
Ryu, Shinsei .
REVIEWS OF MODERN PHYSICS, 2016, 88 (03)
[7]  
CHUNG F., 2002, Ann. Comb., V6, P125, DOI [10.1007/PL00012580, DOI 10.1007/PL00012580, 10.1007/pl00012580]
[8]   AutoAugment: Learning Augmentation Strategies from Data [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Mane, Dandelion ;
Vasudevan, Vijay ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :113-123
[9]  
de Lara N, 2018, Arxiv, DOI arXiv:1810.09155
[10]   STRUCTURE ACTIVITY RELATIONSHIP OF MUTAGENIC AROMATIC AND HETEROAROMATIC NITRO-COMPOUNDS - CORRELATION WITH MOLECULAR-ORBITAL ENERGIES AND HYDROPHOBICITY [J].
DEBNATH, AK ;
DECOMPADRE, RLL ;
DEBNATH, G ;
SHUSTERMAN, AJ ;
HANSCH, C .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (02) :786-797