Semi-supervised Multi-label Learning for Graph-structured Data

被引:22
作者
Song, Zixing [1 ]
Meng, Ziqiao [1 ]
Zhang, Yifei [1 ]
King, Irwin [1 ]
机构
[1] Chinese Univ Hong Kong, Sha Tin, Hong Kong, Peoples R China
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021 | 2021年
关键词
Graph Neural Networks; Multi-label Learning; Semi-supervised Learning; Graph Representation Learning; CLASSIFIERS;
D O I
10.1145/3459637.3482391
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The semi-supervised multi-label classification problem primarily deals with Euclidean data, such as text with a 1D grid of tokens and images with a 2D grid of pixels. However, the non-Euclidean graph-structured data naturally and constantly appears in semisupervised multi-label learning tasks from various domains like social networks, citation networks, and protein-protein interaction (PPI) networks. Moreover, the existing popular node embedding methods, like Graph Neural Networks (GNN), focus on graphs with simplex labels and tend to neglect label correlations in the multilabel setting, so the easy adaption proves empirically ineffective. Therefore, graph representation learning for the semi-supervised multi-label learning task is crucial and challenging. In this work, we incorporate the idea of label embedding into our proposed model to capture both network topology and higher-order multi-label correlations. The label embedding is generated along with the node embedding based on the topological structure to serve as the prototype center for each class. Moreover, the similarity of the label embedding and node embedding can be used as a confidence vector to guide the label smoothing process, formulating as a margin ranking optimization problem to learn the second-order relations between labels. Extensive experiments on real-world datasets from various domains demonstrate that our model significantly outperforms the state-of-the-art models for node-level tasks.
引用
收藏
页码:1723 / 1733
页数:11
相关论文
共 53 条
[1]   Collaborative Graph Walk for Semi-supervised Multi-Label Node Classification [J].
Akujuobi, Uchenna ;
Han Yufei ;
Zhang, Qiannan ;
Zhang, Xiangliang .
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, :1-10
[2]  
[Anonymous], 2009, P 26 ANN INT C MACH
[3]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[4]   The BioGRID interaction database:: 2008 update [J].
Breitkreutz, Bobby-Joe ;
Stark, Chris ;
Reguly, Teresa ;
Boucher, Lorrie ;
Breitkreutz, Ashton ;
Livstone, Michael ;
Oughtred, Rose ;
Lackner, Daniel H. ;
Bahler, Jurg ;
Wood, Valerie ;
Dolinski, Kara ;
Tyers, Mike .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D637-D640
[5]   Learning Community Embedding with Community Detection and Node Embedding on Graphs [J].
Cavallari, Sandro ;
Zheng, Vincent W. ;
Cai, Hongyun ;
Chang, Kevin Chen-Chuan ;
Cambria, Erik .
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :377-386
[6]   A Survey on Network Embedding [J].
Cui, Peng ;
Wang, Xiao ;
Pei, Jian ;
Zhu, Wenwu .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (05) :833-852
[7]   DeepXML: A Deep Extreme Multi-Label Learning Framework Applied to Short Text Documents [J].
Dahiya, Kunal ;
Saini, Deepak ;
Mittal, Anshul ;
Shaw, Ankush ;
Dave, Kushal ;
Soni, Akshay ;
Jain, Himanshu ;
Agarwal, Sumeet ;
Varma, Manik .
WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, :31-39
[8]  
Demsar J, 2006, J MACH LEARN RES, V7, P1
[9]   Robust Online Multilabel Learning Under Dynamic Changes in Data Distribution With Labels [J].
Du, Jie ;
Vong, Chi-Man .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (01) :374-385
[10]   Semi-supervised Graph Embedding for Multi-label Graph Node Classification [J].
Gao, Kaisheng ;
Zhang, Jing ;
Zhou, Cangqi .
WEB INFORMATION SYSTEMS ENGINEERING - WISE 2019, 2019, 11881 :555-567