Strongly Local Hypergraph Diffusions for Clustering and Semi-supervised Learning

被引:11
|
作者
Liu, Meng [1 ]
Veldt, Nate [2 ]
Song, Haoyu [1 ]
Li, Pan [1 ]
Gleich, David F. [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Cornell Univ, Ithaca, NY 14853 USA
来源
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年
关键词
hypergraph; local clustering; community detection; PageRank; GRAPHS;
D O I
10.1145/3442381.3449887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hypergraph-based machine learning methods are now widely recognized as important for modeling and using higher-order and multiway relationships between data objects. Local hypergraph clustering and semi-supervised learning specifically involve finding a well-connected set of nodes near a given set of labeled vertices. Although many methods for local graph clustering exist, there are relatively few for localized clustering in hypergraphs. Moreover, those that exist often lack flexibility to model a general class of hypergraph cut functions or cannot scale to large problems. To tackle these issues, this paper proposes a new diffusion-based hypergraph clustering algorithm that solves a quadratic hypergraph cut based objective akin to a hypergraph analog of Andersen-Chung-Lang personalized PageRank clustering for graphs. We prove that, for graphs with fixed maximum hyperedge size, this method is strongly local, meaning that its runtime only depends on the size of the output instead of the size of the hypergraph and is highly scalable. Moreover, our method enables us to compute with a wide variety of cardinality-based hypergraph cut functions. We also prove that the clusters found by solving the new objective function satisfy a Cheeger-like quality guarantee. We demonstrate that on large real-world hypergraphs our new method finds better clusters and runs much faster than existing approaches. Specifically, it runs in a few seconds for hypergraphs with a few million hyperedges compared with minutes for a flow-based technique. We furthermore show that our framework is general enough that can also be used to solve other p-norm based cut objectives on hypergraphs.
引用
收藏
页码:2092 / 2103
页数:12
相关论文
共 50 条
  • [21] Distributed and Asynchronous Methods for Semi-supervised Learning
    Avrachenkov, Konstantin
    Borkar, Vivek S.
    Saboo, Krishnakant
    ALGORITHMS AND MODELS FOR THE WEB GRAPH, WAW 2016, 2016, 10088 : 34 - 46
  • [22] Overlapping coefficient in network-based semi-supervised clustering
    Conversano, Claudio
    Frigau, Luca
    Contu, Giulia
    COMPUTATIONAL STATISTICS, 2024, 39 (07) : 3831 - 3854
  • [23] SEMI-SUPERVISED HYPERSPECTRAL BAND SELECTION VIA SPARSE LINEAR REGRESSION AND HYPERGRAPH MODELS
    Guo, Zhouxiao
    Yang, Haichuan
    Bai, Xiao
    Zhang, Zhihong
    Zhou, Jun
    2013 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2013, : 1474 - 1477
  • [24] Penalized Flow Hypergraph Local Clustering
    Zhong, Hao
    Zhang, Yubo
    Yan, Chenggang
    Xuan, Zuxing
    Yu, Ting
    Zhang, Ji
    Ying, Shihui
    Gao, Yue
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (05) : 2110 - 2125
  • [25] Semi-supervised learning in unbalanced networks with heterogeneous degree
    Li, Ting
    Ying, Ningchen
    Yu, Xianshi
    Jing, Bing-Yi
    STATISTICS AND ITS INTERFACE, 2024, 17 (03) : 501 - 516
  • [26] EMPIRICAL STATIONARY CORRELATIONS FOR SEMI-SUPERVISED LEARNING ON GRAPHS
    Xu, Ya
    Dyer, Justin S.
    Owen, Art B.
    ANNALS OF APPLIED STATISTICS, 2010, 4 (02) : 589 - 614
  • [27] Adaptive Graph Learning for Semi-supervised Classification of GCNs
    Wan, Yingying
    Zhan, Mengmeng
    Li, Yangding
    DATABASES THEORY AND APPLICATIONS (ADC 2021), 2021, 12610 : 13 - 22
  • [28] Hypergraph based semi-supervised support vector machine for binary and multi-category classifications
    Sun, Yuting
    Ding, Shifei
    Zhang, ZiChen
    Zhang, Chenglong
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (05) : 1369 - 1386
  • [29] Motif-Based Hypergraph Convolution Network for Semi-Supervised Node Classification on Heterogeneous Graph
    Wu Y.
    Wang Y.
    Wang X.
    Xu Z.-X.
    Li L.-N.
    Jisuanji Xuebao/Chinese Journal of Computers, 2021, 44 (11): : 2248 - 2260
  • [30] Hypergraph based semi-supervised support vector machine for binary and multi-category classifications
    Yuting Sun
    Shifei Ding
    ZiChen Zhang
    Chenglong Zhang
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1369 - 1386