Strongly Local Hypergraph Diffusions for Clustering and Semi-supervised Learning

被引:11
|
作者
Liu, Meng [1 ]
Veldt, Nate [2 ]
Song, Haoyu [1 ]
Li, Pan [1 ]
Gleich, David F. [1 ]
机构
[1] Purdue Univ, W Lafayette, IN 47907 USA
[2] Cornell Univ, Ithaca, NY 14853 USA
来源
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年
关键词
hypergraph; local clustering; community detection; PageRank; GRAPHS;
D O I
10.1145/3442381.3449887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hypergraph-based machine learning methods are now widely recognized as important for modeling and using higher-order and multiway relationships between data objects. Local hypergraph clustering and semi-supervised learning specifically involve finding a well-connected set of nodes near a given set of labeled vertices. Although many methods for local graph clustering exist, there are relatively few for localized clustering in hypergraphs. Moreover, those that exist often lack flexibility to model a general class of hypergraph cut functions or cannot scale to large problems. To tackle these issues, this paper proposes a new diffusion-based hypergraph clustering algorithm that solves a quadratic hypergraph cut based objective akin to a hypergraph analog of Andersen-Chung-Lang personalized PageRank clustering for graphs. We prove that, for graphs with fixed maximum hyperedge size, this method is strongly local, meaning that its runtime only depends on the size of the output instead of the size of the hypergraph and is highly scalable. Moreover, our method enables us to compute with a wide variety of cardinality-based hypergraph cut functions. We also prove that the clusters found by solving the new objective function satisfy a Cheeger-like quality guarantee. We demonstrate that on large real-world hypergraphs our new method finds better clusters and runs much faster than existing approaches. Specifically, it runs in a few seconds for hypergraphs with a few million hyperedges compared with minutes for a flow-based technique. We furthermore show that our framework is general enough that can also be used to solve other p-norm based cut objectives on hypergraphs.
引用
收藏
页码:2092 / 2103
页数:12
相关论文
共 50 条
  • [41] Semi-Supervised Eigenvectors for Large-Scale Locally-Biased Learning
    Hansen, Toke J.
    Mahoney, Michael W.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3691 - 3734
  • [42] Self-adaptive Local Fisher Discriminant Analysis for semi-supervised image recognition
    Liu, Zhonghua
    Wang, Jingyan
    Man, Jiaju
    Li, Yongping
    You, Xinge
    Wang, Chao
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2012, 4 (04) : 338 - 356
  • [43] Self-adaptive Local Fisher Discriminant Analysis for semi-supervised image recognition
    Li, Y. (ypli@sinap.ac.cn), 1600, Inderscience Enterprises Ltd. (04): : 338 - 356
  • [44] A Compressed Sensing Based Least Squares Approach to Semi-supervised Local Cluster Extraction
    Ming-Jun Lai
    Zhaiming Shen
    Journal of Scientific Computing, 2023, 94
  • [45] A Compressed Sensing Based Least Squares Approach to Semi-supervised Local Cluster Extraction
    Lai, Ming-Jun
    Shen, Zhaiming
    JOURNAL OF SCIENTIFIC COMPUTING, 2023, 94 (03)
  • [46] Predicting individual socioeconomic status from mobile phone data: a semi-supervised hypergraph-based factor graph approach
    Tao Zhao
    Hong Huang
    Xiaoming Yao
    Jar-der Luo
    Xiaoming Fu
    International Journal of Data Science and Analytics, 2020, 9 : 361 - 372
  • [47] Predicting individual socioeconomic status from mobile phone data: a semi-supervised hypergraph-based factor graph approach
    Zhao, Tao
    Huang, Hong
    Yao, Xiaoming
    Luo, Jar-der
    Fu, Xiaoming
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (03) : 361 - 372
  • [48] MULTIVALUED LABEL DIFFUSION FOR SEMI-SUPERVISED SEGMENTATION
    Buyssens, Pierre
    Lezoray, Olivier
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 3275 - 3279
  • [49] THE COUNTERINTUITIVE MECHANISM OF GRAPH-BASED SEMI-SUPERVISED LEARNING IN THE BIG DATA REGIME
    Mai, Xiaoyi
    Couillet, Romain
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2821 - 2825
  • [50] Evolutionary Analysis of International Student Mobility Based on Complex Networks and Semi-Supervised Learning
    Cui, Mingwei
    Hu, Jun
    Wu, Peng
    Hu, Yuxia
    Zhang, Xin
    FRONTIERS IN PHYSICS, 2022, 10