A Probabilistic Framework for Relational Clustering

被引:0
|
作者
Long, Bo [1 ]
Zhang, Zhongfei [1 ]
Yu, Philip S.
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
关键词
Clustering; Relational data; Relational clustering; Semi-supervised clustering; EM-algorithm; Bregman divergences; Exponential families;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relational clustering has attracted more and more attention due to its phenomenal impact in various important applications which involve multi-type interrelated data objects, such as Web mining, search marketing, bioinformatics, citation analysis, and epidemiology. In this paper, we propose a probabilistic model for relational clustering, which also provides a principal framework to unify various important clustering tasks including traditional attributes-based clustering, semi-supervised clustering, co-clustering and graph clustering. The proposed model seeks to identify cluster structures for each type of data objects and interaction patterns between different types of objects. Under this model, we propose parametric hard and soft relational clustering algorithms under a large number of exponential family distributions. The algorithms are applicable to relational data of various structures and at the same time unifies a number of stat-of-the-art clustering algorithms: co-clustering algorithms, the k-partite graph clustering, and semi-supervised clustering based on hidden Markov random fields.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 50 条
  • [1] Probabilistic Relational Models with Clustering Uncertainty
    Coutant, Anthony
    Leray, Philippe
    Le Capitaine, Hoel
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [2] A probabilistic relational approach for web document clustering
    Fersini, E.
    Messina, E.
    Archetti, F.
    INFORMATION PROCESSING & MANAGEMENT, 2010, 46 (02) : 117 - 130
  • [3] A probabilistic framework for graph clustering
    Luo, B
    Robles-Kelly, A
    Torsello, A
    Wilson, RC
    Hancock, ER
    2001 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2001, : 912 - 919
  • [4] A generalized Bayes framework for probabilistic clustering
    Rigon, Tommaso
    Herring, Amy H.
    Dunson, David B.
    BIOMETRIKA, 2023, 110 (03) : 559 - 578
  • [5] Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework
    Arkanath Pathak
    Nikhil R. Pal
    International Journal of Fuzzy Systems, 2016, 18 : 339 - 348
  • [6] Clustering of Mixed Data by Integrating Fuzzy, Probabilistic, and Collaborative Clustering Framework
    Pathak, Arkanath
    Pal, Nikhil R.
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2016, 18 (03) : 339 - 348
  • [7] A probabilistic ranking framework for web-based relational data imputation
    Chen, Zhaoqiang
    Chen, Qun
    Li, Jiajun
    Li, Zhanhuai
    Chen, Lei
    INFORMATION SCIENCES, 2016, 355 : 152 - 168
  • [8] An NMF-framework for Unifying Posterior Probabilistic Clustering and Probabilistic Latent Semantic Indexing
    Zhang, Zhong-Yuan
    Li, Tao
    Ding, Chris
    Tang, Jie
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2014, 43 (19) : 4011 - 4024
  • [9] A unified probabilistic framework for clustering correlated heterogeneous web objects
    Liu, GW
    Zhu, WB
    Yu, Y
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 76 - 87
  • [10] Towards a model based asset deterioration framework represented by probabilistic relational models
    Zhang, Haoyuan
    Marsh, D. William R.
    SAFETY AND RELIABILITY - SAFE SOCIETIES IN A CHANGING WORLD, 2018, : 671 - 679