A Probabilistic Framework for Relational Clustering

被引:0
|
作者
Long, Bo [1 ]
Zhang, Zhongfei [1 ]
Yu, Philip S.
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
关键词
Clustering; Relational data; Relational clustering; Semi-supervised clustering; EM-algorithm; Bregman divergences; Exponential families;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relational clustering has attracted more and more attention due to its phenomenal impact in various important applications which involve multi-type interrelated data objects, such as Web mining, search marketing, bioinformatics, citation analysis, and epidemiology. In this paper, we propose a probabilistic model for relational clustering, which also provides a principal framework to unify various important clustering tasks including traditional attributes-based clustering, semi-supervised clustering, co-clustering and graph clustering. The proposed model seeks to identify cluster structures for each type of data objects and interaction patterns between different types of objects. Under this model, we propose parametric hard and soft relational clustering algorithms under a large number of exponential family distributions. The algorithms are applicable to relational data of various structures and at the same time unifies a number of stat-of-the-art clustering algorithms: co-clustering algorithms, the k-partite graph clustering, and semi-supervised clustering based on hidden Markov random fields.
引用
收藏
页码:470 / 479
页数:10
相关论文
共 50 条
  • [31] A Decomposition Framework for Computing and Querying Multidimensional OLAP Data Cubes over Probabilistic Relational Data
    Cuzzocrea, Alfredo
    Gunopulos, Dimitrios
    FUNDAMENTA INFORMATICAE, 2014, 132 (02) : 239 - 266
  • [32] Multi-relational Clustering Based on Relational Distance
    Wei, Liting
    Li, Yun
    2015 12TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2015, : 297 - 300
  • [33] Multi-relational Clustering Based on Relational Distance
    Luan, Luan
    Li, Yun
    Yin, Jiang
    Sheng, Yan
    INTERNATIONAL CONFERENCE ON APPLIED PHYSICS AND INDUSTRIAL ENGINEERING 2012, PT C, 2012, 24 : 1982 - 1989
  • [34] Multi-relational Clustering Based on Relational Distance
    Luan, Luan
    Li, Yun
    Yin, Jiang
    Sheng, Yan
    2010 INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT (CCCM2010), VOL IV, 2010, : 47 - 50
  • [35] A PROBABILISTIC APPROACH TO CLUSTERING
    BRAILOVSKY, VL
    PATTERN RECOGNITION LETTERS, 1991, 12 (04) : 193 - 198
  • [36] Classification by probabilistic clustering
    Breuel, TM
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 1333 - 1336
  • [37] Probabilistic Fair Clustering
    Esmaeili, Seyed A.
    Brubach, Brian
    Tsepenekas, Leonidas
    Dickerson, John P.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [38] Scalable probabilistic clustering
    Bradley, PS
    Fayyad, UM
    Reina, CA
    COMPLEMENTARITY: APPLICATIONS, ALGORITHMS AND EXTENSIONS, 2001, 50 : 43 - 65
  • [39] A probabilistic theory of clustering
    Dougherty, ER
    Brun, M
    PATTERN RECOGNITION, 2004, 37 (05) : 917 - 925
  • [40] Probabilistic quantum clustering
    Casana-Eslava, Raul V.
    Lisboa, Paulo J. G.
    Ortega-Martorell, Sandra
    Jarman, Ian H.
    Martin-Guerrero, Jose D.
    KNOWLEDGE-BASED SYSTEMS, 2020, 194