Directional co-clustering

被引:0
|
作者
Aghiles Salah
Mohamed Nadif
机构
[1] SIS,
[2] Singapore Management University,undefined
[3] LIPADE,undefined
[4] Paris Descartes University,undefined
来源
Advances in Data Analysis and Classification | 2019年 / 13卷
关键词
Co-clustering; Directional data; von Mises-Fisher distribution; EM algorithm; Document clustering; Main 62H30; Secondary 62H11;
D O I
暂无
中图分类号
学科分类号
摘要
Co-clustering addresses the problem of simultaneous clustering of both dimensions of a data matrix. When dealing with high dimensional sparse data, co-clustering turns out to be more beneficial than one-sided clustering even if one is interested in clustering along one dimension only. Aside from being high dimensional and sparse, some datasets, such as document-term matrices, exhibit directional characteristics, and the L2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_2$$\end{document} normalization of such data, so that it lies on the surface of a unit hypersphere, is useful. Popular co-clustering assumptions such as Gaussian or Multinomial are inadequate for this type of data. In this paper, we extend the scope of co-clustering to directional data. We present Diagonal Block Mixture of Von Mises–Fisher distributions (dbmovMFs), a co-clustering model which is well suited for directional data lying on a unit hypersphere. By setting the estimate of the model parameters under the maximum likelihood (ML) and classification ML approaches, we develop a class of EM algorithms for estimating dbmovMFs from data. Extensive experiments, on several real-world datasets, confirm the advantage of our approach and demonstrate the effectiveness of our algorithms.
引用
收藏
页码:591 / 620
页数:29
相关论文
共 50 条
  • [1] Directional co-clustering
    Salah, Aghiles
    Nadif, Mohamed
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 591 - 620
  • [2] Regularized bi-directional co-clustering
    Affeldt, Severine
    Labiod, Lazhar
    Nadif, Mohamed
    STATISTICS AND COMPUTING, 2021, 31 (03)
  • [3] Regularized bi-directional co-clustering
    Séverine Affeldt
    Lazhar Labiod
    Mohamed Nadif
    Statistics and Computing, 2021, 31
  • [4] Co-clustering directed graphs to discover asymmetries and directional communities
    Rohe, Karl
    Qin, Tai
    Yu, Bin
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (45) : 12679 - 12684
  • [5] Joint co-clustering: Co-clustering of genomic and clinical bioimaging data
    Ficarra, Elisa
    De Micheli, Giovanni
    Yoon, Sungroh
    Benini, Luca
    Macii, Enrico
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (05) : 938 - 949
  • [6] Bayesian Co-clustering
    Shan, Hanhuai
    Banerjee, Arindam
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 530 - 539
  • [7] A Survey of Co-Clustering
    Wang, Hongjun
    Song, Yi
    Chen, Wei
    Luo, Zhipeng
    Li, Chongshou
    Li, Tianrui
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (09)
  • [8] Co-Clustering on Manifolds
    Gu, Quanquan
    Zhou, Jie
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 359 - 367
  • [9] Bayesian co-clustering
    Domeniconi, Carlotta
    Laskey, Kathryn
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2015, 7 (05) : 347 - 356
  • [10] Spectral co-clustering ensemble
    Huang, Shudong
    Wang, Hongjun
    Li, Dingcheng
    Yang, Yan
    Li, Tianrui
    KNOWLEDGE-BASED SYSTEMS, 2015, 84 : 46 - 55