Ensemble Non-negative Matrix Factorization for Clustering Biomedical Documents

被引:0
|
作者
Zhu, Shanfeng [1 ]
Yuan, Wei [1 ]
Wang, Fei [1 ]
机构
[1] Fudan Univ, Sch Comp Sci & Technol, Shanghai 200433, Peoples R China
来源
OPTIMIZATION AND SYSTEMS BIOLOGY, PROCEEDINGS | 2008年 / 9卷
关键词
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Searching and mining biomedical literature database, such as MEDLINE, is the main source of generating scientific hypothesis for biomedical researchers. Through grouping similar documents together, clustering techniques can facilitate user's need of effectively finding interested documents. Since non-negative matrix factorization (NMF) can effectively capture the latent semantic space with non-negative factorization in both the basis and the weight, it has been utilized to clustering general text documents. Considering the stochastic nature of NMF with respect to initialization, we propose to use ensemble NMF for biomedical document clustering. The performance of ensemble NMF was evaluated on clustering a large number of datasets generated from TREC Genomics track dataset. The experimental results show that our method outperforms classical clustering algorithms bisect k-means, k-means and hierarchical clustering significantly in most of the datasets.
引用
收藏
页码:358 / 364
页数:7
相关论文
共 50 条
  • [1] Enhanced clustering of biomedical documents using ensemble non-negative matrix factorization
    Huang, Xiaodi
    Zheng, Xiaodong
    Yuan, Wei
    Wang, Fei
    Zhu, Shanfeng
    INFORMATION SCIENCES, 2011, 181 (11) : 2293 - 2302
  • [2] Hybrid Online Non-negative Matrix Factorization for Clustering of Documents
    Jadhao, Vinod
    Murty, M. Narasimha
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT I, 2012, 7663 : 516 - 523
  • [3] Ensemble non-negative matrix factorization methods for clustering proteinprotein interactions
    Greene, Derek
    Cagney, Gerard
    Krogan, Nevan
    Cunningham, Padraig
    BIOINFORMATICS, 2008, 24 (15) : 1722 - 1728
  • [4] Integrating K-means and non-negative matrix factorization to ensemble document clustering
    Xu, Sen
    Lu, Zhi-Mao
    Gu, Guo-Chang
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2011, 41 (04): : 1077 - 1082
  • [5] Speaker Clustering Based on Non-negative Matrix Factorization
    Nishida, Masafumi
    Yamamoto, Seiichi
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 956 - 959
  • [6] Biased unconstrained non-negative matrix factorization for clustering
    Deng, Ping
    Zhang, Fan
    Li, Tianrui
    Wang, Hongjun
    Horng, Shi-Jinn
    KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [7] Document clustering based on spectral clustering and non-negative matrix factorization
    Bao, Lei
    Tang, Sheng
    Li, Jintao
    Zhang, Yongdong
    Ye, Wei-Ping
    NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 149 - +
  • [8] Non-Negative Matrix Factorization With Dual Constraints for Image Clustering
    Yang, Zuyuan
    Zhang, Yu
    Xiang, Yong
    Yan, Wei
    Xie, Shengli
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (07): : 2524 - 2533
  • [9] Non-negative matrix factorization by maximizing correntropy for cancer clustering
    Wang, Jim Jing-Yan
    Wang, Xiaolei
    Gao, Xin
    BMC BIOINFORMATICS, 2013, 14
  • [10] Document Clustering with Cluster Refinement and Non-negative Matrix Factorization
    Park, Sun
    An, Dong Un
    Char, ByungRea
    Kim, Chul-Won
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 281 - +