Research on the Text Data Mining and Clustering Algorithm based on the Bayesian Theory and Markov Random Fields

被引:0
|
作者
Sheng, Xiaobao [1 ]
Luo, Yimin [1 ]
Cao, Xiaodong [1 ]
Tao, Junjie [1 ]
机构
[1] Minist Publ Secur, Res Inst 3, Shanghai 200031, Peoples R China
关键词
Text Data Mining; Bayesian Theory; Markov Random Fields; Pattern Clustering;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this paper, we research on the text data mining and clustering algorithm based on Bayesian theory and Markov random fields. Clustering analysis is an important means of realization of text mining. However, the text data with high dimension, sparse data distribution and the core characteristics of different categories of overlapping important characteristics such as the great challenges for practical applications. In recent years, the soft subspace clustering technology has become the focus in the academic circles. Its basic idea is to give the data set of all kinds of don't give the characteristics of the different weight vectors, used to represent the clustering process in the contribution of each feature in this category. Our algorithm combines the advances of the Bayesian theory and Markov random fields to help enhance the overall performance.
引用
收藏
页码:136 / 141
页数:6
相关论文
共 50 条
  • [41] Research on the Parallelization of the DBSCAN Clustering Algorithm for Spatial Data Mining Based on the Spark Platform
    Huang, Fang
    Zhu, Qiang
    Zhou, Ji
    Tao, Jian
    Zhou, Xiaocheng
    Jin, Du
    Tan, Xicheng
    Wang, Lizhe
    REMOTE SENSING, 2017, 9 (12)
  • [42] An ant-based clustering algorithm in data mining
    Tang, Y
    Ma, YK
    SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1101 - 1105
  • [43] Bayesian Inference in Hidden Markov Random Fields for Binary Data Defined on Large Lattices
    Friel, N.
    Pettitt, A. N.
    Reeves, R.
    Wit, E.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (02) : 243 - 261
  • [44] A Fuzzy Clustering Algorithm of Data Mining Based on IWO
    Zhao Xiao-qiang
    Zhou Jin-Hu
    Yang Jia-Min
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7988 - 7993
  • [45] A clustering algorithm for data mining based on swarm intelligence
    Jin, Peng
    Zhu, Vun-Long
    Hu, Kun-Yuan
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 803 - 807
  • [46] Analysis and Application of Data Mining Based on Clustering Algorithm
    Lai Honghui
    Lai Xiao Tao
    PROCEEDINGS OF THE 2015 INFORMATION TECHNOLOGY AND MECHATRONICS ENGINEERING CONFERENCE, 2015, 7 : 129 - 133
  • [47] An algorithm for time series data mining based on clustering
    Wu, Shaozhi
    Wu, Yue
    Wang, Ying
    Ye, Yalan
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 2155 - +
  • [48] Unsupervised clustering of PolSAR data using Polarimetric G Distribution and Markov Random Fields
    Khan, Salman
    Doulgeris, Anthony Paul
    10TH EUROPEAN CONFERENCE ON SYNTHETIC APERTURE RADAR (EUSAR 2014), 2014,
  • [49] Text/non-text ink stroke classification in Japanese handwriting based on Markov random fields
    Zhou, Xiang-Dong
    Liu, Cheng-Lin
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 377 - 381
  • [50] Gaussian Markov random fields: Theory and applications
    Congdon, Peter
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2007, 170 : 858 - 858