A Text Clustering Algorithm Based on Simplified Cluster Hypothesis

Authors
Sun Yuan [1 ,2 ]
Guo Wenbin [1 ,2 ]
Affiliations
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
[2] Natl Language Resource & Monitoring Res Ctr, Minor Languages Branch, Beijing 100081, Peoples R China
Source
2013 2ND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND MEASUREMENT, SENSOR NETWORK AND AUTOMATION (IMSNA) | 2013
Keywords
component; VSM; feature vector optimization; text similarity; text clustering
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Quickly and efficiently determining the subject category of documents in a large body of text is becoming an important challenge in text clustering. This paper proposes the One-Next text clustering algorithm, which is based on the simplified cluster hypothesis. It also designs a feature vector optimization method that uses a grading feature vector extraction method. The experimental results show that this method achieves high precision and a high F value, and that its algorithmic complexity is lower than that of other text clustering methods.
Pages: 412-415
Page count: 4