A Fused Multi-feature Based Co-training Approach For Document Clustering

被引:4
|
作者
Wang, Yuanqing [1 ]
Wang, Wenjun [1 ]
Dai, Weidi [1 ]
Jiao, Pengfei [1 ]
Yu, Wei [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
关键词
multi-feature; co-training; document clustering; spectral clustering;
D O I
10.1109/ICISCE.2016.19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document clustering is a popular topic in data mining and information retrieval. Most models and methods for this problem are based on computing the similarity between pair documents modeled in a space of all terms, or a new feature space obtained by applying a topic modeling technique for a given corpus. In this paper, we regard these two ideas as clustering on term feature and on semantic feature, and have an assumption that they can contribute to each other in clustering. Also, we propose a co-training approach for spectral clustering taking two features into account. Experiments on four real-world datasets show the feasibility and efficacy of our proposed approach compared with a number of the baseline methods.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [1] A Co-training Approach for Multi-view Density Peak Clustering
    Ling, Yu
    He, Jinrong
    Ren, Silin
    Pan, Heng
    He, Guoliang
    PATTERN RECOGNITION AND COMPUTER VISION, PT III, 2018, 11258 : 503 - 513
  • [2] Multi-view Document Classification with Co-training
    Sevim, Semih
    Ekinci, Ekin
    Ilhan Omurca, Sevinc
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [3] GUIDED CO-TRAINING FOR MULTI-VIEW SPECTRAL CLUSTERING
    Lee, Chung-Kuei
    Liu, Tyng-Luh
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4042 - 4046
  • [4] Sea-land Segmentation in SAR Images Based on Multi-feature Fused Boundary Clustering
    Wu, Kejiang
    Xu, Xiaojian
    TARGET AND BACKGROUND SIGNATURES IV, 2018, 10794
  • [5] A subspace co-training framework for multi-view clustering
    Zhao, Xuran
    Evans, Nicholas
    Dugelay, Jean-Luc
    PATTERN RECOGNITION LETTERS, 2014, 41 : 73 - 82
  • [6] Video Object Segmentation Based on Multi-Feature Clustering
    Hu, Shuangyan
    Li, Junshan
    Li, Xuhui
    Huang, Baigang
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5946 - 5949
  • [7] Clustering with Extended Constraints by Co-Training
    Okabe, Masayuki
    Yamada, Seiji
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY WORKSHOPS (WI-IAT WORKSHOPS 2012), VOL 3, 2012, : 79 - 82
  • [8] Product feature extraction with co-training
    Wu, Xing
    He, Zhongshi
    Huang, Yongwen
    Journal of Information and Computational Science, 2009, 6 (03): : 1515 - 1523
  • [9] Gesture Detection and Recognition Fused with Multi-feature Based on Kinect
    Sang, Haifeng
    Li, Wei
    BIOMETRIC RECOGNITION, CCBR 2015, 2015, 9428 : 597 - 606
  • [10] A Document Image Retrieval Method Based on Multi-Feature Fusion
    Zhu, Zhiyuan
    Ren, Dongchun
    Zhou, Guangyou
    Zhou, Yin
    2016 INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC), 2016, : 306 - 311