A Fused Multi-feature Based Co-training Approach For Document Clustering

被引:4
|
作者
Wang, Yuanqing [1 ]
Wang, Wenjun [1 ]
Dai, Weidi [1 ]
Jiao, Pengfei [1 ]
Yu, Wei [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
关键词
multi-feature; co-training; document clustering; spectral clustering;
D O I
10.1109/ICISCE.2016.19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document clustering is a popular topic in data mining and information retrieval. Most models and methods for this problem are based on computing the similarity between pair documents modeled in a space of all terms, or a new feature space obtained by applying a topic modeling technique for a given corpus. In this paper, we regard these two ideas as clustering on term feature and on semantic feature, and have an assumption that they can contribute to each other in clustering. Also, we propose a co-training approach for spectral clustering taking two features into account. Experiments on four real-world datasets show the feasibility and efficacy of our proposed approach compared with a number of the baseline methods.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [21] Co-training Approach for Teacher Evaluation
    尹哲峰
    崔荣一
    延边大学学报(自然科学版), 2009, (02) : 167 - 170
  • [22] Multi-feature Spectral Clustering with Minimax Optimization
    Wang, Hongxing
    Weng, Chaoqun
    Yuan, Junsong
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 4106 - 4113
  • [23] Multi-Label Co-Training
    Xing, Yuying
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Zhang, Zili
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2882 - 2888
  • [24] Co-training Based on Multi-type Text Features
    Liu, Wenting
    Jing, Xiaojun
    Chen, Yaqin
    Li, Jia
    SIGNAL AND INFORMATION PROCESSING, NETWORKING AND COMPUTERS, 2018, 473 : 213 - 220
  • [25] Multiple Feature Fusion Based on Co-Training Approach and Time Regularization for Place Classification in Wearable Video
    Dovgalecs, Vladislavs
    Megret, Remi
    Berthoumieu, Yannick
    ADVANCES IN MULTIMEDIA, 2013, 2013
  • [26] Image segmentation based on fussing multi-feature and spatial spectral clustering
    Gou, S. P.
    Chen, P. J.
    Yang, X. Y.
    Jiao, L. C.
    CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 3, PROCEEDINGS, 2008, : 667 - 671
  • [27] A co-training method based on entropy and multi-criteria
    Jia Lu
    Yanlu Gong
    Applied Intelligence, 2021, 51 : 3212 - 3225
  • [28] A co-training method based on entropy and multi-criteria
    Lu, Jia
    Gong, Yanlu
    APPLIED INTELLIGENCE, 2021, 51 (06) : 3212 - 3225
  • [29] Band Selection Algorithm Based on Multi-Feature and Affinity Propagation Clustering
    Zhuang, Junbin
    Chen, Wenying
    Huang, Xunan
    Yan, Yunyi
    REMOTE SENSING, 2025, 17 (02)
  • [30] Crack detection for nuclear containments based on multi-feature fused semantic segmentation
    Pan, Pai
    Xu, Yaming
    Xing, Cheng
    Chen, Yang
    CONSTRUCTION AND BUILDING MATERIALS, 2022, 329