A Fused Multi-feature Based Co-training Approach For Document Clustering

被引:4
|
作者
Wang, Yuanqing [1 ]
Wang, Wenjun [1 ]
Dai, Weidi [1 ]
Jiao, Pengfei [1 ]
Yu, Wei [1 ]
机构
[1] Tianjin Univ, Sch Comp Sci & Technol, Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
关键词
multi-feature; co-training; document clustering; spectral clustering;
D O I
10.1109/ICISCE.2016.19
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Document clustering is a popular topic in data mining and information retrieval. Most models and methods for this problem are based on computing the similarity between pair documents modeled in a space of all terms, or a new feature space obtained by applying a topic modeling technique for a given corpus. In this paper, we regard these two ideas as clustering on term feature and on semantic feature, and have an assumption that they can contribute to each other in clustering. Also, we propose a co-training approach for spectral clustering taking two features into account. Experiments on four real-world datasets show the feasibility and efficacy of our proposed approach compared with a number of the baseline methods.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [41] A New Approach Based on Multi-feature for Cooperative Target Detection
    Sun, Guopeng
    Hao, Xiangyang
    Zhang, Xiaodong
    Zhang, Zhenjie
    2017 2ND INTERNATIONAL CONFERENCE ON MULTIMEDIA AND IMAGE PROCESSING (ICMIP), 2017, : 6 - 10
  • [42] Multi-Feature Fusion Based Approach for Robust Face Recognition
    Essa, Almabrok
    Asari, Vijayan
    MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2018, 2018, 10668
  • [43] Multi-Feature Agglomerative Hierarchical Clustering Based Abnormal Driving Behavior Detection
    Hui, Fei
    Guo, Jing
    Tang, Shuyu
    Xing, Meihua
    CICTP 2020: ADVANCED TRANSPORTATION TECHNOLOGIES AND DEVELOPMENT-ENHANCING CONNECTIONS, 2020, : 3792 - 3803
  • [44] Contour Grouping by Clustering with Multi-feature Similarity Measure
    Bai, Xue
    Luo, Siwei
    Zou, Qi
    Zhao, Yibiao
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2010, 6218 : 415 - 422
  • [45] Crowd Counting via Attention and Multi-Feature Fused Network
    Guo, Xiangyu
    Gao, Mingliang
    Pan, Jinfeng
    Shang, Jianrun
    Souri, Alireza
    Li, Qilei
    Bruno, Alessandro
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2023, 13
  • [46] Robust music information retrieval on mobile network based on multi-feature clustering
    Yoon, Won-Jung
    Oh, Sanghun
    Park, Kyu-Sik
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 279 - 283
  • [47] A CO-TRAINING APPROACH TO AUTOMATIC FACE RECOGNITION
    Zhao, Xuran
    Evans, Nicholas
    Dugelay, Jean-Luc
    19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 1979 - 1983
  • [48] A Co-Training Approach for Spatial Data Disaggregation
    Monteiro, Joao
    Martins, Bruno
    Costa, Miguel
    Pires, Joao M.
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 649 - 658
  • [49] Disagreement-Based Co-Training
    Tanha, Jafar
    van Someren, Maarten
    Afsarmanesh, Hamideh
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 803 - 810
  • [50] Underwater Multi-object Segmentation Technology Based on Spectral Clustering with Multi-feature Weighting
    Liu G.
    Cao Y.
    Zeng Z.
    Zhao E.
    Xing C.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2022, 49 (10): : 51 - 60