Unlabelled text mining methods based on two extension models of concept lattices

被引:3
作者
Xiaoyu Chen
Jianjun Qi
Xiaomin Zhu
Xin Wang
Zhen Wang
机构
[1] Xidian University,School of Computer Science and Technology
[2] University of Calgary,Department of Geomatics Engineering
[3] Northwest University,School of Mathematics
[4] Northwest University,School of Information Science and Technology
来源
International Journal of Machine Learning and Cybernetics | 2020年 / 11卷
关键词
Formal concept analysis; Three-way concept lattice; Fuzzy concept lattice; Text clustering; Text classification;
D O I
暂无
中图分类号
学科分类号
摘要
Concept lattice is a useful tool for text extraction. The common text clustering method fails to generate hierarchical relationships among categories and realize soft clustering simultaneously, while the concept lattice ignores the negative correlation between an object subset and an attribute subset. Motivated by the problems, we propose unlabelled text mining methods based on fuzzy concept lattice and three-way concept lattice. Firstly, we excavate hierarchical text categories to construct a classification system based on fuzzy concept lattice, and the labelled samples are obtained by the word matching method. Then, we construct a three-way concept lattice to get positive and negative classification rules based on the labelled samples, and the classifier is constructed to predict the new samples. Finally, Sogou laboratory news corpus is used to evaluate the efficiency of text clustering and classification methods. The results demonstrate that the improved clustering method has a higher average cluster goodness than earlier procedures and the classification model based on three-way concept lattice achieves a higher accuracy.
引用
收藏
页码:475 / 490
页数:15
相关论文
共 92 条
  • [1] Ravi K(2015)A survey on opinion mining and sentiment analysis: tasks, approaches and applications Knowl-Based Syst 89 14-46
  • [2] Ravi V(2000)Concept lattice and association rule discovery J Comput Res Dev 37 1415-1421
  • [3] Xie Z(2012)Fuzzy clustering-based formal concept analysis for association rules mining Appl Artif Intell 26 274-301
  • [4] Liu Z(2018)A new FCA-based method for identifying biclusters in gene expression data Int J Mach Learn Cybern 9 1879-1893
  • [5] Kumar CA(2012)Analysis of vector space model, latent semantic indexing and formal concept analysis for information retrieval Cybern Inf Technol 12 34-48
  • [6] Houari A(2010)Concept lattice reduction using fuzzy K-means clustering Expert Syst Appl 37 2696-2704
  • [7] Ayadi W(2018)Relation granulation and algebraic structure based on concept lattice in complex information systems Int J Mach Learn Cybern 9 1895-1907
  • [8] Ben Yahia S(2016)A comprehensive survey on formal concept analysis, its research trends and applications Int J Appl Math Comput Sci 26 495-516
  • [9] Kumar CA(2019)Similarity reasoning in formal concept analysis: from one- to many-valued contexts Knowl Inf Syst 60 715-739
  • [10] Radavansky M(1996)Information retrieval through hybrid navigation of lattice representations Int J Hum Comput Stud 45 553-578