Unlabelled text mining methods based on two extension models of concept lattices

Cited by: 28
Authors
Chen, Xiaoyu [1]
Qi, Jianjun [1]
Zhu, Xiaomin [1]
Wang, Xin [2, 4]
Wang, Zhen [3]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Univ Calgary, Dept Geomat Engn, Calgary, AB, Canada
[3] Northwest Univ, Sch Math, Xian, Peoples R China
[4] Northwest Univ, Sch Informat Sci & Technol, Xian, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Formal concept analysis; Three-way concept lattice; Fuzzy concept lattice; Text clustering; Text classification; Association rules; 3-way; Ontology
DOI
10.1007/s13042-019-00987-6
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Concept lattices are a useful tool for text extraction. Common text clustering methods cannot generate hierarchical relationships among categories and perform soft clustering at the same time, while the concept lattice ignores the negative correlation between an object subset and an attribute subset. Motivated by these problems, we propose unlabelled text mining methods based on the fuzzy concept lattice and the three-way concept lattice. First, we mine hierarchical text categories to construct a classification system based on the fuzzy concept lattice, and obtain labelled samples by word matching. Then, we construct a three-way concept lattice to derive positive and negative classification rules from the labelled samples, and build a classifier to predict new samples. Finally, the Sogou laboratory news corpus is used to evaluate the efficiency of the text clustering and classification methods. The results demonstrate that the improved clustering method has a higher average cluster goodness than earlier procedures and that the classification model based on the three-way concept lattice achieves a higher accuracy.
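The abstract's remark about negative correlation can be made concrete with a small sketch. The code below is not the authors' implementation; it only illustrates the standard positive and negative derivation operators of three-way concept analysis on a toy document-term context. The data and the names `CONTEXT`, `positive`, `negative`, `extent`, and `three_way_concepts` are all hypothetical.

```python
from itertools import combinations

# Toy document-term context (hypothetical data, not from the paper):
# each document maps to the set of terms it contains.
CONTEXT = {
    "d1": {"match", "goal"},
    "d2": {"match", "goal", "coach"},
    "d3": {"stock", "market"},
    "d4": {"stock", "market", "bank"},
}
TERMS = set().union(*CONTEXT.values())


def positive(docs):
    """Terms shared by every document in docs (positive operator)."""
    return {t for t in TERMS if all(t in CONTEXT[d] for d in docs)}


def negative(docs):
    """Terms absent from every document in docs (negative operator)."""
    return {t for t in TERMS if all(t not in CONTEXT[d] for d in docs)}


def extent(pos_terms, neg_terms):
    """Documents that contain all pos_terms and none of neg_terms."""
    return {
        d for d, terms in CONTEXT.items()
        if pos_terms <= terms and not (neg_terms & terms)
    }


def three_way_concepts():
    """Enumerate object-induced three-way concepts by brute force.

    Each concept is (documents, shared terms, jointly absent terms).
    Brute force is only feasible for tiny contexts like this one.
    """
    found = set()
    docs = sorted(CONTEXT)
    for size in range(len(docs) + 1):
        for subset in combinations(docs, size):
            pos, neg = positive(subset), negative(subset)
            closed = extent(pos, neg)
            found.add((frozenset(closed), frozenset(pos), frozenset(neg)))
    return found


if __name__ == "__main__":
    for docs, pos, neg in sorted(three_way_concepts(), key=lambda c: len(c[0])):
        print(sorted(docs), "has", sorted(pos), "lacks", sorted(neg))
```

In this toy context, the documents `d1` and `d2` share the terms `match` and `goal` and jointly lack `stock`, `market`, and `bank`. A pair of that kind is the sort of information from which positive and negative classification rules could be read off, although the paper's actual rule extraction may differ.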
Pages: 475-490
Number of pages: 16