Clustering of Rough Set Related Documents with Use of Knowledge from DBpedia

被引:0
|
作者
Szczuka, Marcin [1 ]
Janusz, Andrzej [1 ]
Herba, Kamil [1 ]
机构
[1] Univ Warsaw, Fac Math Informat & Mech, PL-02097 Warsaw, Poland
来源
ROUGH SETS AND KNOWLEDGE TECHNOLOGY | 2011年 / 6954卷
关键词
Text mining; semantic clustering; DBpedia; document grouping; rough sets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A case study of semantic clustering of scientific articles related to Rough Sets is presented. The proposed method groups the documents on the basis of their content and with assistance of DBpedia knowledge base. The text corpus is first treated with Natural Language Processing tools in order to produce vector representations of the content and then matched against a collection of concepts retrieved from DBpedia. As a result, a new representation is constructed that better reflects the semantics of the texts. With this new representation, the documents are hierarchically clustered in order to form partition of papers that share semantic relatedness. The steps in textual data preparation, utilization of DBpedia and clustering are explained and illustrated with results of experiments performed on a corpus of scientific documents about rough sets.
引用
收藏
页码:394 / 403
页数:10
相关论文
共 50 条
  • [41] A rough set approach to knowledge reduction based on inclusion degree and evidence reasoning theory
    Zhang, M
    Xu, LD
    Zhang, WX
    Li, HZ
    EXPERT SYSTEMS, 2003, 20 (05) : 298 - 304
  • [42] The use of rough sets theory for clustering rule extraction of ART1
    Zhu, XQ
    Xu, SC
    Chen, ZX
    Chen, HX
    8TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING, VOLS 1-3, PROCEEDING, 2001, : 355 - 360
  • [44] A novel dynamic knowledge extraction method in cooperative multiple robot system using rough set
    Xu, H
    Jia, PF
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS II, 2005, 187 : 535 - 543
  • [45] Biological Knowledge Integration in DNA Microarray Gene Expression Classification Based on Rough Set Theory
    Calvo-Dmgz, D.
    Galvez, J. F.
    Glez-Pena, Daniel
    Fdez-Riverola, Florentino
    6TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2012, 154 : 53 - 61
  • [46] A rough set approach to mining concise rules from inconsistent data
    Sai, Ying
    Nie, Peiyao
    Xu, Ruzhi
    Huang, Jincai
    2006 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, 2006, : 333 - +
  • [47] Rough set based rule induction from two decision tables
    Inuiguchi, Masahiro
    Miyajima, Takuya
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 181 (03) : 1540 - 1553
  • [48] Attribute Reduction from Closure Operators and Matroids in Rough Set Theory
    Restrepo, Mauricio
    Cornelis, Chris
    ROUGH SETS, IJCRS 2020, 2020, 12179 : 183 - 192
  • [49] Rough Set Model based Knowledge Acquisition of Market Movements in Tick-wise Price Data
    Matsumoto, Yoshiyuki
    Watada, Junzo
    6TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS, AND THE 13TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS, 2012, : 1768 - 1771
  • [50] Confidence limits on rough set rules generated from a limited number of examples
    Anderson, GT
    Shelton, RO
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2000, 29 (06) : 867 - 877