Clustering of Rough Set Related Documents with Use of Knowledge from DBpedia

被引:0
|
作者
Szczuka, Marcin [1 ]
Janusz, Andrzej [1 ]
Herba, Kamil [1 ]
机构
[1] Univ Warsaw, Fac Math Informat & Mech, PL-02097 Warsaw, Poland
来源
ROUGH SETS AND KNOWLEDGE TECHNOLOGY | 2011年 / 6954卷
关键词
Text mining; semantic clustering; DBpedia; document grouping; rough sets;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A case study of semantic clustering of scientific articles related to Rough Sets is presented. The proposed method groups the documents on the basis of their content and with assistance of DBpedia knowledge base. The text corpus is first treated with Natural Language Processing tools in order to produce vector representations of the content and then matched against a collection of concepts retrieved from DBpedia. As a result, a new representation is constructed that better reflects the semantics of the texts. With this new representation, the documents are hierarchically clustered in order to form partition of papers that share semantic relatedness. The steps in textual data preparation, utilization of DBpedia and clustering are explained and illustrated with results of experiments performed on a corpus of scientific documents about rough sets.
引用
收藏
页码:394 / 403
页数:10
相关论文
共 50 条
  • [21] Granulation using Clustering and Rough Set Theory & its Tree Representation
    Singh, Girish Kumar
    Minz, Sonajharia
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 19, 2007, 19 : 271 - 276
  • [22] A vague-rough set approach for uncertain knowledge acquisition
    Feng, Lin
    Li, Tianrui
    Ruan, Da
    Gou, Shirong
    KNOWLEDGE-BASED SYSTEMS, 2011, 24 (06) : 837 - 843
  • [23] Knowledge acquisition in incomplete information systems: A rough set approach
    Leung, Y
    Wu, WZ
    Zhang, WX
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2006, 168 (01) : 164 - 180
  • [24] Neighbourhood rough set model for knowledge acquisition using MapReduce
    Hiremath, Shruthi
    Chandra, Pallavi
    Joy, Anne Mary
    Tripathy, B. K.
    INTERNATIONAL JOURNAL OF COMMUNICATION NETWORKS AND DISTRIBUTED SYSTEMS, 2015, 15 (2-3) : 212 - 234
  • [25] Study of Knowledge Acquisition Using Rough Set Merging Rule from Time Series Data
    Matsumoto, Yoshiyuki
    Watada, Junzo
    2018 INTERNATIONAL CONFERENCE ON UNCONVENTIONAL MODELLING, SIMULATION AND OPTIMIZATION - SOFT COMPUTING AND META HEURISTICS - UMSO, 2018,
  • [26] A novel rough value set categorical clustering technique for supplier base management
    Uddin, Jamal
    Ghazali, Rozaida
    Deris, Mustafa Mat
    Iqbal, Umer
    Shoukat, Ijaz Ali
    COMPUTING, 2021, 103 (09) : 2061 - 2091
  • [27] Rough Set Based Fuzzy Scheme for Clustering and Cluster Head Selection in VANET
    Jinila, Bevish
    Komathy
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2015, 21 (01) : 54 - 59
  • [28] A novel rough value set categorical clustering technique for supplier base management
    Jamal Uddin
    Rozaida Ghazali
    Mustafa Mat Deris
    Umer Iqbal
    Ijaz Ali Shoukat
    Computing, 2021, 103 : 2061 - 2091
  • [29] Ant Based Clustering of Time Series Discrete Data - A Rough Set Approach
    Pancerz, Krzysztof
    Lewicki, Arkadiusz
    Tadeusiewicz, Ryszard
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I, 2011, 7076 : 645 - +
  • [30] A new rough set approach to knowledge discovery in incomplete information systems
    Wu, WZ
    Mi, JS
    Zhang, WX
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 1713 - 1718