A hybrid-based method for Chinese domain lightweight ontology construction

Cited by: 9
Authors
Qiu, Jing [1]
Qi, Lin [1]
Wang, Jianliang [1]
Zhang, Guanghua [1]
Affiliation
[1] Hebei Univ Sci & Technol, Dept Informat Sci & Engn, Shijiazhuang 050018, Hebei, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Ontology learning; Concept extraction; Taxonomic relationships extraction; Hybrid-based method; LEARNING CONCEPT HIERARCHIES; TAXONOMY; TEXT; WEB; EXTRACTION;
DOI
10.1007/s13042-017-0661-0
Chinese Library Classification
TP18 [Theory of artificial intelligence]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a framework for automatically constructing a lightweight ontology from a corpus of Chinese domain Web documents. A hybrid-based method is used for domain lightweight ontology learning: rule-based, statistics-based, and cluster-based methods are combined to complete two sub-tasks, concept extraction and taxonomic relationship extraction. First, multiword terms are identified using a set of rules together with a Named Entity Module, and three statistical methods are applied jointly to rank the domain concepts. Second, clustering and subsumption methods are combined to construct the taxonomy. Concepts are clustered into several groups, using three similarity measures defined to capture semantic, spatial, and co-occurrence information between concepts. The subsumption method is then applied to build a taxonomic structure for each concept group, since taxonomic relations exist only between similar concepts. Third, definitions of the concepts extracted in the first step are collected from an online Chinese encyclopedia. The rule-based method and a set of lexico-syntactic patterns are applied to this collection of definitions to extract taxonomic relationships and refine the taxonomy. Finally, we evaluate the method against a gold standard in the domain of football games and compare it with several classical algorithms. The experimental results show the effectiveness of the method.
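The subsumption step named in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact criterion the authors use, so the code below assumes the classical document co-occurrence formulation: concept x subsumes concept y when P(x|y) is at least a threshold and P(y|x) is below 1. The function name `subsumption_pairs`, the threshold of 0.8, and the toy football documents are all illustrative assumptions, not details from the paper.

```python
from collections import defaultdict


def subsumption_pairs(docs, threshold=0.8):
    """Return sorted (parent, child) pairs under a co-occurrence subsumption test.

    docs: iterable of sets, each holding the concepts found in one document.
    Assumed criterion (not confirmed by the paper): x subsumes y when
    P(x|y) >= threshold and P(y|x) < 1.
    """
    doc_freq = defaultdict(int)  # number of documents containing each concept
    co_freq = defaultdict(int)   # number of documents containing both x and y
    for concepts in docs:
        for x in concepts:
            doc_freq[x] += 1
            for y in concepts:
                if x != y:
                    co_freq[(x, y)] += 1

    pairs = []
    for (x, y), n_xy in co_freq.items():
        p_x_given_y = n_xy / doc_freq[y]
        p_y_given_x = n_xy / doc_freq[x]
        if p_x_given_y >= threshold and p_y_given_x < 1.0:
            pairs.append((x, y))
    return sorted(pairs)


# Toy corpus (hypothetical): each set is the concept list of one document.
docs = [
    {"football", "striker"},
    {"football", "goalkeeper"},
    {"football", "striker", "goalkeeper"},
    {"football"},
]
print(subsumption_pairs(docs))
```

In the framework described above, a test of this kind would be run separately inside each concept cluster rather than over the whole concept set, which limits candidate parent-child pairs to concepts already judged similar.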
Pages: 1519-1531
Page count: 13