A hybrid-based method for Chinese domain lightweight ontology construction

Cited by: 9
Authors
Qiu, Jing [1]
Qi, Lin [1]
Wang, Jianliang [1]
Zhang, Guanghua [1]
Affiliations
[1] Hebei Univ Sci & Technol, Dept Informat Sci & Engn, Shijiazhuang 050018, Hebei, Peoples R China
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
Ontology learning; Concept extraction; Taxonomic relationships extraction; Hybrid-based method; LEARNING CONCEPT HIERARCHIES; TAXONOMY; TEXT; WEB; EXTRACTION;
DOI
10.1007/s13042-017-0661-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes a framework for automatically constructing a lightweight ontology from a corpus of Chinese domain Web documents. A hybrid method is used for domain lightweight ontology learning: rule-based, statistics-based, and cluster-based methods are combined to complete two sub-tasks, concept extraction and taxonomic relationship extraction. First, multiword terms are identified using a set of rules together with a Named Entity Module, and three statistical methods are employed jointly to rank the domain concepts. Second, clustering and subsumption methods are combined to construct the taxonomy. Concepts are clustered into several groups, using three similarity measures defined to capture semantic, spatial, and co-occurrence information between concepts. The subsumption method is then applied to build a taxonomic structure for each concept group, since taxonomic relations exist only between similar concepts. Third, definitions of the concepts extracted in the first step are collected from an online Chinese encyclopedia. The rule-based method and a set of lexico-syntactic patterns are applied to this collection of definitions to extract taxonomic relationships and refine the taxonomy. Finally, the method is evaluated against a gold standard in the domain of football games and compared with several classical algorithms. The experimental results show the effectiveness of the method.
Pages: 1519-1531
Number of pages: 13
Related papers
50 in total
  • [21] On method and automatic construction theory of domain ontology based on depended text
    Liu Yao
    Sui Zhifang
    Chen Xuefei
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 2, PROCEEDINGS, 2006, : 63 - +
  • [22] Standard Files Based Ontology Construction Method in Power Quality Domain
    Zhang Y.
    Li K.
    Shao Z.
    Luo H.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (17): : 102 - 110
  • [23] Research on Construction Method of Agricultural Domain Ontology
    Yang, Guoxia
    Liu, Ziyu
    Shen, Xiaomin
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MATERIALS, ENVIRONMENTAL AND BIOLOGICAL ENGINEERING, 2015, 10 : 188 - 191
  • [24] A new method of Chinese short text classification based on the domain ontology
    Yang, Fengqin
    Zhou, Xu
    Wu, Di
    Yang, Xiquan
    Sun, Tieli
    Sun, T. (suntl@nenu.edu.cn), 1600, ICIC Express Letters Office, Tokai University, Kumamoto Campus, 9-1-1, Toroku, Kumamoto, 862-8652, Japan (06): : 1399 - 1404
  • [25] Hybrid-based mathematical method for enhancing the quantitative research
    Almasarweh, Mohammad Salameh
    Alsaraireh, Ahmed Atallah
    Al Wadi, S.
    Alnawaiseh, Mahmoud Barakat
    ITALIAN JOURNAL OF PURE AND APPLIED MATHEMATICS, 2019, (42): : 944 - 953
  • [26] Automated Chinese domain ontology construction from text documents
    Zheng, Yu
    Dou, Wenxiang
    Wu, Gengfeng
    Li, Xin
    BIO-INSPIRED COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2007, 4688 : 639 - +
  • [27] Research on automatic acquisition method of Chinese domain ontology backbone based on HowNet
    Liu, J. (liujxxxy@126.com), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (07):
  • [28] A Mixed Method for Building the Uyghur and Chinese Domain Ontology
    Hankiz, Yilahun
    Seyyare, Imam
    Askar, Hamdulla
    KNOWLEDGE GRAPH AND SEMANTIC COMPUTING: SEMANTIC, KNOWLEDGE, AND LINKED BIG DATA, 2016, 650 : 124 - 129
  • [29] Construction and Representation of Shipping Domain Ontology Based on Ontology Design Patterns
    Liang, Yiduo
    Zhai, Jun
    2018 15TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2018,
  • [30] A Hybrid-based Feature Selection Method for Intrusion Detection System
    Sun, Xibin
    Ye, Heping
    Liu, Xiaolin
    International Journal of Network Security, 2023, 25 (01) : 131 - 139