A hybrid-based method for Chinese domain lightweight ontology construction

被引:9
|
作者
Qiu, Jing [1 ]
Qi, Lin [1 ]
Wang, Jianliang [1 ]
Zhang, Guanghua [1 ]
机构
[1] Hebei Univ Sci & Technol, Dept Informat Sci & Engn, Shijiazhuang 050018, Hebei, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Ontology learning; Concept extraction; Taxonomic relationships extraction; Hybrid-based method; LEARNING CONCEPT HIERARCHIES; TAXONOMY; TEXT; WEB; EXTRACTION;
D O I
10.1007/s13042-017-0661-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a framework to automatically construct lightweight ontology from a corpus of Chinese domain Web documents. A hybrid-based method was used for domain lightweight ontology learning. Rule-based method, statistics-based method and cluster-based method were combined to complete two sub-tasks: concept extraction and taxonomic relationships extraction. Firstly, multiword terms were identified based on a set of rules as well as a Named Entity Module. Three statistic methods were employed jointly to rank the order of domain concepts. Secondly, clustering and subsumption methods were joined to construct taxonomy. Concepts were clustered into several groups through clustering method. Three similarity measures were defined to compute similarities between concepts, which aims at capturing semantic, spatial, and co-occurrence information. Subsumption method was adopted to construct taxonomic structure for each concept group, since taxonomic relations only existed between similar concepts. Thirdly, the definitions of the concepts extracted in the first step are collected from online Chinese Encyclopedia. On this collection of concept definitions, the rule-based method and a set of lexico-syntactic patterns were applied to extract taxonomic relationships and refine the taxonomy. Finally, we evaluate our method using gold-standard evaluation on domain of football games. In our evaluation, we compare our method with several classical algorithms. The experimental results show the effectiveness of our method.
引用
收藏
页码:1519 / 1531
页数:13
相关论文
共 50 条
  • [31] A New Method for Generating the Chinese News Summary Based on Fuzzy Reasoning and Domain Ontology
    Chen, Shyi-Ming
    Huang, Ming-Hung
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT I,, 2013, 7802 : 70 - 78
  • [32] An effective automated ontology construction based on the agriculture domain
    Deepa, Rajendran
    Vigneshwari, Srinivasan
    ETRI JOURNAL, 2022, 44 (04) : 573 - 587
  • [33] AUTOMATIC ONTOLOGY CONSTRUCTION IN FICTION-BASED DOMAIN
    Goh, Hui-Ngo
    Kiu, Ching-Chieh
    Soon, Lay-Ki
    Ranaivo-Malancon, Bali
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2011, 21 (08) : 1147 - 1167
  • [34] A Hybrid-Based Verifiable Secret Sharing Scheme Using Chinese Remainder Theorem
    Om Prakash Verma
    Nitin Jain
    S. K. Pal
    Arabian Journal for Science and Engineering, 2020, 45 : 2395 - 2406
  • [35] Anaphora resolution in Chinese texts based on domain ontology
    Shi, Shumin
    Huang, Heyan
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 134 - 139
  • [36] A Hybrid-Based Verifiable Secret Sharing Scheme Using Chinese Remainder Theorem
    Verma, Om Prakash
    Jain, Nitin
    Pal, S. K.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2020, 45 (04) : 2395 - 2406
  • [38] A hybrid-based texture synthesis approach
    Wu, CH
    Lai, YY
    Tai, WK
    VISUAL COMPUTER, 2004, 20 (2-3): : 106 - 129
  • [39] The Construction of Management Domain Ontology
    Li, Dexun
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC & MECHANICAL ENGINEERING AND INFORMATION TECHNOLOGY (EMEIT-2012), 2012, 23
  • [40] Automatic construction on of domain Ontology
    Liu, Yao
    Sui, Zhi-Fang
    Hu, Yong-Wei
    Ji, Tie-Liang
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2006, 29 (SUPPL. 2): : 65 - 69