Building domain ontology based on web data and generic ontology

被引:0
作者
Yang, J [1 ]
Wang, L [1 ]
Zhang, S [1 ]
Sui, X [1 ]
Zhang, N [1 ]
Xu, ZQ [1 ]
机构
[1] Peking Univ, Sch EECS, Dept Comp Sci, Beijing 100871, Peoples R China
来源
IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS | 2004年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automatic or semi-automatic construction of ontology has become a research topic of interest in recent years. This paper describes a mechanism for constructing domain specific ontologies automatically based on web data and generic ontology. Firstly, we employ the hierarchical agglomerative clustering(HAC) algorithm, clustering web pages hierarchically and resulting in a binary tree. Then an algorithm is proposed, which selects from the binary tree the significative nodes as topics implying concepts of domain interests. Lastly, the Chinese generic ontology, HowNet, is introduced to evolve the topics (together with their hierarchical structures) into domain ontology. We experiment our method in the field of computer hardware based on web pages collected from Chinese BtoC web sites. An in-depth discussion on the experiment results is also given.
引用
收藏
页码:686 / 689
页数:4
相关论文
共 7 条
[1]  
HE CLT, 2003, INFORMATION RETRIEVA, P613
[2]  
Jain A.K., 1998, ALGORITHMS CLUSTERIN
[3]  
LIIM SJL, 2003, ISMIS 2003, P93
[4]   VECTOR-SPACE MODEL FOR AUTOMATIC INDEXING [J].
SALTON, G ;
WONG, A ;
YANG, CS .
COMMUNICATIONS OF THE ACM, 1975, 18 (11) :613-620
[5]   Bottom-up construction of ontologies [J].
van der Vet, PE ;
Mars, NJI .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (04) :513-526
[6]  
ZHANG YWS, 2003, IAT 2003, P461
[7]  
ZHOU QB, 2002, P 35 HAW INT COMF SY