Contextual ontological concepts extraction

Cited by: 0
Authors
Karoui, Lobna [1]
Bennacer, Nacera [1]
Aufaure, Marie-Aude [1]
Affiliations
[1] Ecole Super Elect, F-91192 Gif Sur Yvette, France
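Source
DISCOVERY SCIENCE, PROCEEDINGS | 2006 / Vol. 4265
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract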
Ontologies provide a common layer that plays a major role in supporting information exchange and sharing. In this paper, we focus on extracting ontological concepts from HTML documents. We propose an unsupervised hierarchical clustering algorithm, "Contextual Ontological Concept Extraction" (COCE), which applies a partitioning algorithm incrementally and is guided by a structural context. This context exploits the HTML structure and the location of words to select the semantically closer co-occurrents of each word and to improve word weighting. Guided by this context definition, we perform an incremental clustering that refines the word context of each cluster to extract semantic concepts. The COCE algorithm can be run either automatically or interactively. We evaluate the COCE algorithm on French documents related to tourism. Our results show that our context-based algorithm improves the conceptual quality of the resulting clusters.
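The abstract does not give the details of COCE, so the following is only a minimal sketch of the general idea it describes: co-occurrence weights that depend on HTML structure, followed by repeated application of a simple two-way partitioning step. Everything named here (`BlockExtractor`, `context_vectors`, `split`, `cluster`, the two weight constants, the `max_size` stopping rule, and the seed-based split) is a hypothetical illustration and not the authors' method.

```python
from collections import defaultdict
from html.parser import HTMLParser
import math
import re

# Hypothetical weights (assumptions, not from the paper): link strength for
# words sharing a block, and for a heading word with words in the text below.
SAME_BLOCK_WEIGHT = 2.0
HEADING_TO_BODY_WEIGHT = 1.0
HEADINGS = {"h1", "h2", "h3"}
BLOCKS = HEADINGS | {"p", "li", "td"}


class BlockExtractor(HTMLParser):
    """Collect (tag, words) pairs for each block-level element."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._tag = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCKS:
            self._tag, self._text = tag, []

    def handle_data(self, data):
        if self._tag:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == self._tag:
            words = re.findall(r"\w+", " ".join(self._text).lower())
            self.blocks.append((tag, words))
            self._tag = None


def context_vectors(html):
    """Build a weighted co-occurrence vector per word from structural context."""
    parser = BlockExtractor()
    parser.feed(html)
    vectors = defaultdict(lambda: defaultdict(float))
    heading = []  # words of the most recent heading
    for tag, words in parser.blocks:
        for w in words:
            for other in words:
                if other != w:
                    vectors[w][other] += SAME_BLOCK_WEIGHT
            if tag not in HEADINGS:
                for h in heading:
                    if h != w:
                        vectors[w][h] += HEADING_TO_BODY_WEIGHT
                        vectors[h][w] += HEADING_TO_BODY_WEIGHT
        if tag in HEADINGS:
            heading = words
    return vectors


def cosine(a, b):
    dot = sum(v * b.get(k, 0.0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def split(words, vectors):
    """One partitioning step: seed with the least similar pair, then assign."""
    pairs = [(cosine(vectors[a], vectors[b]), a, b)
             for i, a in enumerate(words) for b in words[i + 1:]]
    _, s1, s2 = min(pairs)
    left, right = [s1], [s2]
    for w in words:
        if w in (s1, s2):
            continue
        to_s1 = cosine(vectors[w], vectors[s1])
        to_s2 = cosine(vectors[w], vectors[s2])
        (left if to_s1 >= to_s2 else right).append(w)
    return left, right


def cluster(words, vectors, max_size=3):
    """Re-apply the split to any cluster still larger than max_size."""
    if len(words) <= max_size:
        return [sorted(words)]
    left, right = split(words, vectors)
    return cluster(left, vectors, max_size) + cluster(right, vectors, max_size)


page = """
<h1>Hébergement</h1>
<p>hôtel chambre</p>
<h1>Loisirs</h1>
<p>plage randonnée</p>
"""
vectors = context_vectors(page)
clusters = cluster(sorted(vectors), vectors, max_size=3)
print(clusters)
```

On this toy page the sketch groups each heading with the words beneath it, because those words share context only with one another. An interactive variant, as mentioned in the abstract, could pause after each `split` and let a user accept or reject the proposed clusters; that is not shown here.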
Pages: 306-310
Page count: 5