Automatic generation of semantically enriched web pages by a text mining approach

被引:13
作者
Yang, Hsin-Chang [1 ]
机构
[1] Natl Univ Kaohsiung, Dept Informat Management, Kaohsiung 811, Taiwan
关键词
Metadata generation; Semantic tagging; Text mining; Self-organizing map;
D O I
10.1016/j.eswa.2009.02.022
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays most of the Wet) pages contain little amount of structure and supporting information that can reveal their semantics or meanings. To enable automated processing of the Web pages, semantic information such as metadata and tags regarding to each page should be added to it. Several authoring tools have been developed to help users tackling this task. However, manual or semi-automatic authoring is implausible when we intend to annotate large amount of Web pages. In this work, we proposed a method to automatically generate some descriptive metadata and tags for a Web page. The idea is to apply the self-organizing map algorithm to cluster the Web pages and discover the relationships between these clusters. In the mean time, the themes of each cluster are also identified. We then use Such relationships and themes to tag the Web pages and generate metadata for the Wet) pages. The result of experiments shows that our method may generate semantically relevant metadata and tags for the Web pages. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:9709 / 9718
页数:10
相关论文
共 23 条
[1]  
[Anonymous], P 11 INT WORLD WID W
[2]  
BECHHOFER S, 2001, P 1 INT C KNOWL CAPT
[3]  
BONINO D, 2003, P 2 INT SEM WEB C FL
[4]   A hybrid movie recommender system based on neural networks [J].
Christakou, Christina ;
Vrettos, Spyros ;
Stafylopatis, Andreas .
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (05) :771-792
[5]  
Dill S., 2003, J WEB SEMANT, V1, P115, DOI DOI 10.1016/J.WEBSEM.2003.07.006
[6]  
DINGLI A, 2003, P 2 INT C KNOWL CAPT
[7]  
Erdmann M., 2000, P COLING 2000 WORKSH
[8]  
GRAUBITZ H, 2001, P 1 INT WORKSH DAT D, P61
[9]   CREAM: CREAting Metadata for the Semantic Web [J].
Handschuh, S ;
Staab, S .
COMPUTER NETWORKS, 2003, 42 (05) :579-598
[10]  
Handschuh S, 2002, LECT NOTES ARTIF INT, V2473, P358