An Automated Approach to Product Taxonomy Mapping in E-Commerce

被引:0
作者
Nederstigt, Lennart [1 ]
Vandic, Damir [1 ]
Frasincar, Flavius [1 ]
机构
[1] Erasmus Univ, Inst Econometr, Rotterdam, Netherlands
来源
MANAGEMENT INTELLIGENT SYSTEMS | 2012年 / 171卷
关键词
e-commerce; taxonomy mapping; word sense disambiguation; ONTOLOGY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the ever-growing amount of information available on Web shops, it has become increasingly difficult to get an overview of Web-based product information. There are clear indications that better search capabilities, such as the exploitation of annotated data, are needed to keep online shopping transparent for the user. For example, annotations can help present information from multiple sources in a uniform manner. This paper proposes an algorithm that can autonomously map heterogeneous product taxonomies for Web shop data integration purposes. The proposed approach uses word sense disambiguation techniques, approximate lexical matching, and a mechanism that deals with composite categories. Our algorithm's performance on three real-life datasets was compared favourably against two other state-of-the-art taxonomy mapping algorithms. The experiments show that our algorithm performs at least twice as good compared to the other algorithms w.r.t. precision and F-measure.
引用
收藏
页码:111 / 120
页数:10
相关论文
共 13 条
[1]  
[Anonymous], 2005, P 2005 ACM SIGMOD IN
[2]  
[Anonymous], 36 PEW INT AM LIF PR
[3]  
Ehrig M, 2004, LECT NOTES COMPUT SC, V3298, P683
[4]  
Hepp M, 2008, LECT NOTES ARTIF INT, V5268, P329, DOI 10.1007/978-3-540-87696-0_29
[5]  
Hongwei Zhu, 2008, International Journal of Electronic Business, V6, P319, DOI 10.1504/IJEB.2008.020672
[6]  
John Li, 2004, 5 WORKSH PERF METR I
[7]  
Lesk M., 1986, P 5 ANN INT C SYSTEM, P24
[8]  
Madhavan J., 2001, Proceedings of the 27th International Conference on Very Large Data Bases, P49
[9]   Similarity flooding: A versatile graph matching algorithm and its application to schema matching [J].
Melnik, S ;
Garcia-Molina, H ;
Rahm, E .
18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, :117-128
[10]   WORDNET - A LEXICAL DATABASE FOR ENGLISH [J].
MILLER, GA .
COMMUNICATIONS OF THE ACM, 1995, 38 (11) :39-41