Automated product taxonomy mapping in an e-commerce environment

被引:22
作者
Aanen, Steven S. [1 ]
Vandic, Damir [1 ]
Frasincar, Flavius [1 ]
机构
[1] Erasmus Univ, NL-3000 DR Rotterdam, Netherlands
关键词
Products; Semantic Web; Schema; Ontology; Matching; Mapping; Merging; E-commerce; Web shop; ONTOLOGY; WORDNET; WEB;
D O I
10.1016/j.eswa.2014.09.032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last few years, we have experienced a steady growth in e-commerce. This growth introduces many problems for services that want to aggregate product information and offerings. One of the problems that aggregation services face is the matching of product categories from different Web shops. This paper proposes an algorithm to perform this task automatically, making it possible to aggregate product information from multiple Web sites, in order to deploy it for search, comparison, or recommender systems applications. The algorithm uses word sense disambiguation techniques to address varying denominations between different taxonomies. Path similarity is assessed between source and candidate target categories, based on lexical relatedness and structural information. The main focus of the proposed solution is to improve the disambiguation procedure in comparison to an existing state-of-the-art approach, while coping with product taxonomy-specific characteristics, like composite categories, and re-examining lexical similarity and similarity aggregation in this context. The performance evaluation based on data from three real-world Web shops demonstrates that the proposed algorithm improves the bench-marked approach by 62% on average F-1-measure. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1298 / 1313
页数:16
相关论文
共 70 条
  • [1] [Anonymous], 2004, Romanian Journal of Information Science and Technology
  • [2] [Anonymous], 1996, Proceedings of COLING, DOI DOI 10.3115/992628.992635
  • [3] [Anonymous], 1992, the 14th International Conference on Computational Linguistics
  • [4] [Anonymous], 36 PEW INT AM LIF PR
  • [5] Arlotta L., 2003, WEBDB, P7
  • [6] Aumueller D., 2005, P 2005 ACM SIGMOD IN, DOI [DOI 10.1145/1066157.1066283, 10.1145/106615 7.1066283]
  • [7] Avesani P, 2005, LECT NOTES COMPUT SC, V3729, P67, DOI 10.1007/11574620_8
  • [8] Banerjee S., 2002, Computational Linguistics and Intelligent Text Processing. Third International Conference, CICLing 2002. Proceedings (Lecture Notes in Computer Science Vol.2276), P136
  • [9] An information integration framework for E-commerce
    Benetti, E
    Beneventano, D
    Bergamaschi, S
    Guerra, F
    Vincini, M
    [J]. IEEE INTELLIGENT SYSTEMS, 2002, 17 (01) : 18 - 25
  • [10] Services mashups - The new generation of web applications
    Benslimane, Djamal
    Dustdar, Schahram
    Sheth, Amit
    [J]. IEEE INTERNET COMPUTING, 2008, 12 (05) : 13 - 15