Automatic Extraction of Product Information from Multiple e-Commerce Web Sites

被引:0
作者
Nasti, Samiah Jan [1 ]
Asger, M. [2 ]
Butt, Muheet Ahmad [3 ]
机构
[1] BGSB Univ, Dept Comp Sci, Rajouri, Jammu & Kasmir, India
[2] BGSB Univ, Sch Math Sci & Engn, Rajouri, Jammu & Kasmir, India
[3] Univ Kashmir Hazratbal Srinagar, Dept Comp Sci, Srinagar, Jammu & Kasmir, India
来源
PROCEEDINGS OF RECENT INNOVATIONS IN COMPUTING, ICRIC 2019 | 2020年 / 597卷
关键词
Document Object Model (DOM) tree; Crawling; Clustering; Wrapper generation;
D O I
10.1007/978-3-030-29407-6_53
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
With the growth of e-commerce, shopping online has now become a part and parcel of every one's life. The advantage of e-commerce Web sites is that they can reach to a very large number of customers despite of distance and time limitations. The main aim of this paper is to extract the product information from various e-commerce sites. Extraction of such information can help the business organizations to fetch and attract the large number of customers to their Web site and increase profit. So, in this paper, we propose a fully automatic method which will extract and integrate information from multiple e-commerce Web sites in order to improve business decision making. The proposed method is also comparatively better at precision and recall than other methods.
引用
收藏
页码:739 / 747
页数:9
相关论文
共 8 条
[1]  
Ansari S.L., 1977, ACCOUNT ORG SOC, V2, P101, DOI DOI 10.1016/0361-3682(77)90027-7
[2]  
Kalakota R., 1977, ELECT COMMERCE MANGE
[3]  
Kiruthika M., 2011, PATTERN DISCOVERY US
[4]  
Lee Y.-S., 2004, IEEE INT C E TECHN E
[5]  
Perner P., 2003, 5 INT C ICICS
[6]  
Satokar K., 2010, INT J COMPUT APPL
[7]  
Todi A., 2012, INT J ENG SCI ADV TE, V2, P550
[8]   Integrating purchase patterns and traversal patterns to predict HTTP requests in E-Commerce sites [J].
Vallamkondu, S ;
Gruenwald, L .
IEEE INTERNATIONAL CONFERENCE ON E-COMMERCE, 2003, :256-263