Information Extractor for Small Medium Enterprise Aggregator

被引:0
|
作者
Oktavino, Fabrian H. [1 ]
Maulidevi, Nur Ulfa S. T. [1 ]
机构
[1] Inst Teknol Bandung, Comp Sci Informat, Bandung, Indonesia
来源
2014 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE) | 2014年
关键词
SME; Support Vector Machine; Information Extractor; SMOTE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Indonesia have a massive number of SMEs, but with a very low revenue. An alternative to increase revenue is by using internet. Some SMEs already develop their website, but they don't have same navigation. The websites confuse the potential buyers. So, a website's aggregator is essential. This aggregator is made without the owner of the SMEs to register their website, which means it can automatically show website's content that already been made. For this purpose, two stages is required. First is to find relevant SMEs websites, and the second is to extract information automatically. This paper focuses on information extractor to extract information from SMEs e-commerce website with or without shopping cart feature, used to make an automatic SME aggregator and make prototype database. Learning algorithms is needed to recognize information that will be extracted. The research is about how to preprocessing website pages and what is the best algorithm for automatic information extraction. The system will compare three algorithms, Naive Bayes, Decision Tree, and Support Vector Machine. Algorithm with the best accuracy will be used for the system's model. Support Vector Machine is the best algorithm. SMOTE, which is method to solve imbalanced data set problem by oversampling minority class, is the best filter for system's training model. System can extract information with best performance from SMEs e-commerce website with shopping cart feature.
引用
收藏
页数:5
相关论文
共 50 条