Unsupervised method of clustering and labeling of the online product based on reviews

被引:3
作者
Billah, Md Masum [1 ]
Bhuiyan, Mohammad Nuruzzaman [2 ]
Akterujjaman, Md. [3 ]
机构
[1] Amer Int Univ Bangladesh AIUB, Comp Sci, Dhaka 1229, Bangladesh
[2] Noakhali Sci & Technol Univ, Inst Informat Technol, Noakhali 3814, Bangladesh
[3] Daffodil Int Univ, Software Engn, Dhaka 1207, Bangladesh
关键词
Cluster labeling; construct phrases; baseline method; sentence clustering;
D O I
10.1142/S1793962321500173
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents an unsupervised approach to cluster reviews of products collected from Amazon and then generates its labels of each cluster. Instead of using a complete review, this paper splits a review into sentences and considers all sentences from the reviews as inputs for Clustering. Hierarchical Agglomerative Clustering (HAC) is used to cluster sentences. The approaches of cluster labeling are also unsupervised. For labeling, three different methods have been used to find a limited number of essential words for each cluster. Extracted essential words are used to construct phrases. Constructed phrases are used as labels for each cluster. This paper compares the result of the labeling method with baseline labeling. In the result evaluation, all the labeling methods outperform the baseline method. The aim of this research is cluster labeling that makes a set of labels to describe a cluster content and distinguishes the labels from other cluster labels.
引用
收藏
页数:23
相关论文
共 29 条
[1]  
Ahmet Aker, 2016, P 9 INT NATURAL LANG, P61
[2]  
[Anonymous], 2011, J. Mach.Learn. Res.
[3]  
[Anonymous], 2016, TEMPORAL DATA MINING
[4]  
[Anonymous], 2013, COMPUTING RES REPOSI
[5]  
[Anonymous], 2003, P 26 ANN INT ACM SIG
[6]  
[Anonymous], 2010, Introduction to Information Retrieval
[7]  
CALLAN J., 2006, P 2006 INT C DIG GOV, P167, DOI DOI 10.1145/1146598.1146650
[8]  
Chua F. C. T, 2009, TECHNICAL REPORTS
[9]   The PageRank algorithm as a method to optimize swarm behavior through local analysis [J].
Coppola, M. ;
Guo, J. ;
Gill, E. ;
de Croon, G. C. H. E. .
SWARM INTELLIGENCE, 2019, 13 (3-4) :277-319
[10]  
Cover T. M., 2006, ELEMENTS INFORM THEO, V2nd