Integrating Sentiment Analysis on Hybrid Collaborative Filtering Method in a Big Data Environment

被引：7

作者：

Sundari, P. Shanmuga ^{[1
]}

Subaji, M. ^{[2
]}

机构：

[1] VIT Univ, Sch Comp Sci & Engn, Vellore 632014, Tamil Nadu, India

[2] VIT Univ, CHIP, Vellore 632014, Tamil Nadu, India

来源：

INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING | 2020年 / 19卷 / 02期

关键词：

Big data; sentiment analysis; hybrid collaborative filtering model; apache's spark; opinion bias; RECOMMENDATION; FACTORIZATION; ALGORITHMS;

D O I：

10.1142/S0219622020500108

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most of the traditional recommendation systems are based on user ratings. Here, users provide the ratings towards the product after use or experiencing it. Accordingly, the user item transactional database is constructed for recommendation. The rating based collaborative filtering method is well known method for recommendation system. This system leads to data sparsity problem as the user is unaware of other similar items. Web cataloguing service such as tags plays a significant role to analyse the user's perception towards a particular product. Some system use tags as additional resource to reduce the data sparsity issue. But these systems require lot of specific details related to the tags. Existing system either focuses on ratings or tags based recommendation to enhance the accuracy. So these systems suffer from data sparsity and efficiency problem that leads to ineffective recommendations accuracy. To address the above said issues, this paper proposed hybrid recommendation system (Iter_ALS Iterative Alternate Least Square) to enhance the recommendation accuracy by integrating rating and emotion tags. The rating score reveals overall perception of the item and emotion tags reflects user's feelings. In the absence of emotional tags, scores found in rating is assumed as positive or negative emotional tag score. Lexicon based semantic analysis on emotion tags value is adopted to represent the exclusive value of tag. Unified value is represented into iter_ALS model to reduce the sparsity problem. In addition, this method handles opinion bias between ratings and tags. Experiments were tested and verified using a benchmark project of MovieLens dataset. Initially this model was tested with different sparsity levels varied between 0%-100 percent and the results obtained from the experiments shows the proposed method outperforms with baseline methods. Further tests were conducted to authenticate how it handles opinion bias by users before recommending the item. The proposed method is more capable to be adopted in many real world applications

引用

页码：385 / 412

页数：28

共 52 条

[1]

Adeniyi D. A., 2016, Applied Computing and Informatics, V12, P90, DOI 10.1016/j.aci.2014.10.001

[2]

[Anonymous], ADV ARTICIAL INTELLI

[3]

[Anonymous], 2011, PROC RECSYS 2011 WOR

[4]

[Anonymous], GROUPLENS DATASETS M

[5]

[Anonymous], INT J INNOVATIVE TEC

[6]

[Anonymous], ARXIV170300397

[7]

[Anonymous], DIC SENT

[8]

[Anonymous], ARXIV180606192

[9]

Bao LJ, 2012, ELEVENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, P1

[10] Recommender systems survey [J].

Bobadilla, J. ;

Ortega, F. ;

Hernando, A. ;

Gutierrez, A. .

KNOWLEDGE-BASED SYSTEMS, 2013, 46 :109-132

← 1 2 3 4 5 6 →