A Novel Item Cluster-Based Collaborative Filtering Recommendation System

被引:3
作者
Lu, Yuching [1 ]
Tozuka, Koki [1 ]
Chakraborty, Goutam [1 ]
Matsuhara, Masafumi [1 ]
机构
[1] Iwate Prefectural Univ, Fac Software & Informat Sci, Sugo 152-52, Takizawa, Iwate 0200693, Japan
关键词
Adjacency matrix; Similarity metrics; Fractional norm; Spectral clustering; Cluster evaluation; EFFICIENT;
D O I
10.1007/s12626-021-00084-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent exponential expansion of users adopting to applications on the mobile internet, like e-commerce and social networks, warrants mining of the huge data collected from users' past actions, for improving businesses and services. The core step for mining is to cluster the data meaningfully, conforming to the application. Social network data are structured, and graphical presentation reveals that structure. Therefore, graph clustering is an effective way to divulge the underlying structure in the data. For clustering, calculating similarity between a pair of vectors is the first step. The large dimension of the data, which is often noisy and sparse, makes distance measurement hard. In high dimension, most of the conventional distance metrics fail to work, as the data points are distributed over the surface of the high-dimensional hyper-space. The traditional concept of similarity, and nearest-neighbor does not hold. The variance of distance between any pair of points shrinks as the dimension increases. In this work, we investigate the efficacy of various similarity measures and clustering algorithms on high dimensional data. We experimented with a real-world high-dimensional matrix data, the ratings of movies by users. Clustering of movie items depends on a number of factors like movie genre, actors, directors, prominent acclaimed movie or an obscure one, etc. Different similarity measurements and clustering algorithms were experimented. Clustering results were evaluated by matching with known annotations of the movies. Finally, we proposed a novel recommendation algorithm based on item clustering. Its performance was evaluated with different distance metrics and clustering algorithms. Methods elaborated are applicable to other structured data generated in social network applications, or in biological investigations.
引用
收藏
页码:327 / 346
页数:20
相关论文
共 46 条
[11]  
Defferrard M, 2016, ADV NEUR IN, V29
[12]   A Novel K-medoids clustering recommendation algorithm based on probability distribution for collaborative filtering [J].
Deng, Jiangzhou ;
Guo, Junpeng ;
Wang, Yong .
KNOWLEDGE-BASED SYSTEMS, 2019, 175 :96-106
[13]   A scalable collaborative filtering framework based on co-clustering [J].
George, T ;
Merugu, S .
FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, :625-628
[14]   Mining User Interest Change for Improving Collaborative Filtering [J].
Gong, SongJie ;
Cheng, GuangHua .
2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION, VOL III, PROCEEDINGS, 2008, :24-27
[15]  
Govaert G., 2013, COCLUSTERING, DOI [10.1002/9781118649480, DOI 10.1002/9781118649480]
[16]   Recommendation systems: Principles, methods and evaluation [J].
Isinkaye, F. O. ;
Folajimi, Y. O. ;
Ojokoh, B. A. .
EGYPTIAN INFORMATICS JOURNAL, 2015, 16 (03) :261-273
[17]   Data clustering: A review [J].
Jain, AK ;
Murty, MN ;
Flynn, PJ .
ACM COMPUTING SURVEYS, 1999, 31 (03) :264-323
[18]   A New Clustering Method For Collaborative Filtering [J].
Jia Rongfei ;
Jin Maozhong ;
Liu Chao .
2010 INTERNATIONAL CONFERENCE ON NETWORKING AND INFORMATION TECHNOLOGY (ICNIT 2010), 2010, :488-492
[19]  
Khusro S., 2016, Information Science and Applications (ICISA) 2016, P1179, DOI DOI 10.1007/978-981-10-0557-2_112
[20]   A recommender system using GA K-means clustering in an online shopping market [J].
Kim, Kyoung-jae ;
Ahn, Hyunchul .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (02) :1200-1209