A Clustering Algorithm Based on Weighted Distance and User Preference of Incorporating Time Factors

被引:45
作者
Li, Wenjie [1 ]
Xue, Hua [1 ]
机构
[1] Tianjin Univ Technol, Tianjin, Peoples R China
来源
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: IOT AND SMART CITY (ICIT 2018) | 2018年
关键词
Time factor; user preference; weighted distance; TF-IDF algorithm; K-Means clustering;
D O I
10.1145/3301551.3301564
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Due to the advent of the era of big data, the network information resources are exploding, which not only makes users hard to find available information but also makes the problem of information overload more serious. It is not difficult to see that the recommendation system is one of the effective measures to improve the above problems. In order to improve the user cold start problem and data sparseness problem, this paper proposes a clustering algorithm based on weighted distance and user preference of incorporating time factors. Firstly, this paper mitigates the user's cold start problem by introducing a user-user attribute matrix that constructed by the user's basic objective features, and the improvement of the sparsity problem is mainly through the introduction of project features. Since the characteristics of the project can reflect the user's preference from the aspect of content, as well as the sum of the project features is far less than the number of projects. Secondly, the user-item attribute total score matrix of the small dimension is obtained by introducing the project feature into the user-item score matrix. Last but not least, the project features are also introduced when constructing the user-project attribute preference matrix with the TF-IDF algorithm. At the same time, we also consider the influence of user interest drifts over time on user preferences. Based on the above three matrices we can get the weighted Euclidean distance and then use the K-Means algorithm for clustering. This article takes the recommendation of a movie as an example. The experimental results on the MovieLens data set show that the proposed algorithm has better quality and performance than other related algorithms.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 13 条
[1]   Using Recommendation Agents to Cope with Information Overload [J].
Aljukhadar, Muhammad ;
Senecal, Sylvain ;
Daoust, Charles-Etienne .
INTERNATIONAL JOURNAL OF ELECTRONIC COMMERCE, 2012, 17 (02) :41-70
[2]  
An Zeng, 2017, COMPUTER SCI, V44, P243
[3]  
Aygün C, 2016, 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), P1025, DOI 10.1109/SIU.2016.7495917
[4]  
Bo Y U, 2017, COMPUTER SYSTEMS APP
[5]   A Recommendation System to Facilitate Business Process Modeling [J].
Deng, Shuiguang ;
Wang, Dongjing ;
Li, Ying ;
Cao, Bin ;
Yin, Jianwei ;
Wu, Zhaohui ;
Zhou, Mengchu .
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (06) :1380-1394
[6]  
[樊宁 Fan Ning], 2011, [计算机仿真, Computer Simulation], V28, P369
[7]  
He M., 2017, COMPUTER SCI, V44, P391
[8]  
Kong Weiliang, 2013, RES KEY ISSUES COLLA
[9]  
Lei Zhang, 2014, RES RECOMMENDATION A
[10]  
Ming H E, 2017, COMPUTER SCI