Collaborative filtering system based on classification and extended K-means algorithm

被引:0
作者
Wu Y.K. [1 ]
Tang Z.H. [1 ]
机构
[1] School of Information, Zhejiang University of Finance and Economic
来源
Advances in Information Sciences and Service Sciences | 2011年 / 3卷 / 07期
关键词
Classification; Clustering; Collaborative filtering (CF); K-means; Similarity;
D O I
10.4156/aiss.vol3.issue7.22
中图分类号
学科分类号
摘要
Collaborative filtering (CF) is one of the most successful recommending techniques. With the tremendous growth in the number of users and items, however, the system encounters two key challenges, decreased recommending quality and increased response time. New technologies are urgently needed to deal with such large-scale problems. To address these issues, we suggest constructing the item category system based on the user-item rating matrix, calculating the similarity between items and classes, extracting the neighbor-class set, and predicting user scores based on such neighbor-sets. Because the dimension of the item classes is far smaller than the one of the items, the algorithm' computational speed is enormously enhanced. To mitigate the harmful effects on the system's predicting accuracy given by item-class based algorithm, the paper puts forward clustering after classification and extended K-means algorithm to construct the items' accurate category system. The experimental results indicate that classification and extended K-means algorithm have brought promising effects on the system, which ensure considerable predicting accuracy, while in the meantime, provide dramatically better performance than traditional item-based CF. So, the algorithm is a good choice for large-scale recommendation system.
引用
收藏
页码:187 / 194
页数:7
相关论文
共 50 条
  • [41] Clustering analyzing of undergraduate schools based on k-means algorithm
    Yang, Juan
    2017 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2017, : 309 - 311
  • [42] Covering Based Refined Rough K-Means Algorithm.
    Prabhavathy, P.
    Tripathy, B. K.
    Sundaram, Venkatesan Meenakshi
    RESEARCH JOURNAL OF PHARMACEUTICAL BIOLOGICAL AND CHEMICAL SCIENCES, 2016, 7 (05): : 2142 - 2151
  • [43] Silhouette coefficient-based weighting k-means algorithm
    Huixia Lai
    Tao Huang
    BinLong Lu
    Shi Zhang
    Ruliang Xiaog
    Neural Computing and Applications, 2025, 37 (5) : 3061 - 3075
  • [44] An efficient K-means clustering algorithm based on influence factors
    Leng, Mingwei
    Tang, Haitao
    Chen, Xiaoyun
    SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 2, PROCEEDINGS, 2007, : 815 - +
  • [45] An Improved K-means Clustering Algorithm Based on Hadoop Platform
    Hou, Xiangru
    CYBER SECURITY INTELLIGENCE AND ANALYTICS, 2020, 928 : 1101 - 1109
  • [46] A Novel Framework for Classification of Syncope Disease using K-Means Clustering Algorithm
    Guftar, Madiha
    Raja, Ammar Asjad
    Ali, Syed Hasnain
    Qamar, Usman
    2015 SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2015, : 127 - 132
  • [47] The fast clustering algorithm for the big data based on K-means
    Xie, Ting
    Zhang, Taiping
    INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2020, 18 (06)
  • [48] K-means Clustering Algorithm Based on Kernel Fisher Discrimination
    Peng, Chensong
    Li, Zhong
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (11A): : 4641 - 4646
  • [49] A Novel K-Means based Clustering Algorithm for Big Data
    Sinha, Ankita
    Jana, Prasanta K.
    2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 1875 - 1879
  • [50] An Improved K-means Clustering Algorithm Based on Normal Matrix
    Tian Shengwen
    Zhao Yongsheng
    Wang Yilei
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON TEST AUTOMATION AND INSTRUMENTATION, VOL 4, 2008, : 2182 - 2185