ClubCF: A Clustering-Based Collaborative Filtering Approach for Big Data Application

Cited by: 33
Authors
Hu, Rong [1, 2]
Dou, Wanchun [1 ]
Liu, Jianxun [2 ]
Affiliations
[1] Nanjing Univ, Dept Comp Sci & Technol, State Key Lab Novel Software Technol, Nanjing 210093, Jiangsu, Peoples R China
[2] Hunan Univ Sci & Technol, Key Lab Knowledge Proc & Networked Mfg, Xiangtan 411201, Peoples R China
Funding
National Science Foundation (USA);
Keywords
Big data application; cluster; collaborative filtering; mashup; SERVICE; ALGORITHMS; SELECTION;
DOI
10.1109/TETC.2014.2310485
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Spurred by service computing and cloud computing, an increasing number of services are emerging on the Internet. As a result, service-relevant data have become too large to be processed effectively by traditional approaches. To address this challenge, this paper proposes a clustering-based collaborative filtering approach that groups similar services into the same clusters and recommends services collaboratively. The approach has two stages. In the first stage, the available services are logically divided into small-scale clusters for further processing. In the second stage, a collaborative filtering algorithm is applied to one of the clusters. Since the number of services in a cluster is much smaller than the total number of services available on the web, this is expected to reduce the online execution time of collaborative filtering. Finally, several experiments are conducted to validate the approach on a real data set of 6225 mashup services collected from ProgrammableWeb.
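The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the clustering algorithm, the similarity measures, or the data layout, so everything here is an assumption made for illustration. Services are assumed to be described by tag sets, stage 1 uses a simple greedy grouping by Jaccard similarity with a hypothetical `threshold`, and stage 2 uses item-based collaborative filtering with cosine similarity restricted to one cluster. All service names, users, and ratings are made up.

```python
from math import sqrt

def jaccard(a, b):
    """Jaccard similarity between two tag sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cluster_services(tags, threshold=0.5):
    """Stage 1 (assumed stand-in): greedy single-pass clustering.

    A service joins the first cluster whose seed (first member) is
    similar enough; otherwise it starts a new cluster.
    """
    clusters = []
    for name in tags:
        for cluster in clusters:
            if jaccard(tags[name], tags[cluster[0]]) >= threshold:
                cluster.append(name)
                break
        else:
            clusters.append([name])
    return clusters

def cosine(r1, r2):
    """Cosine similarity over the users who rated both services."""
    common = r1.keys() & r2.keys()
    if not common:
        return 0.0
    dot = sum(r1[u] * r2[u] for u in common)
    n1 = sqrt(sum(r1[u] ** 2 for u in common))
    n2 = sqrt(sum(r2[u] ** 2 for u in common))
    return dot / (n1 * n2) if n1 and n2 else 0.0

def predict(user, target, cluster, ratings):
    """Stage 2: item-based CF using only services in the target's cluster."""
    num = den = 0.0
    for other in cluster:
        if other == target or user not in ratings.get(other, {}):
            continue
        sim = cosine(ratings.get(target, {}), ratings[other])
        if sim > 0:
            num += sim * ratings[other][user]
            den += sim
    return num / den if den else None  # None: no usable neighbor

# Hypothetical toy data: tag sets per service and user ratings per service.
tags = {
    "map_mashup": {"map", "travel"},
    "trip_planner": {"map", "travel", "hotel"},
    "photo_wall": {"photo", "social"},
    "photo_share": {"photo", "social", "video"},
}
ratings = {
    "map_mashup": {"alice": 5, "bob": 3},
    "trip_planner": {"bob": 4},
    "photo_wall": {"carol": 2},
    "photo_share": {"carol": 4},
}

clusters = cluster_services(tags)
travel_cluster = next(c for c in clusters if "trip_planner" in c)
prediction = predict("alice", "trip_planner", travel_cluster, ratings)
print(clusters, prediction)
```

Because `predict` only loops over the members of one cluster, the online cost of a prediction depends on the cluster size rather than on the total number of services, which is the speed-up the abstract anticipates.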
Pages: 302-313
Page count: 12