A distributed real-time recommender system for big data streams

被引:4
作者
Hazem, Heidy [1 ,5 ]
Awad, Ahmed [2 ,3 ,4 ]
Yousef, Ahmed Hassan [1 ,5 ]
机构
[1] Nile Univ, Giza, Egypt
[2] Tartu Univ, Tartu, Estonia
[3] Cairo Univ, Giza, Egypt
[4] Narva Rd 18 Tartu City, Tartu Cty, EE-51009 Tartu, Estonia
[5] Juhayna Sq,26th July Corridor, Giza, Egypt
关键词
Streaming; Big data; Online Recommender Systems; MATRIX FACTORIZATION;
D O I
10.1016/j.asej.2022.102026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recommender Systems (RS) play a crucial role in our lives. As users become continuously connected to the internet, they are less tolerant of obsolete recommendations made by an RS. Online RS has to address three requirements: continuous training and recommendation, handling concept drifts, and the ability to scale. Streaming RS proposed in the literature address the first two requirements only. That is because they run the training process on a single machine. To tackle the third challenge, we propose a Splitting and Replication mechanism for distributed streaming RS. Our mechanism is inspired by the shared-nothing architecture that underpins contemporary big data processing systems. We have applied our mechanism to two well-known approaches for online RS, namely, matrix factorization and item-based collaborative filtering. We conducted experiments comparing the performance with the baseline (single machine). Evaluating different data sets, experiments show online recall improvement by 40% with more than 50% less memory consumption. (c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
引用
收藏
页数:16
相关论文
共 50 条
[21]   Real-Time Semiparametric Regression for Distributed Data Sets [J].
Luts, Jan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (02) :545-557
[22]   A Survey on Real-time Big Data Analytics: Applications and Tools [J].
Yadranjiaghdam, Babak ;
Pool, Nathan ;
Tabrizi, Nasseh .
2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE & COMPUTATIONAL INTELLIGENCE (CSCI), 2016, :404-409
[23]   Big Data Stream Computing in Healthcare Real-Time Analytics [J].
Ta, Van-Dai ;
Liu, Chuan-Ming ;
Nkabinde, Goodwill Wandile .
PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, :37-42
[24]   An incremental approach for real-time Big Data visual analytics [J].
Garcia, Ignacio ;
Casado, Ruben ;
Bouchachia, Abdelhamid .
2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, :177-182
[25]   Towards of a Real-time Big Data Architecture to Intensive Care [J].
Goncalves, Andre ;
Portela, Filipe ;
Santos, Manuel Filipe ;
Rua, Fernando .
8TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN 2017) / 7TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2017) / AFFILIATED WORKSHOPS, 2017, 113 :585-590
[26]   A Big Data Architecture for Near Real-time Traffic Analytics [J].
Gong, Yikai ;
Rimba, Paul ;
Sinnott, Richard O. .
COMPANION PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC'17 COMPANION), 2017, :157-162
[27]   Big Data Analytics Architecture for Real-Time Traffic Control [J].
Amini, Sasan ;
Gerostathopoulos, Ilias ;
Prehofer, Christian .
2017 5TH IEEE INTERNATIONAL CONFERENCE ON MODELS AND TECHNOLOGIES FOR INTELLIGENT TRANSPORTATION SYSTEMS (MT-ITS), 2017, :710-715
[28]   Railway Big Data Real-time Processing Based on Storm [J].
Guo, Shihang ;
Zhang, Lichen .
PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 :536-539
[29]   Real-time positioning of a specific object in the big data environment [J].
Zhu, Hejun ;
Zhu, Liehuang .
EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2018,
[30]   A Comparative Performance of Real-time Big Data Analytic Architectures [J].
Sanla, Apisit ;
Numnonda, Thanisa .
PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, :674-678