A distributed real-time recommender system for big data streams

Cited by: 4
Authors
Hazem, Heidy [1 ,5 ]
Awad, Ahmed [2 ,3 ,4 ]
Yousef, Ahmed Hassan [1 ,5 ]
Affiliations
[1] Nile Univ, Giza, Egypt
[2] Tartu Univ, Tartu, Estonia
[3] Cairo Univ, Giza, Egypt
[4] Narva Rd 18 Tartu City, Tartu Cty, EE-51009 Tartu, Estonia
[5] Juhayna Sq,26th July Corridor, Giza, Egypt
Keywords
Streaming; Big data; Online recommender systems; Matrix factorization
DOI
10.1016/j.asej.2022.102026
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Recommender Systems (RS) play a crucial role in our lives. As users become continuously connected to the internet, they are less tolerant of obsolete recommendations made by an RS. An online RS has to address three requirements: continuous training and recommendation, handling concept drift, and the ability to scale. Streaming RS proposed in the literature address only the first two requirements because they run the training process on a single machine. To address the third requirement, we propose a Splitting and Replication mechanism for distributed streaming RS. Our mechanism is inspired by the shared-nothing architecture that underpins contemporary big data processing systems. We have applied our mechanism to two well-known approaches for online RS, namely matrix factorization and item-based collaborative filtering. We conducted experiments comparing its performance with a single-machine baseline. Across different data sets, the experiments show an online recall improvement of 40% with more than 50% less memory consumption. (c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
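The abstract names the ingredients (online matrix factorization, a shared-nothing split of the stream, and online recall) without giving their details. The Python sketch below only illustrates how those pieces might fit together. It is not the paper's Splitting and Replication mechanism: the `route` rule (pinning each user to one worker), the `IncrementalMF` class, and all hyperparameters are assumptions made for illustration, and replication is not modeled at all. Recall is measured prequentially, that is, each event is first used to test the model and only then to train it.

```python
import random


class IncrementalMF:
    """Illustrative incremental SGD matrix factorization (positive-only feedback)."""

    def __init__(self, k=8, lr=0.05, reg=0.01, seed=0):
        self.k, self.lr, self.reg = k, lr, reg
        self.rng = random.Random(seed)
        self.users, self.items = {}, {}

    def _vec(self, table, key):
        # Lazily create a small random latent vector for an unseen id.
        if key not in table:
            table[key] = [self.rng.gauss(0.0, 0.1) for _ in range(self.k)]
        return table[key]

    def score(self, user, item):
        p, q = self._vec(self.users, user), self._vec(self.items, item)
        return sum(a * b for a, b in zip(p, q))

    def recommend(self, user, n):
        # Rank only the items this worker has already seen.
        ranked = sorted(self.items, key=lambda i: self.score(user, i), reverse=True)
        return ranked[:n]

    def update(self, user, item):
        # One SGD step toward a target score of 1.0 for the observed pair.
        p, q = self._vec(self.users, user), self._vec(self.items, item)
        err = 1.0 - sum(a * b for a, b in zip(p, q))
        for f in range(self.k):
            pf, qf = p[f], q[f]
            p[f] += self.lr * (err * qf - self.reg * pf)
            q[f] += self.lr * (err * pf - self.reg * qf)


def route(user, n_workers):
    # Hypothetical routing rule (an assumption, not from the paper):
    # a stable character-sum hash pins every user to a single worker.
    return sum(map(ord, str(user))) % n_workers


def prequential_recall(stream, n_workers=2, top_n=3):
    # Each worker owns its own model and shares no state with the others.
    workers = [IncrementalMF(seed=w) for w in range(n_workers)]
    hits = total = 0
    for user, item in stream:
        model = workers[route(user, n_workers)]
        # Test first: was the item in the top-N before the model saw this event?
        hits += item in model.recommend(user, top_n)
        total += 1
        # Then train on the same event.
        model.update(user, item)
    return hits / total if total else 0.0


if __name__ == "__main__":
    events = [("u1", "a"), ("u2", "b"), ("u1", "b"), ("u2", "a"), ("u1", "a")]
    print(prequential_recall(events, n_workers=2, top_n=2))
```

Because each worker in this sketch sees only the users routed to it, it holds item vectors only for the items those users touched. That is one way a split can reduce per-machine state, though the memory and recall figures reported in the abstract come from the authors' own mechanism and experiments, not from this sketch.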
Pages: 16