A distributed real-time recommender system for big data streams

被引:4
|
作者
Hazem, Heidy [1 ,5 ]
Awad, Ahmed [2 ,3 ,4 ]
Yousef, Ahmed Hassan [1 ,5 ]
机构
[1] Nile Univ, Giza, Egypt
[2] Tartu Univ, Tartu, Estonia
[3] Cairo Univ, Giza, Egypt
[4] Narva Rd 18 Tartu City, Tartu Cty, EE-51009 Tartu, Estonia
[5] Juhayna Sq,26th July Corridor, Giza, Egypt
关键词
Streaming; Big data; Online Recommender Systems; MATRIX FACTORIZATION;
D O I
10.1016/j.asej.2022.102026
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recommender Systems (RS) play a crucial role in our lives. As users become continuously connected to the internet, they are less tolerant of obsolete recommendations made by an RS. Online RS has to address three requirements: continuous training and recommendation, handling concept drifts, and the ability to scale. Streaming RS proposed in the literature address the first two requirements only. That is because they run the training process on a single machine. To tackle the third challenge, we propose a Splitting and Replication mechanism for distributed streaming RS. Our mechanism is inspired by the shared-nothing architecture that underpins contemporary big data processing systems. We have applied our mechanism to two well-known approaches for online RS, namely, matrix factorization and item-based collaborative filtering. We conducted experiments comparing the performance with the baseline (single machine). Evaluating different data sets, experiments show online recall improvement by 40% with more than 50% less memory consumption. (c) 2022 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Mapping the Big Data Landscape: Technologies, Platforms and Paradigms for Real-Time Analytics of Data Streams
    Dubuc, Timothee
    Stahl, Frederic
    Roesch, Etienne B.
    IEEE ACCESS, 2021, 9 : 15351 - 15374
  • [2] Real-time prediction of accident using Big data system
    Tantaoui, Mouad
    Laanaoui, My Driss
    Kabil, Mustapha
    3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,
  • [3] An ML Based Anomaly Detection System in real-time data streams
    Diaz Rivera, Javier Jose
    Khan, Talha Ahmed
    Akbar, Waleed
    Afaq, Muhammad
    Song, Wang-Cheol
    2021 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2021), 2021, : 1329 - 1334
  • [4] A survey on data stream, big data and real-time
    Gomes E.H.A.
    Plentz P.D.M.
    De Rolt C.R.
    Dantas M.A.R.
    International Journal of Networking and Virtual Organisations, 2019, 20 (02) : 143 - 167
  • [5] Big Data Streaming Platforms to Support Real-time Analytics
    Fernandes, Eliana
    Salgado, Ana Carolina
    Bernardino, Jorge
    ICSOFT: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES, 2020, : 426 - 433
  • [6] Distributed in-memory vocabulary tree for real-time retrieval of big data images
    Duan, Hancong
    Peng, Yubing
    Min, Geyong
    Xiang, Xiaoke
    Zhan, Wenhan
    Zou, Hao
    AD HOC NETWORKS, 2015, 35 : 137 - 148
  • [7] Real-Time Signal Identification in Big Data Streams Bragg-Spot Localization in Photon Science
    Becker, Daniel
    Streit, Achim
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2015), 2015, : 611 - 616
  • [8] A Comparative Study on Performance and Resource Utilization of Real-time Distributed Messaging Systems for Big Data
    Intorruk, Somprasong
    Numnonda, Thanisa
    2019 20TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2019, : 102 - 107
  • [9] Design of a Real-Time Big Data Analytic Scheme for Continuous Monitoring with a Distributed Acoustic Sensor
    Nur, Abdusomad
    Di Pasquale, Fabrizio
    Muanenda, Yonas
    SPIE FUTURE SENSING TECHNOLOGIES 2023, 2023, 12327
  • [10] Real-Time DDoS Attack Detection System Using Big Data Approach
    Awan, Mazhar Javed
    Farooq, Umar
    Babar, Hafiz Muhammad Aqeel
    Yasin, Awais
    Nobanee, Haitham
    Hussain, Muzammil
    Hakeem, Owais
    Zain, Azlan Mohd
    SUSTAINABILITY, 2021, 13 (19)