A Novel Approach to Select High-Reward Data Items in Big Data Stream Based on Multiarmed Bandit

被引:2
|
作者
Wang, Shun [1 ]
Zeng, Guosun [1 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Minist Educ, Embedded Syst & Serv Comp Key Lab, Shanghai 201804, Peoples R China
来源
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS | 2022年 / 9卷 / 04期
基金
中国国家自然科学基金;
关键词
Big Data; Real-time systems; Data models; Probability; Resource management; Program processors; Indexes; Big data stream; data item value; multiarmed bandit; real-time processing; reinforcement learning; selection policy;
D O I
10.1109/TCSS.2021.3114352
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Mining a big data stream with continuous, unbounded, and time-varying data items is a great challenge. In the situation where computational resources for real-time processing are limited, it is especially hard to select high-value data items from a big data stream. This article studies on selection policies based on the multiarmed bandit. We cache the online arriving data items in different buffers according to the characteristics of data items. These buffers are regarded as the arms of a multiarmed bandit. We pay attention to several key factors in selecting data items including the data item value, processing time, resource consumption, and loss value caused by some discarded items. Thus, a comprehensive reward mechanism for each data item is given as the foundation for the selection of data items, that is, gambling decision-making. We design three selection policies: the improved epsilon -greedy, the improved upper confidence bound (UCB), and a data item selection policy named dynamic high-reward incentive (DHRI) with active, dynamic, and incentive reward. They are all trying to balance ``exploitation and exploration'' in a multiarmed bandit. Experimental results show that our proposed approach is effective and outperforms the traditional methods.
引用
收藏
页码:1144 / 1153
页数:10
相关论文
共 50 条
  • [1] A Novel Intelligent Clustering Approach for High Dimensional Data in a Big Data Environment
    Tao, Qian
    Wang, Zhenyu
    Gu, Chunqin
    Chen, Wenyuan
    Lin, Weiqiang
    Lin, Haojie
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [2] Data fusion in automotive applicationsEfficient big data stream computing approach
    Amir Haroun
    Ahmed Mostefaoui
    François Dessables
    Personal and Ubiquitous Computing, 2017, 21 : 443 - 455
  • [3] Data fusion in automotive applications Efficient big data stream computing approach
    Haroun, Amir
    Mostefaoui, Ahmed
    Dessables, Francois
    PERSONAL AND UBIQUITOUS COMPUTING, 2017, 21 (03) : 443 - 455
  • [4] A Software Chain Approach to Big Data Stream Processing and Analytics
    Xhafa, Fatos
    Naranjo, Victor
    Caballe, Santi
    Barolli, Leonard
    2015 9TH INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT, AND SOFTWARE INTENSIVE SYSTEMS CISIS 2015, 2015, : 179 - 186
  • [5] A Novel Approach for Big Data Classification and Transportation in Rail Networks
    Saki, Mahdi
    Abolhasan, Mehran
    Lipman, Justin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1239 - 1249
  • [6] High Performance and High Availability Archived Stream System for Big Data
    Miao, Jiajia
    Chen, Guoyou
    Du, Kai
    Fang, Xuelin
    INFORMATION TECHNOLOGY APPLICATIONS IN INDUSTRY, PTS 1-4, 2013, 263-266 : 2792 - +
  • [7] A Novel Approach for Deciphering Big Data Value Using Dark Data
    Bhatia, Surbhi
    Alojail, Mohammed
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 33 (02) : 1261 - 1271
  • [8] A Novel Task Provisioning Approach Fusing Reinforcement Learning for Big Data
    Cheng, Yongyi
    Xu, Gaochao
    IEEE ACCESS, 2019, 7 : 143699 - 143709
  • [9] A Big Data Stream-Driven Risk Recognition Approach for Hospital Accounting Management Systems
    Wang, Yining
    Liang, Bin
    Wang, Tian
    Liu, Zihua
    IEEE ACCESS, 2023, 11 : 130089 - 130101
  • [10] Game theoretic approach of a novel decision policy for customers based on big data
    Shasha Liu
    Bingjia Shao
    Yuan Gao
    Su Hu
    Yi Li
    Weigui Zhou
    Electronic Commerce Research, 2018, 18 : 225 - 240