A Reinforcement Learning-Based Incentive Mechanism for Task Allocation Under Spatiotemporal Crowdsensing

被引:9
作者
Jiang, Kaige [1 ]
Wang, Yingjie [1 ]
Wang, Haipeng [2 ]
Liu, Zhaowei [1 ]
Han, Qilong [3 ]
Zhou, Ao [4 ]
Xiang, Chaocan [5 ]
Cai, Zhipeng [6 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
[2] Naval Aviat Univ, Inst Informat Fus, Yantai 264005, Peoples R China
[3] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
[4] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing, Peoples R China
[5] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[6] Georgia State Univ, Dept Comp Sci, Atlanta, GA USA
基金
中国国家自然科学基金;
关键词
Task analysis; Resource management; Crowdsensing; Technological innovation; Optimization; Time factors; Computer science; Incentive mechanism; Q-learning; spatiotemporal crowdsensing (SC); task allocation; MOBILE; ASSIGNMENT; SELECTION; DESIGN;
D O I
10.1109/TCSS.2023.3263821
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
With the development of the Industrial Internet of Things (IoT), the work of large-scale data collection makes spatiotemporal crowdsensing (SC) play an important role. Mobile devices equipped with sensors could act as workers to collect and process data for uploading. In the task allocation process, a fully static allocation fails to meet the needs of realistic conditions, while a completely dynamic allocation fails to achieve the desired results. Therefore, we assume a task-scheduled execution scenario that combines the above two conditions. In the pre-allocation process, an original time location constraints (ORTA) allocation algorithm is first proposed. Then it is optimized (OPTA) to fully utilize the remaining time of the workers and increase the matched number. In addition, the design of the incentive mechanism is an effective means to improve the task completion rate of the platform. To efficiently utilize the limited platform budget in the long run, a Q-learning-based algorithm is proposed to identify target inspire tasks and subsequently increase their reward to attract workers' participation. Finally, comparison experiments are conducted on real datasets to verify the effectiveness of our algorithm. Furthermore, the experiments on a Raspberry Pi local terminal are conducted under a satellite-based environment.
引用
收藏
页码:2179 / 2189
页数:11
相关论文
共 45 条
[1]  
[Anonymous], GMISSION DATASET
[2]  
[Anonymous], EVERYSENDER DAT
[3]   Trading Private Range Counting over Big IoT Data [J].
Cai, Zhipeng ;
He, Zaobo .
2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, :144-153
[4]   A Private and Efficient Mechanism for Data Uploading in Smart Cyber-Physical Systems [J].
Cai, Zhipeng ;
Zheng, Xu .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2020, 7 (02) :766-775
[5]   Collective Data-Sanitization for Preventing Sensitive Information Inference Attacks in Social Networks [J].
Cai, Zhipeng ;
He, Zaobo ;
Guan, Xin ;
Li, Yingshu .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2018, 15 (04) :577-590
[6]   Game Theory in Internet of Things: A Survey [J].
Chi, Chuanxiu ;
Wang, Yingjie ;
Tong, Xiangrong ;
Siddula, Madhuri ;
Cai, Zhipeng .
IEEE INTERNET OF THINGS JOURNAL, 2021, 9 (14) :12125-12146
[7]   Intelligent Delay-Aware Partial Computing Task Offloading for Multiuser Industrial Internet of Things Through Edge Computing [J].
Deng, Xiaoheng ;
Yin, Jian ;
Guan, Peiyuan ;
Xiong, Neal N. ;
Zhang, Lan ;
Mumtaz, Shahid .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (04) :2954-2966
[8]  
Dickerson JP, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P318
[9]   Dynamic Delayed-Decision Task Assignment Under Spatial-Temporal Constraints in Mobile Crowdsensing [J].
Ding, Yu ;
Zhang, Lichen ;
Guo, Longjiang .
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (04) :2418-2431
[10]  
Howe J, 2006, WIRED MAGAZINE, V14, P1, DOI DOI 10.1086/599595