Equitable Valuation of Crowdsensing for Machine Learning via Game Theory

被引:1
作者
He, Qiangqiang [1 ]
Qiao, Yu [1 ]
Yang, Shang [1 ]
Wang, Chongjun [1 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210046, Peoples R China
来源
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT III | 2021年 / 12939卷
基金
中国国家自然科学基金;
关键词
Game theory; Shapley value; Data valuation;
D O I
10.1007/978-3-030-86137-7_15
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the era of mobile Internet, it has became easier to obtain personal data through crowdsensing platforms, which promotes the development of data-driven machine learning. A fundamental challenge is how to quantify the value of data provided by each worker. In this paper, we use the powerful tool of game theory called Shapely value to solve this challenge. Shapley value is a classic concept in game theory and can satisfy the equitable valuation of data. However, the calculation of Shapley value is exponentially related to the number of workers. Worse still, in the deep learning model, the time cost of retraining the model and evaluating the contribution of each worker's data to the model is unacceptable. Therefore, we propose two algorithms based on Monte Carlo and batch gradient descent to approximate Shapley value in machine learning and deep learning. We take K-fold validation as the benchmark, and prove that our proposed algorithms can reduce the time overhead while ensuring lower error in the experiment. Finally, we find that it can provide better insight into the labor value of each worker in specific learning tasks.
引用
收藏
页码:133 / 141
页数:9
相关论文
共 10 条
  • [1] Cohen S, 2005, 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), P665
  • [2] Cook R. D., 1982, RESIDUALS INFLUENCE
  • [3] DETECTION OF INFLUENTIAL OBSERVATION IN LINEAR-REGRESSION
    COOK, RD
    [J]. TECHNOMETRICS, 1977, 19 (01) : 15 - 18
  • [4] Ghorbani A, 2019, PR MACH LEARN RES, V97
  • [5] Jia R, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P3693
  • [6] Koh PW, 2017, PR MACH LEARN RES, V70
  • [7] Cooperative game theoretic centrality analysis of terrorist networks: The cases of Jemaah Islamiyah and Al Qaeda
    Lindelauf, R. H. A.
    Hamers, H. J. M.
    Husslage, B. G. M.
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2013, 229 (01) : 230 - 238
  • [8] Lundberg SM, 2017, ADV NEUR IN, V30
  • [9] Lundberg SM, 2018, Consistent Individualized Feature Attribution for Tree Ensembles
  • [10] Time-consistent Shapley value allocation of pollution cost reduction
    Petrosjan, L
    Zaccour, G
    [J]. JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 2003, 27 (03) : 381 - 398